GitHub Archive Program: Preserving Source Codes in the Arctic

The GitHub Archive Program is a mission to preserve open-source software for future generations by storing your code in the GitHub Arctic Code Vault, an archive built to last a thousand years.

The code’s journey began at Piql’s facility in Drammen, Norway, from where boxes holding 186 film reels were shipped to Oslo Airport and loaded into the belly of the plane that provides passenger service to Svalbard.

Svalbard, roughly 600 miles (1000 km) north of the European mainland, just recently opened up to visitors from countries within the Schengen Area and the European Economic Area.

The code landed in Longyearbyen, a town of a few thousand people on Svalbard, where the boxes were met by a local logistics company and taken into intermediate secure storage overnight.

The next morning, the boxes traveled to a decommissioned coal mine set in the mountain, and then to a chamber deep inside hundreds of meters of permafrost, where the code now resides, fulfilling its mission of preserving the world’s open-source code for over 1,000 years.

Millions of developers around the world contributed to the open-source software stored in the Arctic Code Vault. To recognize and celebrate these contributions, the Arctic Code Vault Badge was created; it is shown in the highlights section of a developer’s profile on GitHub.

Collaborators

The Internet Archive is a well-known, widely beloved non-profit digital library that provides free public access to collections of digitized materials.

In partnership with the GitHub Archive Program, the Internet Archive (IA) commenced its ongoing archive of GitHub public repositories on April 13 of this year.

At present, IA is using a two-pronged approach.

First, their well-known Wayback Machine is accessing and archiving raw GitHub data as WARCs, or Web ARChive files.

As of this writing, they have archived some 55TB of data.

Second, they have the goal of making entire archived GitHub repositories available via “git clone,” while also keeping repo comments, issues, and other metadata easily accessible on the web.

This second initiative is well underway and initial archiving is expected to commence this month.

Software Heritage is a nonprofit, multi-stakeholder initiative launched by Inria in collaboration with UNESCO, with the goal of collecting, preserving, and sharing the source code of our software commons.

They already archive more than 130 million projects, with their full development history, and we are delighted to announce that 100 million of these are from GitHub.

Thanks to the collaboration announced at GitHub Universe 2019, the archival engine is being improved with the goal of keeping up with GitHub’s growth. If the project you are interested in, or its latest version, is not archived yet, you do not need to wait: it is easy to trigger its archival right now in a few clicks at https://save.softwareheritage.org.

Project Silica is developing the first storage technology designed and built from the media up for cloud-scale storage of long-lived data.

By leveraging recent discoveries in ultrafast laser optics, data is stored in quartz glass, through a process that permanently changes the physical structure of the glass material.

Quartz glass is a durable storage medium that offers unparalleled data lifetimes of upwards of tens of thousands of years. It is resilient to electromagnetic interference, water, and heat, making it the ideal storage medium for ensuring the world’s open-source software is preserved for future generations.

As a partner in the GitHub Archive Program, Project Silica is committed to driving storage innovation and developing a sustainable, reliable storage technology for the world’s long-lived data.

They have archived over 6,000 of the world’s most popular repositories as a proof of concept for future archives.

Code, culture, history, and technology: The Tech Tree

Every reel of the archive includes a copy of the “Guide to the GitHub Code Vault” in five languages, written with input from GitHub’s community and available at the Archive Program’s own GitHub repository.

In addition, the archive will include a separate human-readable reel which documents the technical history and cultural context of the archive’s contents. This is called the Tech Tree.

Inspired by the Long Now’s Manual for Civilization, the Tech Tree will consist primarily of existing works, selected to provide a detailed understanding of modern computing, open-source and its applications, modern software development, popular programming languages, etc.

It will also include works that explain the many layers of technical foundations that make software possible: microprocessors, networking, electronics, semiconductors, and even pre-industrial technologies.

This will allow the archive’s inheritors to better understand today’s world and its technologies, and may even help them recreate computers to use the archived software.

Encapsulating the world’s cultural context and technical history is a challenging prospect.

The Tech Tree is expected to evolve and iterate over time.

If you would like to learn more, contact alex.bell@madebychameleon.com.


Irish Tech News



Patrick O Brien
