Data Hoarding - Archives

On this page you will find links to data archives from various countries. These archives contain data that was gathered and saved for the public good.

Topics: Books Climate Culture Gaming Government Health History Law Science Technology War World
Data types: Text Images Videos Software

Health Text

101 Cookbooks

Description:
101 Cookbooks is a food blog from California that archived thousands of healthy recipes, made available for free.

Links:

Science Text

Academic Torrents

Description:
Making over 127.15TB of research data available, this site provides a distributed system for sharing enormous datasets for researchers, by researchers. The result is a scalable, secure, and fault-tolerant repository for data, with blazing fast download speeds.

Links:

War Text Images Videos

Airwars

Description:
Airwars is a not-for-profit transparency watchdog which tracks, assesses, archives and investigates civilian harm claims in conflict-affected nations. Founded in 2014 they are today a leading authority on conflict violence as it affects civilian communities.

Links:

Books Science History Text

Anna's Archive

Description:
Described as the largest truly open library in human history. This site mirrors Sci-Hub and LibGen. They also scrape and open-source Z-Lib, DuXiu, and more. Currently hosting over 42 million books, 98 million papers, preserved forever. All their code and data are completely open source.

Links:

Technology Text

Appropedia

Description:
Appropedia is a site for original research on sustainable development and appropriate technologies. The wiki documents over 4,000 projects.

Links:

Science Text History

Archaeology Data Service

Description:
ADS is the leading accredited repository in the UK for archaeology and historic environment data, with over 25 years of experience supporting research, learning and teaching with free, high quality and dependable digital resources.

Links:

World Text

Archive.today

Description:
Archive.today is a time capsule for web pages! It takes a 'snapshot' of a webpage that will always be online even if the original page disappears. It saves a text and a graphical copy of the page for better accuracy and provides a short and reliable link to an unalterable record of any web page.

Links:

Images Culture Videos

Art UK

Description:
Art UK brings together art from over 3,400 British institutions in one of the UK's biggest-ever arts partnerships. It shows over 600,000 works by over 60,000 artists and is growing all the time.

Links:

Science Text

arXiv

Description:
arXiv is a free distribution service and an open-access archive for nearly 2.4 million scholarly articles in the fields of physics, mathematics, computer science, quantitative biology, quantitative finance, statistics, electrical engineering and systems science, and economics. arXiv is a community of volunteer authors, readers, moderators, advisory board members, supporting members, donors, and third-party collaborators that are supported by the staff at Cornell University.

Links:

History Law Text

Avalon Project

Description:
The Avalon Project by Yale University collects documents in law, history and diplomacy. It provides the full text of important documents from 4,000 BCE to today.

Links:

World Technology Text Images Science

AWS Data Exchange

Description:
AWS Data Exchange makes it easy to find datasets made publicly available through AWS services. Browse available data and learn how to register your own datasets.

Links:

Government Health Climate Science Text

CAFE

Description:
The Convene-Accelerate-Foster-Expand (CAFE) site is an open collection designed to support and enhance global research initiatives focused on understanding and mitigating the health impacts of climate change. It's hosted by Harvard University, Boston University and contains hundreds of datasets, mostly from US Gov web sites.

Links:

Government Law Text

Caselaw Access Project

Description:
The Caselaw Access Project (CAP) scanned the entirety of the Harvard Law School Library's physical collection of American case law and made it machine-readable in a consistent format available online. To facilitate that agreement, the Library Innovation Lab (LIL) maintained the case.law website as the primary access point for the data. CAP includes all official, book-published state and federal United States case law through 2020, every volume or case designated as an official report of decisions by a court within the United States.

Links:

Science Text Software Videos

CERN Open Data

Description:
Explore more than five petabytes of open data from particle physics on this CERN web site.

Links:

History Text Images Videos War

Chartlann Mhileata Military Archives

Description:
The Military Archives offers a diverse range of collections documenting Ireland's military history, including pensions and historical documents.

Links:

Science Text Videos

ChemSpider

Description:
A free chemical structure database providing access to millions of chemical structures and properties, hosted by the Royal Society of Chemistry, London.

Links:

Technology Software Images Videos

CivitAI

Description:
CivitAI is an online platform and marketplace for generative AI content, primarily focused on AI-generated images and models.

Links:

Climate Images Videos Software Text

Climate Data Store

Description:
The C3S Climate Data Store (CDS) is a one-stop shop for information about the climate: past, present and future. It provides easy access to a wide range of climate datasets via a searchable catalogue.

Links:

Government Climate Text

Climate Mirror Project

Description:
The Climate Mirror Project is trying to mirror and safely archive US Gov websites and datasets related to climate, climate change, and global warming. It provides mirrors of official NOAA and other government web sites.

Links:

Text Climate

Climate TRACE

Description:
Climate TRACE provides comprehensive emissions tracking over 662 million emitting assets in 10 sectors over 10+ years. The site is an AI powered platform for realtime GHG emissions reporting, built by a global, not-for-profit coalition of over 100 universities, scientists, and AI experts

Links:

World Text

Common Crawl

Description:
Common Crawl maintains a free, open repository of web crawl data that can be used by anyone. They believe that everyone should have the opportunity to indulge their curiosities, analyze the world, and pursue brilliant ideas. The latest crawl contains over 2.74 billion web pages.

Links:

Government Technology Text Software

Common Vulnerabilities and Exposures (CVE)

Description:
The CVE program identifies, defines, and catalogs publicly disclosed cybersecurity vulnerabilities. There are currently over 274,000 CVE Records accessible through the program. While it depends on US Government funding, there are several alternative databases also available.

Links:

Gaming Software

Console Mods

Description:
This wiki contains information on game console modding and game dumping tools.

Links:

World History Text

Cross-National Time-Series Data

Description:
CNTS provides more than 200 years of annual data from 1815 onward, including 196 demographic, political, legislative, economic and social science variables.

Links:

History Text Images Culture

Cultural Japan

Description:
This site aims to collect information related to Japanese culture published in museums, libraries, etc. around the world, and to provide them with a common and reusable format.

Links:

World Images History

Curationist

Description:
Curationist is a free online resource that brings together arts and culture communities to find, share, collaborate, and reimagine cultural narratives. Since its launch in 2022, Curationist enables global users to search more than 4.4 million images of works from the open access collections of museums and archives worldwide.

Links:

Culture World Text Images

CURIOSity Digital Collections

Description:
CURIOSity Is the online portal of Harvard Library. It provides success to thousands of art pieces, maps, books, photographs and more.

Links:

History Text Images

DAACS

Description:
The Digital Archaeological Archive of Comparative Slavery (DAACS) is a Web-based initiative designed to foster inter-site, comparative archaeological research on slavery throughout the Chesapeake, the Carolinas, and the Caribbean.

Links:

Climate Science Text Images

Data Basin

Description:
Data Basin is a science-based mapping and analysis platform that supports learning, research, and sustainable environmental stewardship. It publishes maps, datasets, visualizations, drawings, & analyses.

Links:

Government

Data Liberation Project

Description:
The Data Liberation Project is an initiative to identify, obtain, reformat, clean, document, publish, and disseminate US Gov datasets of public interest.

Links:

Government Text

Data Lumos

Description:
DataLumos is an ICPSR archive for valuable government data resources. ICPSR has a long commitment to safekeeping and disseminating US government and other social science data. DataLumos accepts deposits of public data resources from the community and recommendations of public data resources that ICPSR itself might add to DataLumos. The site is hosted by the University of Michigan.

Links:

Government Text

Data Rescue Project

Description:
The Data Rescue Project is a coordinated effort among a group of data organizations focusing on rescue-related efforts and data access points for public US governmental data that are currently at risk. It provides resources, collections of datasets and news updates.

Links:

Images World

David Rumsey Map Collection

Description:
The David Rumsey Map Collection was started over 35 years ago and contains more than 200,000 maps. The collection focuses on rare 16th through 21st century maps of North and South America, as well as maps of the World, Asia, Africa, Europe, and Oceania.

Links:

Government Text

Deportation Data Project

Description:
The Deportation Data Project collects and posts public, anonymized U.S. government immigration enforcement datasets. We expect these datasets to be used by journalists, researchers, lawyers, and policymakers.

Links:

History Text Images

Digital Archive Ontario

Description:
Digital Archive Ontario collects digitized items held by Toronto Public Library, including over 100,000 historical photos, maps, postcards & more from Ontario.

Links:

Books Images

Digital Comic Museum

Description:
The Digital Comic Museum claims to be the best site for downloading FREE public domain Golden Age Comics.

Links:

History Books Text Videos

Digital Public Library of America

Description:
The DPLA highlights millions of items from libraries, archives and museums across the United States, organized into easy-to-navigate topics through a single catalog.

Links:

Images Text History Science

DigitalCommons@UNO

Description:
DigitalCommons@UNO is an institutional repository with the goal of collecting and making visible the intellectual output of the University of Nebraska at Omaha. It contains collections about science, technology, historical records, scanned newspapers, community engagement, conferences and events, public safety, and much more.

Links:

Gaming Software

DOS Zone

Description:
Enjoy classic games completely free and without ads on dos.zone! They play in your browser without the need to download anything.

Links:

Technology Software

Drivers Collection

Description:
Drivers Collection is one of largest free web library of device drivers for computer hardware. It contains over 6 million drivers from various hardware vendors.

Links:

Science Text

Dryad

Description:
Dryad is an open data publishing platform and a community committed to the open availability and routine re-use of all research data. Their multi-stakeholder community of academic and research institutions, research funders, scholarly societies and publishers is committed to leading in best practices for open data sharing and reuse.

Links:

Climate Text Science

Earth and Space Science Open Archive

Description:
The Earth and Space Science Open Archive is a community server established to accelerate the open discovery and dissemination of earth, environmental, and space science research by archiving and sharing early research outputs, including preprints, presentations from major scientific meetings, and important documents of scholarly societies.

Links:

Government Text

End-of-Term web archive

Description:
The End of Term Web Archive captures and saves U.S. Government websites at the end of presidential administrations. The EOT has thus far preserved websites from administration changes in 2008, 2012, 2016, and 2020. The End of Term Web Archive contains federal government websites (.gov, .mil, etc) in the Legislative, Executive, or Judicial branches of the government.

Links:

Science Images Text Videos

European Space Agency

Description:
The European Space Agency provides datasets on space science and observation data. The ESA Hubble portal catalogs all news releases, images and videos captured by the Hubble Space Telescope. The Earth Online portal functions as access point for a wide variety of Earth observation resources. Copernicus provides free instant access to a wide range of data and services from the Copernicus Sentinel missions.

Links:

History Images Text Videos

Europeana

Description:
The Europeana website provides cultural heritage enthusiasts, professionals, teachers, and researchers with access to Europe's digital cultural heritage. It contains thousands of items related to archaeology, art, newspapers, fashion, music, photography and more.

Links:

Technology Software

Files dot Dog

Description:
This site contains a large collection of Microsoft Developer Network (MSDN) files, along with random other files.

Links:

Gaming Software

Flashpoint

Description:
Flashpoint provides a frontend player software and a database of over 200,000 Flash games and animations, preserved from the early web.

Links:

History Images

Fortepan

Description:
Fortepan is a copyright-free and community-based photo archive with over 200,000 Hungarian photographs available for anyone to browse and download in high-resolution, free of charge.

Links:

Science Text Images

Free GIS Data

Description:
This page contains a categorized list of links to over 500 sites providing freely available geographic datasets, all ready for loading into a Geographic Information System (GIS).

Links:

Gaming Images Videos Software

Games Database

Description:
Games Database is one of the biggest source for manuals, videos, music and artwork. The site provides over 32k videos, 8k music files, 14k manuals, 5k game adverts, 822 TV commercials for 126 systems.

Links:

World History Text Videos

GDELT

Description:
GDELT is the largest, most comprehensive, and highest resolution open database of human society ever created. Creating a platform that monitors the world's news media from nearly every corner of every country in print, broadcast, and web formats, in over 100 languages, every moment of every day and that stretches back to January 1, 1979 through present day.

Links:

Science Text

Global Biodiversity Information Facility

Description:
GBIF (the Global Biodiversity Information Facility) is an international network and data infrastructure funded by the world's governments and aimed at providing anyone, anywhere, open access to data about all types of life on Earth. It provides access to over 110,000 datasets.

Links:

Climate Text Images

Global Energy Monitor

Description:
Global Energy Monitor develops and analyzes data on energy infrastructure, resources, and uses. They provide open access to information that is essential to building a sustainable energy future.

Links:

Images Culture

Google Arts & Culture

Description:
Google Arts & Culture is a non-commercial initiative. They work with museums, cultural institutions and artists around the world to preserve and bring the world's art and culture online so it's accessible to anyone, anywhere.

Links:

Images World History

Google News Archive

Description:
Google News Archive is an extension of Google News providing free access to scanned archives of newspapers and links to other newspaper archives on the web, both free and paid. The site covers hundreds of individual publications, with thousands of scanned pages since the 18th century, with full text search.

Links:

Images Text History Videos

Historic England

Description:
Historic England is the public body that helps people care for, enjoy and celebrate England's spectacular historic environment. They hold an outstanding range of photographs, plans and drawings in their public archive, covering the historic environment of England.

Links:

History Text Images Videos

Historic Environment Scotland

Description:
Historic Environment Scotland is the lead public body established to investigate, care for and promote Scotland’s historic environment, managing over 45,000 objects in its collections.

Links:

Technology Software

Hugging Face

Description:
Hugging Face is the platform where the machine learning community collaborates on models, datasets, and applications. It contains the largest collection of open source AI models and focuses on machine learning tasks.

Links:

Technology Software

Ibiblio

Description:
Ibiblio (then called SunSITE) began mirroring open source software in 1992, and was one of only three such repositories available on the internet. Now almost 30 years later mirroring and open source software has evolved.

Links:

Government Text Culture Videos

ICPSR

Description:
ICPSR is research science data and resources on topics like social media, politics, economics, social sciences, government, GIS, & more. ICPSR is part of the Institute for Social Research at the University of Michigan.

Links:

Science Text

INSDC

Description:
The International Nucleotide Sequence Database Collaboration (INSDC) archives nucleotide sequence data, from raw to assembled and annotated sequences, from around the world.

Links:

Technology Software

Interesting DOS Programs

Description:
This is an archive of various DOS software and other DOS related websites. Most are freeware but a few are shareware and commercial programs.

Links:

Technology World Text Images Videos Software Books

Internet Archive

Description:
The Internet Archive is an American non-profit organization founded in 1996 by Brewster Kahle that runs a digital library website, archive.org. It provides free access to collections of digitized media including websites, software applications, music, audiovisual, and print materials.

Links:

Science Health World Text Videos

IPUMS

Description:
IPUMS provides census and survey data from around the world integrated across time and space. IPUMS integration and documentation makes it easy to study change, conduct comparative research, merge information across data types, and analyze individuals within family and community contexts. Data and services available free of charge.

Links:

Climate Text Images

IUCN Red List of Threatened Species

Description:
Established in 1964, The International Union for Conservation of Nature's Red List of Threatened Species has evolved to become the world's most comprehensive information source on the global conservation status of animal, fungi and plant species.

Links:

History Text Images

Japan Disasters Archive

Description:
A project of Harvard University’s Reischauer Institute of Japanese Studies, the Japan Disasters Digital Archive (JDA) collects archived materials from all over the web, including websites, images, video, audio, news articles, individual testimonials, tweets, and other content.

Links:

World Books History Text Images

JSTOR

Description:
JSTOR provides access to more than 12 million journal articles, books, images, and primary sources in 75 disciplines.

Links:

Technology World Science Text

Kaggle

Description:
Kaggle is one of the largest collection of datasets, mostly focusing on statistics, science, world affairs and technology. It contains 430K high-quality public datasets. Everything from avocado prices to video game sales.

Links:

Gaming Software Videos

Keitai Game Preservation

Description:
This wiki is dedicated to cataloging games from Japanese Feature Phones (keitai), pre-Android/iPhone mobile devices released in Japan. (e.g. i-Mode game, i-Appli game, EZweb game, S!Appli game). They also provide information and support for preserving Japanese feature phone games.

Links:

Gaming Text Images

Kirkland's Manual Labor

Description:
This site contains thousands of video gaming manuals for retro consoles such as N64, SNES, GameBoy, and more.

Links:

Technology World Science Books Text Software

Kiwix

Description:
3 billion people have no or little access to internet. This can be because of costs, lack of infrastructure, or outright censorship. Kiwix provides offline versions of popular web sites like Wikipedia, Wikibooks and Project Gutenberg.

Links:

Books Text Images

Library Genesis

Description:
Library Genesis (LibGen) is the largest free library in history, giving access to 84 million scholarly journal articles, 6.6 million academic and general interest books, 2.2 million comics and 381 thousand magazines.

Links:

Science Books Text

LibreTexts

Description:
LibreTexts is the adaptable, user-friendly open education resource platform that educators trust for creating, customizing, and sharing accessible, interactive textbooks, adaptive homework, and ancillary materials. We collaborate with individuals and organizations to champion open education initiatives, support institutional publishing programs, drive curriculum development projects, and more. The LibreText Commons hosts curated Open Educational Resources from all 16 libraries in the LibreVerse in one convenient location.

Links:

Books Images

MangaDex

Description:
MangaDex is one of many websites dedicated to archiving scanned mangas and other Asian comic books. These sites provide thousands of titles to read for free, compiled by volunteers.

Links:

World Text

Mirror Service

Description:
The UK Mirror Service provides a collection of mirrors of FTP, web and rsync sites of interest to academic users. The service is provided by the University of Kent's School Of Computing.

Links:

History Text Images Videos

Montreal Archives

Description:
The site of the Montreal Archives contains the greatest collection of historical documents and photographs about Montreal's history, including the history of early Canadian life.

Links:

Technology History Text Images Videos

Museum of Obsolete Media

Description:
A unique online museum of physical media formats showcasing developments in audio, video, film and data storage, the Museum preserves the memory of those objects that held our memories, and every format listed in the Museum is represented by at least one example in the collection.

Links:

Videos War History

Museum of Stolen Art

Description:
The museum is a virtual 3D space exhibiting digital copies of Ukrainian artworks which were stolen, destroyed or disappeared as a result of the full-scale invasion of Russia.

Links:

Gaming Software

My Abandonedware

Description:
On My abandonware you can download all the old video games from 1965 to 2012 for free. You can play Pacman, Arkanoid, Tetris, Galaxian, Alter Ego, or Blackthorne, Civilization, Sim City, Prince of Persia, Xenon 2, King's quest, Ultima, Kyrandia, The Incredible Machine, Another World, Test Drive, Flashback, Lemmings and more. For each game, they offer all related information included publication year, publisher, developer, size of the game, language, review of the game, instructions to play, the game manual and, of course, the game archive that you can download for free.

Links:

Culture Text Images

My Figure Collection

Description:
MyFigureCollection.net is a service for Japanese pop-culture (anime, manga, video-games, etc.) goods (figures, artbooks, CDs, cups, collectibles, dakimakura, etc.) collectors. Join our friendly community for free and start listing and managing your collection online: databases, pictures, calendar, budget manager, encyclopedia and much more.

Links:

World Government Text Images Videos

National Archives

Description:
The National Archives is a common term to designate a government funded archival institution focused on cataloging and making available historically significant works from the country in question, usually through government mandated processes.

Links:

Science Text

National Center for Biotechnology Information

Description:
NCBI is the one-stop shop for finding, browsing, and downloading genomic data.

Links:

Videos Culture

National Film Board of Canada

Description:
In addition to being a public producer and distributor of Canadian content, the National Film Board of Canada (NFB) is the caretaker of over 7,000 productions available for free for personal use.

Links:

Books History World Government Text

National Library

Description:
A national library is a library established by a government as a country's preeminent repository of information. They often include numerous rare, valuable, or significant works.

Links:

Government Technology Text War

National Security Archive

Description:
Founded in 1985 by journalists and scholars to check rising government secrecy, the National Security Archive combines a unique range of functions: investigative journalism center, research institute on international affairs, library and archive of declassified U.S. documents.

Links:

Gaming Software Videos

Nexus Mods

Description:
Nexus Mods is one of several gaming mods archives, hosting over 300,000 mods for over 3,500 PC games. Several other sites also provide smaller archives of gaming mods.

Links:

Gaming Software

Old Games

Description:
Old-Games.com provides 10,000+ old PC games free to download, along with screenshots and descriptions.

Links:

World Images

Old Maps Online

Description:
Browse historical places and search for old maps with timeline, through a web or mobile app.

Links:

World Culture Videos

Open Culture

Description:
Take online courses from the world’s top universities for free. Here, you will find 1,700 free online courses from universities like Yale, MIT, Harvard, Oxford and more.

Links:

Books Text

Open Library

Description:
Open Library is an initiative of the Internet Archive and provides access to thousands of books, out of print and otherwise. It provides an open, editable library catalog, building towards a web page for every book ever published.

Links:

Climate Science Text

OpenEI

Description:
The Open Energy Data Initiative (OEDI) enables research, collaboration, and transparency by providing open access to energy data and information. The OpenEI Data Lake is a centralized repository of datasets aggregated from the U.S. Department of Energy’s Programs, Offices, and National Laboratories. It provides links to over 4.19 PB of data.

Links:

Technology Software

OpenML

Description:
OpenML is an open platform for sharing datasets, algorithms, and experiments. It contains thousands of datasets and machine learning tasks running openly.

Links:

Technology Text Images War

OSINT Ukraine

Description:
This is a public repository of tools, resources and an archive of Telegram messages related to the war in Ukraine. Note that some of the media on the site are very graphic.

Links:

World Text Images

Our World in Data

Description:
Our World in Data is a project of the Global Change Data Lab, a non-profit organization providing analysis from thousands of researchers around the world about poverty, disease, hunger, climate change, war, existential risks, and inequality.

Links:

Climate Science Text Images

Pangea

Description:
The information system PANGAEA is operated as an Open Access library aimed at archiving, publishing and distributing georeferenced data from earth system research. PANGAEA is open to any project, institution, or individual scientist to use or to archive and publish data.

Links:

Videos Images Text History

Penn Museum

Description:
Home to over a million extraordinary artifacts and archaeological finds, the Penn Museum has been uncovering our shared humanity across continents and millennia since 1887.

Links:

Books Text

Project Gutenberg

Description:
Project Gutenberg is a library of over 75,000 free eBooks. Everything from Project Gutenberg is gratis, libre, and completely without cost to readers. Michael Hart, founder of Project Gutenberg, invented eBooks in 1971 and his memory continues to inspire the creation of eBooks and related content today. The Project Gutenberg Literary Archive Foundation (PGLAF) is the non-profit corporation that oversees operation of the project.

Links:

Climate Science Government Text

Public Environmental Data Project

Description:
The Public Environmental Data Project is committed to preserving and providing public access to federal environmental data. They are a volunteer coalition of several environmental, justice, and policy organizations, researchers across several universities, archivists, and students who rely on federal datasets and tools to support critical research, advocacy, policy, and litigation work. Several datasets are available on their site.

Links:

Government Text

Publications Office of the European Union

Description:
The Publications Office of the European Union is the official provider of publishing services to all EU institutions, bodies and agencies. As such, it is the central point of access to EU law, and also to publications, data, research results, procurement notices and other official information.

Links:

Technology History Text Images

Radio Museum

Description:
The radio museum contains a vast library of data about radio devices. It contains over 350K radio models, 2.8M pictures including 1M schematics, and 79K tubes/semiconductors.

Links:

Books Images

Read All Comics

Description:
Read All Comics provides a web interface to read thousands of comics for free.

Links:

Books Gaming Images

RetroMags

Description:
This site indexes and makes available for free download thousands of retro gaming magazines and strategy guides from 10 years ago and earlier.

Links:

Technology Text Software Videos

SANS Institute

Description:
The SANS institute provides a number of cybersecurity resources including training, tools, posters, videos, publications and more.

Links:

Science Books Text

Sci-Hub

Description:
Sci-Hub started as a tool for providing quick access to articles from scientific journals - such articles are the main medium of communication of scientific knowledge today. Now Sci-Hub has grown a database of over 88 millions research articles and books freely accessible for anyone to read and download.

Links:

Government Science Climate Text Images

SciOp

Description:
SciOp is the curated catalog/index of Safeguarding Research & Culture (SRC), an alternative infrastructure for archival and dissemination of cultural heritage and scientific knowledge in the form of a federated BitTorrent tracker.

Links:

History Text Images

SHAFR

Description:
The Society for Historians of American Foreign Relations maintains a large list of Asian resources including newspaper scans, private collections, cultural archives and more.

Links:

Technology Text

Sigma AI

Description:
Sigma AI provides a list of open AI related datasets from various other sites.

Links:

Images World History Videos

Smithsonian Open Access

Description:
On the Smithsonian Open Access web site, you can download, share, and reuse more than 5.1 million 2D and 3D digital items from their collections.

Links:

Technology Software Videos

Software Heritage

Description:
The long term goal of the Software Heritage initiative is to collect all publicly available software in source code form together with its development history, replicate it massively to ensure its preservation, and share it with everyone who needs it. The Software Heritage archive is growing over time as they crawl new source code from software projects and development forges.

Links:

Climate Science Government Text Software

Source COOP

Description:
Source Cooperative is a data publishing utility that allows trusted organizations and individuals to share data using standard HTTP methods. It contains large data collections and mirrors of various sites, mostly centered around science, government and climate.

Links:

Technology Text Software

TextFiles

Description:
TEXTFILES.COM has been online for nearly 25 years providing text files, focusing mostly on the years 1980-1995.

Links:

Gaming Images Videos Software

The Cutting Room Floor

Description:
The Cutting Room Floor is a site dedicated to unearthing and researching unused and cut content from video games. From debug menus, to unused music, graphics, enemies, and levels.

Links:

Technology Text Images

The Eye

Description:
The Eye is a very large archive of files of all types covering decades. It provides archives of various sub-reddits, Telegram channels, AI models, books, website crawls, 3D models, images and more.

Links:

Gaming Text Images Software

The Old Computer

Description:
Home to the largest collection of roms and emulators anywhere on the web with over 500,000 ROMs and Emulators for every major computer, console, arcade machine, pinball table and mobile device. Box Scans, Manuals, Magazines and a 179,000+ strong user community.

Links:

Technology Text Software

The Unix Heritage

Description:
The Unix Heritage Society's aims include the preservation and maintenance of historical and non-mainstream UNIX systems; the further development of existing UNIX systems; and the continual fostering of the Unix community spirit. They host historical Unix distribution and packages available for download.

Links:

History Text

ToposText

Description:
ToposText is an indexed collection of ancient texts and mapped places relevant to the history and mythology of the ancient Greeks from the Neolithic period up through the 2nd century CE. It archives thousands of places, people and texts.

Links:

Government Text Images Videos

UK Web Archive

Description:
UKWA captures, preserves and makes accessible UK central government information published on the web. The Web Archive includes videos, tweets, images and websites dating from 1996 to the present day.

Links:

World Text Images

United Nations Digital Library

Description:
The Digital Library is the central search engine to access millions of records from all UN bodies and agencies, including letters, reports, maps, speeches, voting data, and more.

Links:

World Text War

Uppsala Conflict Data Program

Description:
The Uppsala Conflic Data Program (UCDP) is the world's largest collection of wartime and organized violence data, covering over 40 years of conflicts, based at Uppsala University in Sweden and in collaboration with the Peace Research Institute in Oslo.

Links:

Climate Images

US Drought Monitor

Description:
The U.S. Drought Monitor provides climate maps weekly since 1999. It's produced jointly by the NDMC, NOAA and USDA.

Links:

World Images Government Videos

US Geological Survey

Description:
USGS is a US governmental agency providing services and resources around the disciplines of biology, geography, geology, and hydrology. The site provides access to images and videos related to maps, topographic surveys and more.

Links:

Technology Images Videos

Web Design Museum

Description:
The Web Design Museum exhibits thousands of screenshots and videos of old websites, mobile apps and software from 1990s to mid-00s.

Links:

World Text

Wikipedia

Description:
Wikipedia is a free-content online encyclopedia written and maintained by a community of volunteers, known as Wikipedians, through open collaboration and the wiki software MediaWiki. It is the largest and most-read reference work in history. Wikipedia is hosted by the Wikimedia Foundation, a non-profit organization that also hosts a range of other projects.

Links:

History World Text Images

Wilson Center Digital Archive

Description:
Gain critical insights into international history and policy through declassified documents from governments, organizations, and individuals worldwide, curated by the Wilson Center.

Links:

Gaming Technology

WinWorld

Description:
WinWorld is an online museum created in 2003 dedicated to the preservation and sharing of vintage, abandoned, and pre-release software. It offers information, media and downloads for a wide variety of computers and operating systems. Get classic operating systems, applications, games and betas for every platform from PC to Mac to Amiga, right here from the software library on WinWorld.

Links:

World Government Text Images Videos

World Bank Open Data

Description:
The World Bank Open Data portal provides free and open access to global development data, mostly focusing on economic datasets.

Links:

Technology Text

Your.Org

Description:
Your.Org is a hosting company that provides hundreds of terrabytes of data for various sites. They also host a mirror of various open source software including Linux distributions, FreeBSD, Wikipedia database dumps, other websites such as Microsoft, Corel, IBM and much more.

Links: