On this page you will find links to data archives from various countries. These archives contain data that was gathered and saved for the public good.
Health Text
101 Cookbooks
Description:
101 Cookbooks is a food blog from California that archived thousands of healthy recipes, made available for free.
- 101cookbooks.com/archives.html - List of recipes.
Software Technology Gaming
Abandonware DOS
Description:
Abandonware DOS is an ever-expanding archive of classic PC games originally released for DOS, Windows, and Macintosh. A database of games that date back to the 80s and 90s, and they are available for download.
- abandonwaredos.com - Home page.
Science Text Torrents
Academic Torrents
Description:
Making over 127.15TB of research data available, this site provides a distributed system for sharing enormous datasets for researchers, by researchers. The result is a scalable, secure, and fault-tolerant repository for data, with blazing fast download speeds.
- academictorrents.com - List of torrents.
Books Technology Text
ACM Digital Library
Description:
The ACM Digital Library is a research, discovery and networking platform containing all ACM publications, including journals, conference proceedings, technical magazines, newsletters and books, along with a collection of curated and hosted full-text publications from select publishers.
- dl.acm.org - Home page.
World Culture Text Images Videos
African Online Digital Library
Description:
AODL provides free universal access to cultural heritage materials from and about African countries and communities. It brings together tens of thousands of digitized photographs, videos, archival documents, maps, interviews and oral histories in numerous African languages, many of which are contained in curated thematic galleries and teaching resources.
- aodl.org - Home page.
War Text Images Videos
Airwars
Description:
Airwars is a not-for-profit transparency watchdog which tracks, assesses, archives and investigates civilian harm claims in conflict-affected nations. Founded in 2014 they are today a leading authority on conflict violence as it affects civilian communities.
- airwars.org - Home page.
- youtube.com/@airwarsorg - Videos channel.
Images Software Audio Gaming
Amiga Paradise
Description:
Preserving the musical heritage of the Commodore Amiga since 1997 and currently serving over 1,179 software packages and 6,125 music files.
- amigaparadise.com - Home page.
Books Science History Text Torrents
Anna's Archive
Description:
Described as the largest truly open library in human history. This site mirrors Sci-Hub and LibGen. They also scrape and open-source Z-Lib, DuXiu, and more. Currently hosting over 42 million books, 98 million papers, preserved forever. All their code and data are completely open source.
- annas-archive.org - Main web page.
- annas-archive.se - Mirror site.
- annas-archive.li - Mirror site.
Images Videos World Government
AP Newsroom
Description:
AP has the world's largest collection of news related images and videos, with millions of items in its archives.
- newsroom.ap.org/editorial-photos-vi... - Home page.
- youtube.com/@aparchive - Videos channel.
Technology Text
Appropedia
Description:
Appropedia is a site for original research on sustainable development and appropriate technologies. The wiki documents over 4,000 projects.
- appropedia.org - Home page.
Science Text History
Archaeology Data Service
Description:
ADS is the leading accredited repository in the UK for archaeology and historic environment data, with over 25 years of experience supporting research, learning and teaching with free, high quality and dependable digital resources.
- archaeologydataservice.ac.uk - Home page.
- archaeologydataservice.ac.uk/archiv... - List of archives.
World Text
Archive.today
Description:
Archive.today is a time capsule for web pages! It takes a 'snapshot' of a webpage that will always be online even if the original page disappears. It saves a text and a graphical copy of the page for better accuracy and provides a short and reliable link to an unalterable record of any web page.
- archive.today - The web site archives.
World Culture History Text
Archives Portal Europe
Description:
Archives Portal Europe is a single access to find, browse and discover information on archives about Europe held by thousands of cultural heritage institutions from more than 30 countries. Search over 650,000 archival collections including hundreds of millions of documents, the persons and organizations that created and used these documents, and the institutions holding the archival material today.
- archivesportaleurope.net - Home page.
History Text Images World
Ariadne
Description:
The ARIADNE Portal offers a central point of access to the archaeological resources made available from partner institutions throughout Europe.
- portal.ariadne-infrastructure.eu - Search portal.
Text World
Arquivo.pt
Description:
Arquivo is a Portuguese web archive, focused on archiving the entire Portuguese web since 1996.
- arquivo.pt/?l=en - Home page.
Images Culture Videos
Art UK
Description:
Art UK brings together art from over 3,400 British institutions in one of the UK's biggest-ever arts partnerships. It shows over 600,000 works by over 60,000 artists and is growing all the time.
- artuk.org - Home page.
- youtube.com/@ArtUKdotorg - Videos channel.
Science Text
arXiv
Description:
arXiv is a free distribution service and an open-access archive for nearly 2.4 million scholarly articles in the fields of physics, mathematics, computer science, quantitative biology, quantitative finance, statistics, electrical engineering and systems science, and economics. arXiv is a community of volunteer authors, readers, moderators, advisory board members, supporting members, donors, and third-party collaborators that are supported by the staff at Cornell University.
- arxiv.org - Browse articles.
World Technology Text Images Science Government
AWS Data Exchange
Description:
AWS Data Exchange makes it easy to find datasets made publicly available through AWS services. Browse available data and learn how to register your own datasets.
- aws.amazon.com/marketplace/search/r... - List of all Data Exchange applications.
- aws.amazon.com/marketplace/pp/prodv... - A corpus of web crawl data composed of over 50 billion web pages.
- aws.amazon.com/earth - Registry of Earth related datasets.
- archives.gov/developer/national-arc... - National Archives Catalog on the AWS Registry of Open Data.
- s3.us-east-1.amazonaws.com/rds.nsrl... - National Software Reference Library (NSRL)
Videos Culture
BBC Archive
Description:
BBC Archive from around the UK. Explore thousands of BBC archive films. This unique collection shows life and events across the UK since the 1940s.
- bbc.co.uk/archive - Home page.
- youtube.com/@bbcarchive - Videos channel.
Technology Software Images
BetaArchive
Description:
BetaArchive is a contribution based website that contains over 100 TB of beta and abandonware software for Windows and MacOS. It also provides a popular forum and gallery of screenshots from thousands of applications.
- betaarchive.com - Home page.
History Images Videos
British Museum
Description:
The British Museum is a public history museum established in 1753 and receiving over 5 million visitors each year. The online collection allows access to almost five million objects in more than two million records.
- britishmuseum.org/collection - List of online collections.
- youtube.com/@britishmuseum - Videos channel.
Government Health Climate Science Text World
CAFE
Description:
The Convene-Accelerate-Foster-Expand (CAFE) site is an open collection designed to support and enhance global research initiatives focused on understanding and mitigating the health impacts of climate change. It's hosted by Harvard University, Boston University and contains hundreds of datasets, mostly from US Gov web sites.
- dataverse.harvard.edu/dataverse/CAF... - Index of CAFE datasets.
Government Law Text
Caselaw Access Project
Description:
The Caselaw Access Project (CAP) scanned the entirety of the Harvard Law School Library's physical collection of American case law and made it machine-readable in a consistent format available online. To facilitate that agreement, the Library Innovation Lab (LIL) maintained the case.law website as the primary access point for the data. CAP includes all official, book-published state and federal United States case law through 2020, every volume or case designated as an official report of decisions by a court within the United States.
- case.law/caselaw - List of law volumes per state.
War World Text Images
Centre for Information Resilience
Description:
The Centre for Information Resilience (CIR) is an independent organization dedicated to exposing human rights violations and threats to democracy. Their projects are at the forefront of efforts to investigate and document human rights abuses and war crimes.
- info-res.org - Home page.
Science Text Software Videos
CERN Open Data
Description:
Explore more than five petabytes of open data from particle physics on this CERN web site.
- opendata.cern.ch - Open Data web site.
- youtube.com/@CERN - Videos channel.
History Text Images Videos War
Chartlann Mhileata Military Archives
Description:
The Military Archives offers a diverse range of collections documenting Ireland's military history, including pensions and historical documents.
- militaryarchives.ie/en/online-colle... - Browse the collections.
Science Text Videos
ChemSpider
Description:
A free chemical structure database providing access to millions of chemical structures and properties, hosted by the Royal Society of Chemistry, London.
- chemspider.com - Search engine.
- youtube.com/@royalsocietyofchemistr... - Videos channel.
Technology Software Images Videos Torrents
CivitAI
Description:
CivitAI is an online platform and marketplace for generative AI content, primarily focused on AI-generated images and models.
- civitai.com - Home page.
- civitaiarchive.com - NSFW models archive site.
- diffusionarc.com - Alternative database of images models.
- civitasbay.org - List of CivitAI torrents.
- miyukiai.com - Backup site.
- youtube.com/@civitai - Videos channel.
Climate Images Videos Software Text
Climate Data Store
Description:
The C3S Climate Data Store (CDS) is a one-stop shop for information about the climate: past, present and future. It provides easy access to a wide range of climate datasets via a searchable catalogue.
- cds.climate.copernicus.eu/#!/home - List of datasets.
- confluence.ecmwf.int/display/CKB/Ge... - Copernicus documentation site.
- cds.climate.copernicus.eu/applicati... - List of real-time climate applications.
- charts.ecmwf.int - Weather charts.
- ecmwf.int/en/publications - Climate publications.
- youtube.com/@CopernicusECMWF - Videos channel.
Government Climate Text Torrents
Climate Mirror Project
Description:
The Climate Mirror Project is trying to mirror and safely archive US Gov websites and datasets related to climate, climate change, and global warming. It provides mirrors of official NOAA and other government web sites.
- climate.daknob.net - List of datasets and torrents.
Text Climate Images
Climate TRACE
Description:
Climate TRACE provides comprehensive emissions tracking over 662 million emitting assets in 10 sectors over 10+ years. The site is an AI powered platform for realtime GHG emissions reporting, built by a global, not-for-profit coalition of over 100 universities, scientists, and AI experts
- climatetrace.org - Home page.
History World Text Images
Coleccion Aruba
Description:
In October 2022, the National Archive, National Library, and UNOCA signed an agreement to collaborate on digitizing the history, culture, and heritage of Aruba. The intention is to create a platform where Aruba’s cultural heritage can be easily accessible.
- coleccion.aw/pages/en/home-en - Home page.
- coleccion.aw/browse - List of collections.
World Text
Common Crawl
Description:
Common Crawl maintains a free, open repository of web crawl data that can be used by anyone. They believe that everyone should have the opportunity to indulge their curiosities, analyze the world, and pursue brilliant ideas. The latest crawl contains over 2.74 billion web pages.
- commoncrawl.org - Home page.
Government Technology Text Software
Common Vulnerabilities and Exposures (CVE)
Description:
The CVE program identifies, defines, and catalogs publicly disclosed cybersecurity vulnerabilities. There are currently over 274,000 CVE Records accessible through the program. While it depends on US Government funding, there are several alternative databases also available.
- cve.org - Main CVE website.
- github.com/CVEProject/cvelistV5 - Official list of all CVEs.
- vulnerability.circl.lu - Main vulnerability lookup site.
- vulnerability-lookup.org - Vulnerability lookup software.
- gcve.eu - Decentralized CVE alternative.
- euvd.enisa.europa.eu - EU vulnerability database.
- cyber.gc.ca/en/alerts-advisories - Canadian vulnerability database.
- seclists.org - Archive of popular cybersecurity mailing lists.
- infosec.exchange - Mastodon community surrounding InfoSec.
Gaming Software
Console Mods
Description:
This wiki contains information on game console modding and game dumping tools.
- consolemods.org - Console mods wiki
- edump.org - Mod dumping wiki
World History Text
Cross-National Time-Series Data
Description:
CNTS provides more than 200 years of annual data from 1815 onward, including 196 demographic, political, legislative, economic and social science variables.
- cntsdata.com - List of databanks.
History Text Images Culture
Cultural Japan
Description:
This site aims to collect information related to Japanese culture published in museums, libraries, etc. around the world, and to provide them with a common and reusable format.
- cultural.jp/en - Home page.
World Images History Culture
Curationist
Description:
Curationist is a free online resource that brings together arts and culture communities to find, share, collaborate, and reimagine cultural narratives. Since its launch in 2022, Curationist enables global users to search more than 4.4 million images of works from the open access collections of museums and archives worldwide.
- curationist.org - Home page.
Culture World Text Images
CURIOSity Digital Collections
Description:
CURIOSity Is the online portal of Harvard Library. It provides success to thousands of art pieces, maps, books, photographs and more.
- curiosity.lib.harvard.edu - List of collections.
History Text Images
DAACS
Description:
The Digital Archaeological Archive of Comparative Slavery (DAACS) is a Web-based initiative designed to foster inter-site, comparative archaeological research on slavery throughout the Chesapeake, the Carolinas, and the Caribbean.
- daacs.org - Home page.
Text Technology Software
DaFont
Description:
DaFont provides over 70,000 fonts free of charge, for use on Windows, MacOS or Linux.
- dafont.com - Home page.
- dafont.com/illustrate-it.charmap - Popular brand names.
Climate Science Text Images
Data Basin
Description:
Data Basin is a science-based mapping and analysis platform that supports learning, research, and sustainable environmental stewardship. It publishes maps, datasets, visualizations, drawings, & analyses.
- databasin.org - Home page.
Government Text
Data Liberation Project
Description:
The Data Liberation Project is an initiative to identify, obtain, reformat, clean, document, publish, and disseminate US Gov datasets of public interest.
- data-liberation-project.org/dataset... - List of published datasets.
Government Text
Data Lumos
Description:
DataLumos is an ICPSR archive for valuable government data resources. ICPSR has a long commitment to safekeeping and disseminating US government and other social science data. DataLumos accepts deposits of public data resources from the community and recommendations of public data resources that ICPSR itself might add to DataLumos. The site is hosted by the University of Michigan.
- datalumos.org/datalumos/search/stud... - List of datasets.
Government Text
Data Rescue Project
Description:
The Data Rescue Project is a coordinated effort among a group of data organizations focusing on rescue-related efforts and data access points for public US governmental data that are currently at risk. It provides resources, collections of datasets and news updates.
- datarescueproject.org - Home page.
- datarescueproject.org/resources - List of resources.
- baserow.datarescueproject.org/publi... - List of datasets.
Images World
David Rumsey Map Collection
Description:
The David Rumsey Map Collection was started over 35 years ago and contains more than 200,000 maps. The collection focuses on rare 16th through 21st century maps of North and South America, as well as maps of the World, Asia, Africa, Europe, and Oceania.
- davidrumsey.com - Home page.
Government Text
Deportation Data Project
Description:
The Deportation Data Project collects and posts public, anonymized U.S. government immigration enforcement (ICE) datasets. They expect these datasets to be used by journalists, researchers, lawyers, and policymakers.
- deportationdata.org - Home page.
- deportationdata.org/data.html - List of datasets.
- app.resistmap.com/feed - Live feed of user submitted raid reports.
- icelist.is/ice/category/people/ice-... - List of ICE agent profiles.
History Text Images
Digital Archive Ontario
Description:
Digital Archive Ontario collects digitized items held by Toronto Public Library, including over 100,000 historical photos, maps, postcards & more from Ontario.
- digitalarchiveontario.ca - Home page.
Books Images
Digital Comic Museum
Description:
The Digital Comic Museum claims to be the best site for downloading FREE public domain Golden Age Comics.
- digitalcomicmuseum.com - Home page.
History Books Text Videos
Digital Public Library of America
Description:
The DPLA highlights millions of items from libraries, archives and museums across the United States, organized into easy-to-navigate topics through a single catalog.
- dp.la - Home page.
- ebooks.dp.la/the-banned-book-club - The banned books club.
- youtube.com/@digpublib - Videos channel.
Images Text History Science
DigitalCommons@UNO
Description:
DigitalCommons@UNO is an institutional repository with the goal of collecting and making visible the intellectual output of the University of Nebraska at Omaha. It contains collections about science, technology, historical records, scanned newspapers, community engagement, conferences and events, public safety, and much more.
- digitalcommons.unomaha.edu - Home page.
- digitalcommons.unomaha.edu/communit... - List of collections.
Technology Text
Distributed Denial of Secrets
Description:
Distributed Denial of Secrets is a non-profit in the US that archives and publishes hacked and leaked documents in the public interest.
- ddosecrets.com - Home page.
Gaming Software
DOS Zone
Description:
Enjoy classic games completely free and without ads on dos.zone! They play in your browser without the need to download anything.
- dos.zone - Home page.
Technology Software Torrents
Drivers Collection
Description:
Drivers Collection is one of largest free web library of device drivers for computer hardware. It contains over 6 million drivers from various hardware vendors.
- driverscollection.com - Drivers home page.
- web.archive.org/web/20211220061945/... - Torrent links for driver packs.
- vogonsdrivers.com - Alternative drivers archive.
Science Text
Dryad
Description:
Dryad is an open data publishing platform and a community committed to the open availability and routine re-use of all research data. Their multi-stakeholder community of academic and research institutions, research funders, scholarly societies and publishers is committed to leading in best practices for open data sharing and reuse.
- datadryad.org/stash - Dryad datasets.
Climate Text Science World
Earth and Space Science Open Archive
Description:
The Earth and Space Science Open Archive is a community server established to accelerate the open discovery and dissemination of earth, environmental, and space science research by archiving and sharing early research outputs, including preprints, presentations from major scientific meetings, and important documents of scholarly societies.
- essopenarchive.org - Home page.
Government Text
End-of-Term web archive
Description:
The End of Term Web Archive captures and saves U.S. Government websites at the end of presidential administrations. The EOT has thus far preserved websites from administration changes in 2008, 2012, 2016, and 2020. The End of Term Web Archive contains federal government websites (.gov, .mil, etc) in the Legislative, Executive, or Judicial branches of the government.
- eotarchive.org - EOT web archive.
Government Text
Environment Data & Governance Initiative
Description:
The Website Governance Project Team monitors, documents, and analyzes changes to US federal websites, and advocates for improvements in public information policies.
- envirodatagov.org - Home page.
- environmentalenforcementwatch.org - Environmental Enforcement Watch.
- docs.google.com/spreadsheets/d/1eqZ... - List of datasets.
Law Text
Epstein Files Archive
Description:
An automatically processed, OCR'd, searchable archive of publicly released documents related to the Jeffrey Epstein case.
- epstein-docs.github.io - Searchable text archives.
- github.com/epstein-docs/epstein-doc... - OCR pipeline source code.
- vault.fbi.gov/jeffrey-epstein - Original image archives.
- neko2077.net/epstein - Mirror site.
Science Images Text Videos World
European Space Agency
Description:
The European Space Agency provides datasets on space science and observation data. The ESA Hubble portal catalogs all news releases, images and videos captured by the Hubble Space Telescope. The Earth Online portal functions as access point for a wide variety of Earth observation resources. Copernicus provides free instant access to a wide range of data and services from the Copernicus Sentinel missions.
- esa.int - Home page.
- esahubble.org - Hubble Space Telescope.
- earth.esa.int/eogateway - Earth Online.
- dataspace.copernicus.eu - Copernicus.
- data.esa.int - Data query browser.
- youtube.com/@europeanspaceagency - Videos channel.
History Images Text Videos World Government Culture Science
Europeana
Description:
The Europeana website provides cultural heritage enthusiasts, professionals, teachers, and researchers with access to Europe's digital cultural heritage. It contains thousands of items related to archaeology, art, newspapers, fashion, music, photography and more.
- europeana.eu/en - Home page.
- europeana.eu/en/themes - List of collections.
- youtube.com/@EuropeanaEU - Videos channel.
Technology Software
Evolt Browser Archive
Description:
Evolt.org is a web development community founded in 1998. While the community site closed, it still hosts an archive of hundreds of web browsers.
- browsers.evolt.org - Browsers list.
Government Science Text
Fairdata
Description:
Fairdata services are part of the digital preservation services offered by the Ministry of Education and Culture, Finland (“Minedu”).
- fairdata.fi/en/# - Home page.
- etsin.fairdata.fi/datasets - List of datasets.
Technology Software
Files dot Dog
Description:
This site contains a large collection of Microsoft Developer Network (MSDN) files, along with random other files.
- files.dog - Files archive.
Gaming Software
Flashpoint
Description:
Flashpoint provides a frontend player software and a database of over 200,000 Flash games and animations, preserved from the early web.
- flashpointarchive.org - Home page.
- flashpointproject.github.io/flashpo... - Database.
Science Text
Flora of North America
Description:
Flora of North America (FNA) presents for the first time, in one published reference source, information on the names, taxonomic relationships, continent-wide distributions, and morphological characteristics of all plants native and naturalized found in North America north of Mexico.
- floranorthamerica.org/Main_Page - Home page.
History Images World
Fortepan
Description:
Fortepan is a copyright-free and community-based photo archive with over 200,000 Hungarian photographs available for anyone to browse and download in high-resolution, free of charge.
- fortepan.hu/en - Home page.
- fortepan.hu/en/photo-uploads - All collections.
- fortepan.us - US clone.
Science Text Images
Free GIS Data
Description:
This page contains a categorized list of links to over 500 sites providing freely available geographic datasets, all ready for loading into a Geographic Information System (GIS).
- freegisdata.rtwilson.com - Archives listing.
Text Law
Free Law
Description:
Free Law Project seeks to provide free access to primary legal materials, develop legal research tools, and support academic research on legal corpora. Currently Free Law Project sponsors the development of CourtListener, Juriscraper, and RECAP.
- free.law - Free Law home page.
- courtlistener.com - Court Listener archive.
- github.com/freelawproject/juriscrap... - Juriscraper tool.
- free.law/recap - PACER tools.
Gaming Images Videos Software
Games Database
Description:
Games Database is one of the biggest source for manuals, videos, music and artwork. The site provides over 32k videos, 8k music files, 14k manuals, 5k game adverts, 822 TV commercials for 126 systems.
- gamesdatabase.org - Main web site.
World History Text Videos
GDELT
Description:
GDELT is the largest, most comprehensive, and highest resolution open database of human society ever created. Creating a platform that monitors the world's news media from nearly every corner of every country in print, broadcast, and web formats, in over 100 languages, every moment of every day and that stretches back to January 1, 1979 through present day.
- gdeltproject.org - Home page.
- gdeltproject.org/data.html#rawdataf... - List of datasets.
- youtube.com/@gdeltproject - Videos channel.
Science Government Climate Images
GEO.ca
Description:
GEO.ca is the definitive source for Canada’s open geospatial information. Open data. Applications. Maps. And more. Discover it all on GEO.ca, along with the tools you need to visualize, analyze and share the insights you create. Unlock the power of location here.
- geo.ca/home - Home page.
Climate Government Text Images
German Meteorological Service
Description:
Within its legal mandate, the German Meteorological Service (DWD) offers weather and climate data free of charge on its Open Data server. Data includes webcam footage, satellites and radar maps, radiation data, local and historical forecasts.
- cdc.dwd.de/portal - Climate Data Center.
- opendata.dwd.de - Open Data Server.
Science Text World
Global Biodiversity Information Facility
Description:
GBIF (the Global Biodiversity Information Facility) is an international network and data infrastructure funded by the world's governments and aimed at providing anyone, anywhere, open access to data about all types of life on Earth. It provides access to over 110,000 datasets.
- gbif.org - Main web site.
Climate Text Images World
Global Energy Monitor
Description:
Global Energy Monitor develops and analyzes data on energy infrastructure, resources, and uses. They provide open access to information that is essential to building a sustainable energy future.
- globalenergymonitor.org - Home page.
Images Culture
Google Arts & Culture
Description:
Google Arts & Culture is a non-commercial initiative. They work with museums, cultural institutions and artists around the world to preserve and bring the world's art and culture online so it's accessible to anyone, anywhere.
- artsandculture.google.com/partner - List of collections.
Images World History
Google News Archive
Description:
Google News Archive is an extension of Google News providing free access to scanned archives of newspapers and links to other newspaper archives on the web, both free and paid. The site covers hundreds of individual publications, with thousands of scanned pages since the 18th century, with full text search.
- news.google.com/newspapers - List of archived newspapers.
Government Text
GovInfo
Description:
GovInfo is a service of the United States Government Publishing Office (GPO), which is a Federal agency in the legislative branch. GovInfo provides free public access to official publications from all three branches of the Federal Government.
- govinfo.gov - Home page.
Culture Images
Guggenheim New York
Description:
Featuring over 1,900 artworks by more than 625 artists, the Collection Online presents a searchable database of selected artworks from the Guggenheim's permanent collection of approximately 8,000 artworks. The selection reflects the breadth, diversity, and tenor of the Solomon R. Guggenheim Foundation's extensive holdings from the late 19th century through the present day.
- guggenheim.org/collection-online - Collection online.
Technology Images
GUIdebook
Description:
GUIdebook is a website dedicated to preserving and showcasing Graphical User Interfaces, as well as various materials related to them. It features hundreds of screenshots from GUIs, icons, desktops and more.
- guidebookgallery.org/index - Home page.
Text Science
HAL Open Science
Description:
HAL is a multidisciplinary open archive for sharing research results in open access. It is at the service of researchers affiliated with academic institutions, whether public or private. HAL is the national archive chosen by the France scientific and academic communities for the open dissemination of its research results. It contains over 1.5M scientific papers and 4M references.
- hal.science - Home page.
Text Images Technology
HiFi Engine
Description:
The HiFi Engine library has images, specifications and reviews for thousands of audio components, along with owners manuals, service manuals, schematics and product catalogues.
- hifiengine.com - Home page.
Images Text History Videos
Historic England
Description:
Historic England is the public body that helps people care for, enjoy and celebrate England's spectacular historic environment. They hold an outstanding range of photographs, plans and drawings in their public archive, covering the historic environment of England.
- historicengland.org.uk - Home page.
- historicengland.org.uk/images-books - List of collections.
- youtube.com/@historicengland - Videos channel.
History Text Images Videos
Historic Environment Scotland
Description:
Historic Environment Scotland is the lead public body established to investigate, care for and promote Scotland’s historic environment, managing over 45,000 objects in its collections.
- historicenvironment.scot - Home page.
- historicenvironment.scot/archives-a... - Online exhibitions.
- trove.scot/explore/objects?rsrc=his... - Database of objects.
- youtube.com/@HistoricEnvironmentSco... - Videos channel.
Technology Software
Hugging Face
Description:
Hugging Face is the platform where the machine learning community collaborates on models, datasets, and applications. It contains the largest collection of open source AI models and focuses on machine learning tasks.
- huggingface.co - Main web page.
Technology Software Images
Ibiblio
Description:
Ibiblio (then called SunSITE) began mirroring open source software in 1992, and was one of only three such repositories available on the internet. Now almost 30 years later mirroring and open source software has evolved.
- ibiblio.org - Main web page.
- ibiblio.org/catalog/exhibits/show/s... - List of mirrored projects.
- ibiblio.org/wm - Web museum.
Government Text Culture Videos
ICPSR
Description:
ICPSR is research science data and resources on topics like social media, politics, economics, social sciences, government, GIS, & more. ICPSR is part of the Institute for Social Research at the University of Michigan.
- icpsr.umich.edu/web/pages - Main web page.
- icpsr.umich.edu/web/pages/ICPSR/acc... - List of datasets.
- youtube.com/@icpsr - Videos channel.
Culture Images Text
IKEA Museum
Description:
The digital museum showcases the story of IKEA and is open to anyone who is curious about IKEA and life at home, anywhere and anytime. It includes the full catalogs from 1950 to 2021.
- ikeamuseum.com/en - Home page.
- ikeamuseum.com/en/explore/ikea-cata... - Catalog archives.
- archive.org/details/ikea-catalogs/I... - Internet Archive mirror.
War Images Text World
Imperial War Museums
Description:
Since its foundation in 1917 IWM has been building its collections in order to illustrate and record all aspects of conflict in the twentieth and twenty-first centuries. IWM's collection of over 1 million items covers all aspects of conflict involving Britain, its former Empire and the Commonwealth, from the First World War to the present day. As well as objects, it includes a range of media, from art, film and photographs to printed materials, documents and sound.
- iwm.org.uk/collections - Explore collections.
Science Text
INSDC
Description:
The International Nucleotide Sequence Database Collaboration (INSDC) archives nucleotide sequence data, from raw to assembled and annotated sequences, from around the world.
- insdc.org - Link to archival sites.
Technology Software
Interesting DOS Programs
Description:
This is an archive of various DOS software and other DOS related websites. Most are freeware but a few are shareware and commercial programs.
- dosprograms.info.tt - Home page.
Law Text Images War World
International Court of Justice
Description:
The International Court of Justice (ICJ), the International Criminal Court (ICC) along with temporary International Criminal Tribunals (ICT) established for special cases of crimes against humanity have their archives spread on various sites across the web.
- icj-cij.org/list-of-all-cases - List of cases and supporting documents.
- irmct.org/en - International Residual Mechanism for Criminal Tribunals.
- icc-cpi.int/case-records - International Criminal Court records.
- icj-cij.org/sites/default/files/doc... - Nuremberg trials archives ICJ pamphlet.
- nuremberg.law.harvard.edu - Archive of the Nuremberg trials at Harvard Law.
- unictr.irmct.org/en - International Criminal Tribunal for Rwanda archives.
- archive.eccc.gov.kh/en - Khmer Rouge Tribunal.
- rscsl.org - Residual Special Court for Sierra Leone.
Technology World Text Images Videos Books Audio Torrents Culture Gaming History Science
Internet Archive
Description:
The Internet Archive is an American non-profit organization founded in 1996 by Brewster Kahle that runs a digital library website, archive.org. It provides free access to collections of digitized media including websites, software applications, music, audiovisual, and print materials.
- archive.org - The main archive web site.
- web.archive.org - The Wayback Machine.
- ww.bibalex.org/isis/frontend/archiv... - Bibalex mirror of the Wayback Machine.
- archive-it.org - Archive-it collections.
- openlibrary.org - Free online library.
Science Health World Text Videos
IPUMS
Description:
IPUMS provides census and survey data from around the world integrated across time and space. IPUMS integration and documentation makes it easy to study change, conduct comparative research, merge information across data types, and analyze individuals within family and community contexts. Data and services available free of charge.
- ipums.org - List of IPUMS datasets.
- youtube.com/@mpcipums - Videos channel.
Climate Text Images World
IUCN Red List of Threatened Species
Description:
Established in 1964, The International Union for Conservation of Nature's Red List of Threatened Species has evolved to become the world's most comprehensive information source on the global conservation status of animal, fungi and plant species.
- iucnredlist.org - Home page.
- iucnredlist.org/resources/grid - List of resources.
Government Text Videos
January 6th Archives
Description:
The purpose of these sites is to archive videos and documents in the public interest with regard to the January 6th, 2021 coup attempt in the United States.
- extremism.gwu.edu/january-6-data - January 6th data.
- jan6archive.com - January 6th multimedia archive.
- citizen.org/january-6-committee-arc... - Public Citizen archive.
- citizen.org/january-6-committee-arc... - Library of Congress archive.
- archive.org/details/j6tapes - Internet Archive mirror.
History Text Images
Japan Disasters Archive
Description:
A project of Harvard University’s Reischauer Institute of Japanese Studies, the Japan Disasters Digital Archive (JDA) collects archived materials from all over the web, including websites, images, video, audio, news articles, individual testimonials, tweets, and other content.
- jdarchive.org/en - Home page.
- jdarchive.org/en/collectionsearch - List of collections.
World Books History Text Images
JSTOR
Description:
JSTOR provides access to more than 12 million journal articles, books, images, and primary sources in 75 disciplines.
- jstor.org - JSTOR home page.
- jstor.org/site/collection-list - All collections.
- jstor.org/site/south-asia-open-arch... - South Asia Open Archives.
- jstor.org/site/collection-list/?col... - Artstor art collection.
Technology World Science Text
Kaggle
Description:
Kaggle is one of the largest collection of datasets, mostly focusing on statistics, science, world affairs and technology. It contains 430K high-quality public datasets. Everything from avocado prices to video game sales.
- kaggle.com/datasets - List of Kaggle datasets.
Gaming Software Videos
Keitai Game Preservation
Description:
This wiki is dedicated to cataloging games from Japanese Feature Phones (keitai), pre-Android/iPhone mobile devices released in Japan. (e.g. i-Mode game, i-Appli game, EZweb game, S!Appli game). They also provide information and support for preserving Japanese feature phone games.
- keitaiwiki.com - Keitai wiki
- youtube.com/watch?v=I1VJw_yYI1U - Documentary on Lessons from Keitai Game Preservation
Gaming Text Images
Kirkland's Manual Labor
Description:
This site contains thousands of video gaming manuals for retro consoles such as N64, SNES, GameBoy, and more.
- videogamemanual.com - Home page.
Technology World Science Books Text Software
Kiwix
Description:
3 billion people have no or little access to internet. This can be because of costs, lack of infrastructure, or outright censorship. Kiwix provides offline versions of popular web sites like Wikipedia, Wikibooks and Project Gutenberg.
- kiwix.org/en - Home page.
- kiwix.org/en/applications - List of offline applications.
- library.kiwix.org/#lang=eng - Library of content for Kiwix.
Culture Images
Le Louvre
Description:
Le Louvre is a national art museum in Paris, France, and one of the most famous museums in the world. Several Virtual tours of the museum's rooms and galleries showcase its architecture and collections.
- louvre.fr/en/online-tours - List of virtual tours.
History Text Images
LiBER
Description:
LiBER is a project by the Institute of Heritage Science and the Institute for Studies on the Mediterranean cataloging over 6,000 Linear B inscriptions, mostly from palatial contexts dating from the 14th-13th centuries BC. The vast majority of them are Ancient Greece clay tablets of economic content.
- liber.cnr.it - Home page.
Books Text Images
Library Genesis
Description:
Library Genesis (LibGen) is the largest free library in history, giving access to 84 million scholarly journal articles, 6.6 million academic and general interest books, 2.2 million comics and 381 thousand magazines.
- libgen.li - Home page.
Science Books Text
LibreTexts
Description:
LibreTexts is the adaptable, user-friendly open education resource platform that educators trust for creating, customizing, and sharing accessible, interactive textbooks, adaptive homework, and ancillary materials. We collaborate with individuals and organizations to champion open education initiatives, support institutional publishing programs, drive curriculum development projects, and more. The LibreText Commons hosts curated Open Educational Resources from all 16 libraries in the LibreVerse in one convenient location.
- libretexts.org - Project home page.
- commons.libretexts.org - Index of textbooks.
Books Audio
LibriVox
Description:
LibriVox volunteers record chapters of books in the public domain, and then we release the audio files back onto the net for free. Their goal is to make all books in the public domain available, narrated by real people and distributed for free, in audio format on the internet.
- librivox.org - Home page.
Culture Audio
Lyrics
Description:
Lyrics.com is a vast compilation of song lyrics, album details, and featured video clips for a seemingly endless array of artists.
- lyrics.com - Home page.
- azlyrics.com - Alternative site.
Software Technology Gaming
Macintosh Garden
Description:
The Macintosh Garden is an abandonware archive, dedicated in particular to supporting the Macintosh computer platform. Software featured on the Macintosh Garden has been discontinued by their publishers and is no longer commercially available. The Macintosh Garden aims to preserve these treasures for future generations, providing documentation and downloads of the original files.
- macintoshgarden.org - Home page.
Text Technology
Mailing List ARChives
Description:
The Mailing list ARChives (MARC) is an RDBMS (MySQL, to be exact) driven database of mailing list messages, viewable and browsable by list, thread, author, or searchable via a full-text search engine. Its interface is no-frills but highly functional, designed to be useable even over slow links or with text-only browsers like lynx. As of 2014-04-02, the MARC archive had 70 million emails across about 3,500 mailing lists, from over seven million different authors. It gets about 350,000 new mails per month, and over 35 million total web-hits per month.
- marc.info - List archives.
Technology Software Gaming Videos
MajorGeeks
Description:
For over two decades, MajorGeeks.com has been a pivotal platform in the digital landscape, championing the discovery and distribution of innovative software and cybersecurity solutions.
- majorgeeks.com - Home page.
- youtube.com/@majorgeeks - Videos channel.
Software Technology
MalwareBazaar
Description:
MalwareBazaar is a platform from abuse.ch and Spamhaus, dedicated to sharing malware samples with the infosec community, antivirus vendors, and threat intelligence providers. Upload malware samples and explore the database for valuable intelligence. Set alerts to track newly observed malware, use APIs to seamlessly push or pull signals, and automate bulk queries.
- bazaar.abuse.ch - Home page.
Images Culture
Manchester Digital Collections
Description:
Manchester Digital Collections supports The University of Manchester’s mission to advance education, knowledge and wisdom for the good of society. It provides a new resource for exploring high-quality images of cultural collections and research projects at The University of Manchester.
- digitalcollections.manchester.ac.uk... - List of collections.
- library.manchester.ac.uk/rylands/sp... - The Humanitarian Archive.
Books Images
MangaDex
Description:
MangaDex is one of many websites dedicated to archiving scanned mangas and other Asian comic books. These sites provide thousands of titles to read for free, compiled by volunteers.
- mangadex.org - MangaDex web site.
- mangakatana.com - Manga Katana web site.
- bato.to - Bato web site.
- mangahere.cc - MangaHere web site.
- weebcentral.com - Weeb Central web site.
Books Text
Memory of the World
Description:
This site contains over 150,000 curated novels and other rare books available online for free access.
- library.memoryoftheworld.org - Home page.
Culture History Images
Metropolitan Museum of Art
Description:
The Metropolitan Museum of Art, colloquially referred to as the Met, is an encyclopedic art museum in New York City. By floor area, it is the third-largest museum in the world and the largest art museum in the Americas. Travel around the world and across 5,000 years of history through 490,000+ works of art.
- metmuseum.org/art/collection - Online collection.
World Text
Mirror Service
Description:
The UK Mirror Service provides a collection of mirrors of FTP, web and rsync sites of interest to academic users. The service is provided by the University of Kent's School Of Computing.
- ww.mirrorservice.org/sites - List of mirrored sites.
Culture Images
MoMA
Description:
The Museum of Modern Arts (MoMA) of New York has an evolving collection that contains almost 200,000 works of modern and contemporary art. More than 105,000 works are currently available online.
- moma.org/collection - List of collections.
History Text Images Videos
Montreal Archives
Description:
The site of the Montreal Archives contains the greatest collection of historical documents and photographs about Montreal's history, including the history of early Canadian life.
- archivesdemontreal.com - Home page.
- archivesdemontreal.com/expositions-... - Virtual expositions.
- flickr.com/photos/archivesmontreal/... - Photo albums.
- youtube.com/@ArchivesMtl - Videos channel.
Technology History Text Images Videos
Museum of Obsolete Media
Description:
A unique online museum of physical media formats showcasing developments in audio, video, film and data storage, the Museum preserves the memory of those objects that held our memories, and every format listed in the Museum is represented by at least one example in the collection.
- obsoletemedia.org - Museum collection.
Videos War History
Museum of Stolen Art
Description:
The museum is a virtual 3D space exhibiting digital copies of Ukrainian artworks which were stolen, destroyed or disappeared as a result of the full-scale invasion of Russia.
- museumofstolen.art/en - Portal page.
Gaming Software
My Abandonedware
Description:
On My abandonware you can download all the old video games from 1965 to 2012 for free. You can play Pacman, Arkanoid, Tetris, Galaxian, Alter Ego, or Blackthorne, Civilization, Sim City, Prince of Persia, Xenon 2, King's quest, Ultima, Kyrandia, The Incredible Machine, Another World, Test Drive, Flashback, Lemmings and more. For each game, they offer all related information included publication year, publisher, developer, size of the game, language, review of the game, instructions to play, the game manual and, of course, the game archive that you can download for free.
- myabandonware.com - Index of software.
Culture Text Images
My Figure Collection
Description:
MyFigureCollection.net is a service for Japanese pop-culture (anime, manga, video-games, etc.) goods (figures, artbooks, CDs, cups, collectibles, dakimakura, etc.) collectors. Join our friendly community for free and start listing and managing your collection online: databases, pictures, calendar, budget manager, encyclopedia and much more.
- myfigurecollection.net - Home page.
World Government Text Images Videos
National Archives
Description:
The National Archives is a common term to designate a government funded archival institution focused on cataloging and making available historically significant works from the country in question, usually through government mandated processes.
- archives.gov - US National Archives.
- library-archives.canada.ca/eng - Library and Archives Canada.
- nationalarchives.gov.uk - UK National Archives.
- naa.gov.au - National Archives of Australia.
- archives-nationales.culture.gouv.fr - Archives Nationales de France.
- nationaalarchief.nl - Nationaal Archief.
- en.rigsarkivet.dk - Rigsarkivet.
- nas.gov.sg/archivesonline - National Archives Singapore.
Science Text
National Center for Biotechnology Information
Description:
NCBI is the one-stop shop for finding, browsing, and downloading genomic data.
- ncbi.nlm.nih.gov/datasets - Search datasets.
Videos Culture
National Film Board of Canada
Description:
In addition to being a public producer and distributor of Canadian content, the National Film Board of Canada (NFB) is the caretaker of over 7,000 productions available for free for personal use.
- nfb.ca - Home page.
- youtube.com/@nfb - Videos channel.
Books History World Government Text
National Library
Description:
A national library is a library established by a government as a country's preeminent repository of information. They often include numerous rare, valuable, or significant works.
- loc.gov/collections - Library of Congress collections.
- ndl.go.jp/en - National Diet Library, Japan.
- kansalliskirjasto.fi/en - National Library of Finland.
- deutsche-digitale-bibliothek.de/?la... - Deutsche Digitale Bibliothek.
- natlib.govt.nz/collections - National Library of New Zealand.
- gallica.bnf.fr/accueil/en/html/accu... - Bibliotheque Nationale de France.
- nlb.gov.sg/main/nlonline - National Library Singapore.
- bl.uk/collection - British Library.
Government Technology Text War
National Security Archive
Description:
Founded in 1985 by journalists and scholars to check rising government secrecy, the National Security Archive combines a unique range of functions: investigative journalism center, research institute on international affairs, library and archive of declassified U.S. documents.
- nsarchive.gwu.edu - Home page.
Gaming Software Videos
Nexus Mods
Description:
Nexus Mods is one of several gaming mods archives, hosting over 300,000 mods for over 3,500 PC games. Several other sites also provide smaller archives of gaming mods.
- nexusmods.com - Nexus Mods web site.
- moddb.com - Mod DB web site.
- curseforge.com - Curse Forge web site.
- loverslab.com - Adults mods site.
- ayakamods.cc - Alternative mods site.
- youtube.com/@NexusModsOfficial - Videos channel.
Government Text Images
NSA Files
Description:
The NSA files refer to a collection of documents and data related to the activities and operations of the National Security Agency (NSA) leaked by former contractor Edward Snowden in 2013, revealing extensive global surveillance programs.
- github.com/iamcryptoki/snowden-arch... - Snowden leaked documents.
- aclu.org/nsa-documents-released-to-... - ACLU documents archive.
- eff.org/nsa-spying - EFF archive of NSA spying data.
- theguardian.com/world/interactive/2... - NSA Files Decoded.
- en.wikipedia.org/wiki/2010s_global_... - Wikipedia page.
Science Text Images
NZ Flora
Description:
This site aims to be the definitive reference to New Zealand plants.
- nzflora.info - Home page.
Gaming Software
Old Games
Description:
Old-Games.com provides 10,000+ old PC games free to download, along with screenshots and descriptions.
- old-games.com - Main web site.
Technology Text Images
Old Manuals
Description:
These are some of the biggest computer and electronics manual databases on the internet with over 1.7M manuals. They have every brand from Acer to Zanussi.
- aboutmanual.com - About Manual.
- manualslib.com - Manuals Lib.
- manua.ls - Manua.ls.
- manualowl.com - Manual Owl.
- manuals.plus - Manuals Plus.
- manuals.ca - Manuals.ca.
- hparchive.com - Vintage Hewlett-Packard Archive.
- archive.org/details/computermanuals - Computer manuals collection at Internet Archive.
World Images
Old Maps Online
Description:
Browse historical places and search for old maps with timeline, through a web or mobile app.
- oldmapsonline.org - Home page.
Technology Software Gaming
OldVersion
Description:
Sometimes upgrading to a newer version can be a good thing. Other times, your computer may not be compatible with the new version, the new version is bloated, or all the options you liked are no longer available. OldVersion.com has been supplying the online community with old versions of various programs since 2001.
- ww.oldversion.com - Home page.
World Culture Videos
Open Culture
Description:
Take online courses from the world’s top universities for free. Here, you will find 1,700 free online courses from universities like Yale, MIT, Harvard, Oxford and more.
- openculture.com - Home page.
- openculture.com/freeonlinecourses - List of free classes.
Books Text
Open Library
Description:
Open Library is an initiative of the Internet Archive and provides access to thousands of books, out of print and otherwise. It provides an open, editable library catalog, building towards a web page for every book ever published.
- openlibrary.org - Open Library home page.
- openlibrary.org/collections/banned-... - Banned books collection.
Text Science
OpenEdition
Description:
OpenEdition brings together four platforms dedicated to electronic resources in the humanities and social sciences from France. It archives thousands of books, journals, blogs and events.
- openedition.org/?lang=en - Home page.
Climate Science Text
OpenEI
Description:
The Open Energy Data Initiative (OEDI) enables research, collaboration, and transparency by providing open access to energy data and information. The OpenEI Data Lake is a centralized repository of datasets aggregated from the U.S. Department of Energy’s Programs, Offices, and National Laboratories. It provides links to over 4.19 PB of data.
- data.openei.org/data_lakes#Data-Lak... - Links to datasets.
Technology Software
OpenML
Description:
OpenML is an open platform for sharing datasets, algorithms, and experiments. It contains thousands of datasets and machine learning tasks running openly.
- openml.org - Home page.
Technology Text Images War
OSINT Ukraine
Description:
This is a public repository of tools, resources and an archive of Telegram messages related to the war in Ukraine. Note that some of the media on the site are very graphic.
- osintukraine.com - OSINT Ukraine home page.
- ukraine.osintukraine.com - Repository of Telegram messages.
World Text Images
Our World in Data
Description:
Our World in Data is a project of the Global Change Data Lab, a non-profit organization providing analysis from thousands of researchers around the world about poverty, disease, hunger, climate change, war, existential risks, and inequality.
- ourworldindata.org - Main page.
- ourworldindata.org/data - Data catalog.
Government Text Videos
Panama Papers
Description:
The Panama Papers Project unveils the complex world of offshore finance and its impact on the world's wealthiest individuals. This effort showcases the incredible collaboration of investigative journalists from around the globe, who exposed one of the largest data leaks in history.
- panamapapers.org - Home page.
- offshoreleaks.icij.org/search - Offshore Leaks Database.
- en.wikipedia.org/wiki/Panama_Papers - Wikipedia page.
Climate Science Text Images
Pangea
Description:
The information system PANGAEA is operated as an Open Access library aimed at archiving, publishing and distributing georeferenced data from earth system research. PANGAEA is open to any project, institution, or individual scientist to use or to archive and publish data.
- pangaea.de - List of datasets.
Videos Images Text History
Penn Museum
Description:
Home to over a million extraordinary artifacts and archaeological finds, the Penn Museum has been uncovering our shared humanity across continents and millennia since 1887.
- penn.museum - Home page.
- penn.museum/collections/search.php - List of collections.
- youtube.com/@pennmuseum - Videos channel.
History Technology Videos War
Periscope Film
Description:
Periscope Film LLC is a leading stock footage provider and publisher of historic military, aviation, railroad and transportation books and DVDs. Through their Open Archive program, they have saved, digitalized and archived thousands of films and documentaries.
- stock.periscopefilm.com - Home page.
- youtube.com/@periscopefilm - Videos channel.
History Text
Persée
Description:
Established in 2003, Persée is a joint service unit that brings together the ENS of Lyon, the CNRS and the University of Lyon and is supported by the France Ministry of Higher Education, Research and Innovation (MESRI). The Persée digital portal brings together complete collections of journals, conference proceedings, series and books. It now hosts more than 400 collections, representing over one million documents, with new titles added regularly.
- persee.fr - Home page.
- persee.fr/disciplines - Online collections.
Science Text Images World
Plants of the World Online
Description:
Plants of the World Online (POWO) is an international collaborative program that has as a primary aim to make available digitized data of the world's flora gathered from the past 250 years of botanical exploration and research. The site contains over 1.4M global plant names, 531,000 detailed descriptions, and 502,000 images.
- powo.science.kew.org - Home page.
Torrents
Private Tracker List
Description:
The Private Trackers List compiles data about private torrent servers, users and files.
- hdvinnie.github.io/Private-Trackers... - Home page.
Books Text
Project Gutenberg
Description:
Project Gutenberg is a library of over 75,000 free eBooks. Everything from Project Gutenberg is gratis, libre, and completely without cost to readers. Michael Hart, founder of Project Gutenberg, invented eBooks in 1971 and his memory continues to inspire the creation of eBooks and related content today. The Project Gutenberg Literary Archive Foundation (PGLAF) is the non-profit corporation that oversees operation of the project.
- gutenberg.org - The main project web site.
- ww.mirrorservice.org/sites/ftp.ibib... - Mirror site.
Climate Science Government Text
Public Environmental Data Project
Description:
The Public Environmental Data Project is committed to preserving and providing public access to federal environmental data. They are a volunteer coalition of several environmental, justice, and policy organizations, researchers across several universities, archivists, and students who rely on federal datasets and tools to support critical research, advocacy, policy, and litigation work. Several datasets are available on their site.
- screening-tools.com - The screening tools site.
Government Text
Publications Office of the European Union
Description:
The Publications Office of the European Union is the official provider of publishing services to all EU institutions, bodies and agencies. As such, it is the central point of access to EU law, and also to publications, data, research results, procurement notices and other official information.
- op.europa.eu/en/home - Home page.
- data.europa.eu/en - European Data.
- eur-lex.europa.eu/homepage.html?loc... - EU-Lex - Access law and treaty information.
- cordis.europa.eu - Cordis - EU research and development projects.
- op.europa.eu/en/web/general-publica... - List of EU publications.
Health Government Text
PubMed
Description:
PubMed comprises more than 39 million citations for biomedical literature from MEDLINE, life science journals, and online books. Citations may include links to full text content from PubMed Central and publisher web sites.
- pubmed.ncbi.nlm.nih.gov - Home page.
Culture Text
Qualitative Data Repository
Description:
The Qualitative Data Repository (QDR) is a dedicated archive for storing and sharing digital data (and accompanying documentation) generated or collected through qualitative and multi-method research in the social sciences and related disciplines.
- qdr.syr.edu - Home page.
Technology History Text Images
Radio Museum
Description:
The radio museum contains a vast library of data about radio devices. It contains over 350K radio models, 2.8M pictures including 1M schematics, and 79K tubes/semiconductors.
- radiomuseum.org - Collection of radio data.
Books Images
Read All Comics
Description:
Read All Comics provides a web interface to read thousands of comics for free.
- readallcomics.com - Home page.
Technology Software Images
Renderosity
Description:
Renderosity is a community of 3D artists, providing forums, a store front and a large quantity of free 3D assets.
- renderosity.com - Home page.
- renderosity.com/freestuff - List of free stuff.
Text Gaming
Replacement Docs
Description:
The replacementdocs.com site provides high quality scanned images of game instruction manuals in their full, original format with all original artwork and other graphical elements intact.
- ww.replacementdocs.com - Home page.
- ww.replacementdocs.com/download.php - List of manuals.
Technology Text
Request for Comments
Description:
A Request for Comments (RFC) is a publication in a series from the principal technical development and standards-setting bodies for the Internet, most prominently the Internet Engineering Task Force (IETF). An RFC is authored by individuals or groups of engineers and computer scientists in the form of a memorandum describing methods, behaviors, research, or innovations applicable to the working of the Internet and Internet-connected systems.
- rfc-editor.org - Home page.
- rfc-editor.org/rfc-index-100a.html - List of RFCs.
- neko2077.net/RFCs - Mirror site.
Books Gaming Images
RetroMags
Description:
This site indexes and makes available for free download thousands of retro gaming magazines and strategy guides from 10 years ago and earlier.
- retromags.com - Home page.
Culture Images
Rijksmuseum
Description:
The iconic Rijksmuseum in the heart of Amsterdam is one of the things you need to see when you visit the Netherlands. Rijksmuseum moves you through more than 8,000 works of Dutch art and history including masterpieces by Vermeer, Rembrandt, and Van Gogh.
- rijksmuseum.nl/en/collection - Online collection.
Technology Text Software Videos
SANS Institute
Description:
The SANS institute provides a number of cybersecurity resources including training, tools, posters, videos, publications and more.
- sans.org - Home page.
- sans.org/security-resources - List of security resources.
- youtube.com/@SANSInstitute - Videos channel.
Science Books Text
Sci-Hub
Description:
Sci-Hub started as a tool for providing quick access to articles from scientific journals - such articles are the main medium of communication of scientific knowledge today. Now Sci-Hub has grown a database of over 88 millions research articles and books freely accessible for anyone to read and download.
- sci-hub.st - Sci-Hub mirror site.
- sci-hub.se - Sci-Hub mirror site.
- sci-hub.ru - Sci-Hub mirror site.
Government Science Climate Text Images Torrents
SciOp
Description:
SciOp is the curated catalog/index of Safeguarding Research & Culture (SRC), an alternative infrastructure for archival and dissemination of cultural heritage and scientific knowledge in the form of a federated BitTorrent tracker.
- sciop.net - Home page.
- sciop.net/datasets - List of datasets.
History Text Images
SHAFR
Description:
The Society for Historians of American Foreign Relations maintains a large list of Asian resources including newspaper scans, private collections, cultural archives and more.
- shafr.org/asia-archives-resources - Asia archives.
Technology Text
Sigma AI
Description:
Sigma AI provides a list of open AI related datasets from various other sites.
- sigma.ai/open-datasets - List of datasets.
Government Text
Singapore Open Data Portal
Description:
This is the central data portal of the Government of Singapore, with 5,000+ datasets from 65+ government agencies, free for commercial or personal use.
- data.gov.sg - Home page.
- data.gov.sg/datasets - List of datasets.
History Text Images Videos
Smithsonian Museum of Natural History
Description:
Their mission is to promote understanding of the natural world and our place in it. The museum's collections tell the history of the planet and are a record of human interaction with the environment and one another.
- naturalhistory.si.edu - Home page.
- naturalhistory.si.edu/visit/virtual... - Virtual tour.
- collections.nmnh.si.edu/search - Online collections.
- youtube.com/@nationalmuseumofnatura... - Videos channel.
Images World History Videos
Smithsonian Open Access
Description:
On the Smithsonian Open Access web site, you can download, share, and reuse more than 5.1 million 2D and 3D digital items from their collections.
- si.edu/openaccess - Search collections.
- youtube.com/@smithsonian - Videos channel.
Technology Software Videos
Software Heritage
Description:
The long term goal of the Software Heritage initiative is to collect all publicly available software in source code form together with its development history, replicate it massively to ensure its preservation, and share it with everyone who needs it. The Software Heritage archive is growing over time as they crawl new source code from software projects and development forges.
- archive.softwareheritage.org - Source files archive.
- wiki.softwareheritage.org/wiki/Main... - Wiki documentation.
Climate Science Government Text Software
Source COOP
Description:
Source Cooperative is a data publishing utility that allows trusted organizations and individuals to share data using standard HTTP methods. It contains large data collections and mirrors of various sites, mostly centered around science, government and climate.
- source.coop - List of projects.
- source.coop/repositories/harvard-li... - Full mirror of data.gov.
- github.com/harvard-lil/data-vault - Scripts used to scrape data.gov.
Science Text
Taylor & Francis Online
Description:
Taylor & Francis provides access to over 5,431,000 articles and 2,600 peer-reviewed journals on a variety of scientific topics through its open access program.
- tandfonline.com - Home page.
Technology Text Software
TextFiles
Description:
TEXTFILES.COM has been online for nearly 25 years providing text files, focusing mostly on the years 1980-1995.
- extfiles.com - Collection of text files.
- d.textfiles.com/directory.html - Collection of shareware files.
- mirror3.preterhuman.net/textfiles - Mirror site.
- textfiles.meulie.net - Mirror site.
- textfiles.vistech.net - Mirror site.
Gaming Images Videos Software
The Cutting Room Floor
Description:
The Cutting Room Floor is a site dedicated to unearthing and researching unused and cut content from video games. From debug menus, to unused music, graphics, enemies, and levels.
- tcrf.net - Home page.
Technology Text Images
The Eye
Description:
The Eye is a very large archive of files of all types covering decades. It provides archives of various sub-reddits, Telegram channels, AI models, books, website crawls, 3D models, images and more.
- the-eye.eu - Home page.
- beta.the-eye.eu/public - Files directory.
Gaming Text Images Software
The Old Computer
Description:
Home to the largest collection of roms and emulators anywhere on the web with over 500,000 ROMs and Emulators for every major computer, console, arcade machine, pinball table and mobile device. Box Scans, Manuals, Magazines and a 179,000+ strong user community.
- theoldcomputer.com - Home page.
Software Technology
The UNIX Files
Description:
The UNIX Files is an abandonware archive dedicated to preserving software for propriatary UNIX operating systems such as IRIX, Solaris, AIX and HP-UX. Software featured on The UNIX Files has been discontinued by their publishers and is no longer commercially available. The UNIX Files aims to preserve these treasures for future generations, providing documentation and downloads of the original files.
- unixfiles.org - Home page.
Technology Text Software
The Unix Heritage
Description:
The Unix Heritage Society's aims include the preservation and maintenance of historical and non-mainstream UNIX systems; the further development of existing UNIX systems; and the continual fostering of the Unix community spirit. They host historical Unix distribution and packages available for download.
- tuhs.org - Heritage home page.
- wiki.tuhs.org/doku.php?id=source:un... - Unix archive wiki.
- tuhs.org/cgi-bin/utree.pl - Unix source code.
History Text
ToposText
Description:
ToposText is an indexed collection of ancient texts and mapped places relevant to the history and mythology of the ancient Greeks from the Neolithic period up through the 2nd century CE. It archives thousands of places, people and texts.
- topostext.org - Home page.
Government Culture Text Images History Science
Trove Australia
Description:
Trove is a collaboration between the National Library of Australia and hundreds of Partner organizations around Australia. You will find archived books, magazines, research, web sites and more.
- trove.nla.gov.au - Home page.
- webarchive.nla.gov.au/collection - Archived sites.
Government Text
Trump Twitter Archive
Description:
Trump's Truth is a public archive of Donald Trump's communications on TRUTH Social. Meanwhile, the twitter archive used to check Twitter every 60 seconds and record every Trump tweet into a database. Before the site launched in 2016, all available tweets were captured and added to the database for perpetuity.
- trumpstruth.org - Trump's Truth.
- thetrumparchive.com - Trump twitter archive.
Government Text Images Videos
UK Web Archive
Description:
UKWA captures, preserves and makes accessible UK central government information published on the web. The Web Archive includes videos, tweets, images and websites dating from 1996 to the present day.
- nationalarchives.gov.uk/webarchive - Home page.
World Text Images
United Nations Digital Library
Description:
The Digital Library is the central search engine to access millions of records from all UN bodies and agencies, including letters, reports, maps, speeches, voting data, and more.
- digitallibrary.un.org/?ln=en - UN Digital Library.
- unesdoc.unesco.org - UNESCO Digital Library.
- whc.unesco.org/en/list - World Heritage list.
World Text War
Uppsala Conflict Data Program
Description:
The Uppsala Conflic Data Program (UCDP) is the world's largest collection of wartime and organized violence data, covering over 40 years of conflicts, based at Uppsala University in Sweden and in collaboration with the Peace Research Institute in Oslo.
- ucdp.uu.se - Main UCDP page.
- prio.org - Peace Research Institute.
Climate Images
US Drought Monitor
Description:
The U.S. Drought Monitor provides climate maps weekly since 1999. It's produced jointly by the NDMC, NOAA and USDA.
- droughtmonitor.unl.edu/Maps/MapArch... - Maps archive.
World Images Government Videos
US Geological Survey
Description:
USGS is a US governmental agency providing services and resources around the disciplines of biology, geography, geology, and hydrology. The site provides access to images and videos related to maps, topographic surveys and more.
- usgs.gov - Home page.
- usgs.gov/programs/national-geospati... - Historical maps collection.
- youtube.com/@usgs - Videos channel.
Text Technology
Usenet Archives
Description:
Usenet is a worldwide distributed Internet discussion system. It was developed from the general purpose UUCP dial-up network architecture. Duke University graduate students Tom Truscott and Jim Ellis conceived the idea in 1979 and it was established in 1980. Users read and post messages to one or more categories, known as newsgroups. Usenet was mostly replaced by Internet forums, but various archives remain.
- usenetarchives.com - Usenet Archives site.
- archive.org/details/usenet?and%5B%5... - Usenet collection at Internet Archives.
- olduse.net - olduse.net.
- eternal-september.org - Free news archive
- usenet.blueworldhosting.com - Free news archive.
- news.nntp4.net - Free news archive.
- complete.org/quux-org-nncp-public-r... - QUUX public relay.
Technology Text
Vintage Machinery
Description:
The VintageMachinery.org web site is devoted to information on the history, restoration and use of vintage woodworking machinery. The site contains information concerning vintage machinery including downloadable publications, historical information and technical data.
- intagemachinery.org/home.aspx - Home page.
Technology Images Videos
Web Design Museum
Description:
The Web Design Museum exhibits thousands of screenshots and videos of old websites, mobile apps and software from 1990s to mid-00s.
- webdesignmuseum.org - Home page.
World Text
Wikileaks
Description:
WikiLeaks is a multi-national media organization and associated library. It was founded by Julian Assange in 2006. WikiLeaks specializes in the analysis and publication of large datasets of censored or otherwise restricted official materials involving war, spying and corruption. It has so far published more than 10 million documents and associated analyses.
- wikileaks.org/-Leaks-.html - Archive of leaked documents.
World Text
Wikipedia
Description:
Wikipedia is a free-content online encyclopedia written and maintained by a community of volunteers, known as Wikipedians, through open collaboration and the wiki software MediaWiki. It is the largest and most-read reference work in history. Wikipedia is hosted by the Wikimedia Foundation, a non-profit organization that also hosts a range of other projects.
- en.wikipedia.org/wiki/Main_Page - English Wikipedia home page.
- wikipedia.org - Wikipedia in other languages.
- wikidata.org/wiki/Wikidata:Main_Pag... - Wikidata.
History World Text Images
Wilson Center Digital Archive
Description:
Gain critical insights into international history and policy through declassified documents from governments, organizations, and individuals worldwide, curated by the Wilson Center.
- digitalarchive.wilsoncenter.org - Main page.
Gaming Technology
WinWorld
Description:
WinWorld is an online museum created in 2003 dedicated to the preservation and sharing of vintage, abandoned, and pre-release software. It offers information, media and downloads for a wide variety of computers and operating systems. Get classic operating systems, applications, games and betas for every platform from PC to Mac to Amiga, right here from the software library on WinWorld.
- winworldpc.com/home - The WinWorld library.
World Government Text Images Videos
World Bank Open Data
Description:
The World Bank Open Data portal provides free and open access to global development data, mostly focusing on economic datasets.
- data.worldbank.org - Data portal.
- youtube.com/@WorldBank - Videos channel.
Technology Text
World Radio History
Description:
Over 140,000 documents and publications are made available about radio and broadcasting, covering magazines, technical publications, world pages and more.
- worldradiohistory.com - Home page.
Law World Text
WorldCourts
Description:
WorldCourts is one of the largest databases of international case law in the world. Established in 1999, it provides a single point of access to over 50,000 decisions of 52 international and internationalized judicial and quasi-judicial bodies, as well as hundreds of arbitral awards.
- worldcourts.com - Home page.
History Law Text Government Images Videos Audio
Yale Library Digital Collections
Description:
Yale Library Digital Collections provides online access to image and text-based materials. It includes millions of documents in multiple collections.
- library.yale.edu/explore-collection... - Home page.
- collections.library.yale.edu - Digital text collections.
- avcollections.library.yale.edu - Aviary audiovisual collection.
- avalon.law.yale.edu - The Avalon project collects 4,000 years of law documents.
- web.library.yale.edu/digital-collec... - Yale Daily News archives.
Technology Text Software
Your.Org
Description:
Your.Org is a hosting company that provides hundreds of terrabytes of data for various sites. They also host a mirror of various open source software including Linux distributions, FreeBSD, Wikipedia database dumps, other websites such as Microsoft, Corel, IBM and much more.
- ftpmirror.your.org/pub - Main mirror site.
- ftpmirror.your.org/pub/misc - Mirrors of other companies.
Books Text Science
Z-Library Project
Description:
Z-Library is one of the largest online libraries in the world that contains over 27 millions books and 91 millions articles. They aim to make literature accessible to everyone.
- z-library.sk - Home page.
- 1lib.sk - Mirror site.
- z-lib.gd - Mirror site.
- z-lib.gl - Mirror site.