Main Page/Archive

From Archiveteam

And we've been trashing our history

Archive Team is a loose collective of rogue archivists, programmers, writers and loudmouths dedicated to saving our digital heritage. Since 2009 this variant force of nature has caught wind of shutdowns, shutoffs, mergers, and plain old deletions - and done our best to save the history before it's lost forever. Along the way, we've gotten attention, resistance, press and discussion, but most importantly, we've gotten the message out: IT DOESN'T HAVE TO BE THIS WAY.

This website is intended to be an offloading point and information depot for a number of archiving projects, all related to saving websites or data that is in danger of being lost. Besides serving as a hub for team-based pulling down and mirroring of data, this site will provide advice on managing your own data and rescuing it from the brink of destruction.

Currently Active Projects (Get Involved Here!)

Archive Team recruiting

Warrior-based projects

Current Running Warrior Project: Reddit

An updated Warrior virtual appliance (v3.2) is now available with better support for newer projects that utilize wget-at. Please download it using the link above.

Manual projects

  • 2019-202? coronavirus outbreak: Documenting and preserving data, events, and impacts of the virus on society. IRC Channel #coronarchive (on hackint).
  • ArchiveBot: For those with lots of disk space, bandwidth and long-term commitment. IRC Channel #archivebot (on hackint).
  • WikiTeam: Saving wiki dumps (XML) and their external links for the Wayback Machine (WARC), as well as exporting MediaWiki databases. Permanent effort, everyone can help (you choose the size of your downloads). IRC Channel #wikiteam (on hackint).
  • Dead people: When people die, their webpages and/or social media might go "Poof!" due to fees and other knick-knacks. IRC Channel #archiveteam (on hackint).

Upcoming & proposed projects

  • Chrome Web Store: Google has announced a timeline of policy changes that will lead to content being removed between December 1, 2020 and 2023. IRC Channel #chromeweblore (on hackint).
  • Kinja: Deleting all user pages, maybe? IRC Channel #gokinjagokinjago (on hackint).
  • Twitter: Deleting inactive accounts sometime from 2019-12-11 onward. IRC Channel #archiveteam-bs (on hackint).
  • YouTube: Archiving all YouTube metadata soon, and selected videos afterwards. IRC Channel #down-the-tube (on hackint).
  • Imgur: Image hoster decided that using it for hosting images is not permitted. (Now it's been bought by MediaLab.) IRC Channel #imgone (on hackint).
  • JamiiForums: The Tanzanian government would like this gone. IRC Channel #archiveteam-bs (on hackint).
  • LiveJournal: Very old, widely regarded as in decline, and has a lot of important stuff buried in it. IRC Channel #archiveteam-bs (on hackint).
  • Ownlog: Ownlog is losing popularity and support from its owners. IRC Channel #archiveteam-bs (on hackint).
  • The Pirate Bay: Recently came back up, grabbing an archive for sanity's sake. IRC Channel #archiveteam-bs (on hackint).
  • Roblox: Archiving Roblox after its servers broke down for 2 days. Soon archiving asset pages, user pages, user inventories, catalog pages, game pages, developer forums, assets, and games (a little work needed for games). Thank GOD it wasn't ransomware. IRC Channel #archiveteam-bs (on hackint).
  • Valhalla: Where to store what even the Internet Archive doesn't have space for? IRC Channel #huntinggrounds (on hackint).
  • Giphy: Bought by Facebook, to be "integrated" (assimilated) into Instagram.

Recently finished projects

  • Afghanistan: Archiving the Afghan web due to recent events. IRC Channel #afghansites (on hackint).
  • YouTube Dislikes: YouTube is making dislike counts private, so now is a good time to archive those counts as well as other basic metadata. IRC Channel #down-the-tube (on hackint).
  • YouTube Discussions: Discussion tabs from millions of channels will be lost on 2021-10-12. IRC Channel #down-the-tube (on hackint).
  • XTube: The shutdown on 5 September 2021 will surely leave a gaping hole in the web. IRC Channel #nevermind (on hackint).
  • CodePlex: Microsoft's self-archive will be permanently removed from its Recycle Bin after July 1, 2021. IRC Channel #plexicode (on hackint).
  • Google Drive: Google broke millions of shared Drive links on 13 September 2021. IRC Channel #googlecrash (on hackint).
  • Google Poly: A 3D art repository that Google will send to the trash compactors on June 30, 2021. New uploads cease April 30. IRC Channel #polygone (on hackint).
  • Bintray: JFrog is dismantling the software distribution platform used by numerous projects in May. IRC Channel #binnedtray (on hackint).

Hiatus / Missed the Mark

ArchiveTeam uses the hackint IRC network – ircs:// (TLS required) – webchat: info

Archive Team News


What is What

  • Deathwatch is where we keep track of sites that are sickly, dying or dead.
  • Fire Drill is where we keep track of sites that seem fine but a lot depends on them.
  • Projects is a comprehensive list of AT endeavors.
  • Philosophy describes the ideas underpinning our work.

Some Starting Points

  • Software will assist you in regaining control of your data by providing tools for information backup, archiving and distribution.
  • Formats will familiarise you with the various data formats, and how to ensure your files will be readable in the future.
  • Storage Media is about where to get it, what to get, and how to use it.

Quote of the Moment

"[Yahoo!] found the way to destroy the most massive amount of history in the shortest amount of time with absolutely no recourse"

Internet Atrocity! GeoCities' Demise Erases Web History
By Dan Fletcher, TIME Magazine, Monday, Nov. 09, 2009

Recently Ended Projects


Archive Team is in no way affiliated with the fine folks at ARCHIVE.ORG. Archive Team can always be reached at