Projects

From Archiveteam
Projects status
Online (298) · Special cases (43) · Endangered (70) · Closing (24) · Offline (389)
Rescued Sites (466) · Self-Saved (14) · Partially Rescued Sites (198) · In Progress (42) · Upcoming (12) · Not Saved Yet (395) · On hiatus (8) · Lost Sites (81)
Unknown Status (66)

This page should contain, or directly link to, almost all ArchiveTeam archiving endeavours, categorized.

  • Current projects: currently active, upcoming and recently finished major ArchiveTeam projects. (Extract of the next two categories.)
  • Warrior projects: projects that utilize(d) ArchiveTeam's distributed archiving system.
  • Manual projects: projects that need(ed) much more effort than just pushing a button.
  • Small projects: small-scale website archiving projects usually done by a single individual.
  • Early projects: the first archiving endeavours from the dawn of ArchiveTeam, in a format that apparently nobody is able or dares to touch.

(The box at the top counts only projects that have dedicated wiki pages; those numbers are incomplete and by far do not cover all projects mentioned in the sections below.)

If you know of a website in danger, let us know on IRC. If it's a larger site, please also mention it on the Deathwatch page. Once a decision has been made on IRC, or if no decision is needed, you are encouraged to add projects or update their status, to help keep things documented and up to date.

The box at the top is generated automatically from projects' dedicated wiki pages, so it should not be edited by hand.

Important: The contents of the sections below are embedded from other pages. Do not edit the section or this page directly; use the "Edit this list" link instead! (That opens the corresponding page for editing, and after editing, you'll be forwarded to a page containing only that list: don't worry, you didn't delete the others.)

Current projects

Currently active team projects you can get involved in.

Edit this list

Archive Team recruiting

Warrior-based projects

Currently running Warrior project: Telegram
  • DPReview: Amazon will be throwing this digital photography resource into the shredder on April 10, 2023. IRC Channel #dprived (on hackint).
  • Imgur: Unregistered users' "old" and "inactive" images will be purged, and all NSFW content is being shown the door on May 15, 2023. IRC Channel #imgone (on hackint).
  • Issuu: Interactive flipbook repository is clamping down on free users' upload limits and plans to make existing uploads falling foul of its new limits inaccessible to others. IRC Channel #wetakeissuu (on hackint).
  • Reddit: Banning communities that generate bad PR for Reddit Inc. Restricting access to APIs and data on June 19, 2023. Currently grabbing new material; grabbing past material is planned. IRC Channel #shreddit (on hackint).
  • Ukraine/Russian invasion: Archiving various .ua sites in the wake of the Russian government's invasion. IRC Channel #ucryne (on hackint).
  • Telegram: Archiving public messages in various newsworthy and/or otherwise notable Telegram channels. IRC Channel #telegrab (on hackint).
  • GitHub: Embraced-uh, I mean, bought by Microsoft. IRC Channel #gitgud (on hackint).
  • MediaFire: Not 'at risk', but being grabbed speculatively to save historic files. IRC Channel #mediaonfire (on hackint).
  • Google Drive: Same as MediaFire. IRC Channel #googlecrash (on hackint).
  • URLs: A random collection of stuff. IRC Channel #// (on hackint).
  • URLTeam: URL shorteners were a fucking awful idea. IRC Channel #urlteam (on hackint).

An updated Warrior virtual appliance (v3.2) is now available with better support for newer projects that utilize wget-at. Please download it using the link above.

Manual projects

  • 2019-202? coronavirus outbreak: Documenting and preserving data, events, and impacts of the virus on society. IRC Channel #coronarchive (on hackint)
  • ArchiveBot: For those with lots of disk space, bandwidth and long-term commitment. IRC Channel #archivebot (on hackint).
  • Codearchiver: Dumping and archival of source code repositories and associated version control systems. IRC Channel #codearchiver (on hackint).
  • WikiTeam: Saving wiki dumps (XML) and their external links for the Wayback Machine (WARC), as well as exporting MediaWiki databases. Permanent effort, everyone can help (you choose the size of your downloads). IRC Channel #wikiteam (on hackint).
  • Dead people: When people die, their webpages and/or social media might go "Poof!" due to fees and other knick-knack. IRC Channel #archiveteam (on hackint)

Upcoming & proposed projects

Recently finished projects

  • Enjin: community hosting platform that no longer hosts communities as of 2023-04-30. IRC Channel #enjinxed (on hackint).
  • Zippyshare.com: File sharing host opens its wallet, discovers it looks nearly empty, but will keep hosting until March 31, 2023. IRC Channel #zippyshart (on hackint).
  • Classic Google Sites: Making more sites inaccessible to the public starting January 30, 2023 (originally September 1, 2021) with Workspace accounts. IRC Channel #nearlylostmygoogles (on hackint).
  • TJ (aka TJournal): Russian news platform shutting down 2022-09-10 over Ukraine reporting. IRC Channel #journalthis (on hackint).

Hiatus / Missed the Mark

  • Webs: Vistaprint is killing off the Freewebs you knew from the 2000s sometime (previously announced for March 31, then June 30, 2021), unless you pay up. IRC Channel #webbed (on hackint).
  • Tinkercad: Autodesk announced its intent to put designs from inactive OAuth accounts back into minds around May 24, 2021. IRC Channel #tinkerhad (on hackint).
  • Angelfire: Angelfire is a web hosting service that contains big chunks of early WWW history and has no proper backup. IRC Channel #angelonfire (on hackint).
  • Audit 2014: It's time to verify our shit. IRC Channel #auditteam (on hackint). THIS PROJECT IS ON HIATUS AND WILL BE RETURNED TO AS AUDIT2018.
  • Flickr: SmugMug decided to kill it after finding Yahoo!'s plans to do so from before Yahoo! was bought by Verizon. IRC Channel #flickrfckr (on hackint).
  • FTP: Help us find and download all FTP sites! IRC Channel #effteepee (on hackint).
  • Google Groups: "Gone within a year" (SketchCow, 2016-06-07).
  • Google News Archive: Let's store all newspapers at Google, WCGW? IRC Channel #papersplease (on hackint).
  • DevPort: This portfolio SaaS provider has reportedly been having infrastructure issues, and removed their social media accounts. Possible impending shutdown.
  • INTERNETARCHIVE.BAK: Grab a slice of the big cake of The Archive! IRC Channel #internetarchive.bak (on hackint).
  • ISP Hosting: Finding ISP web hosting services before the Grim Reaper finds them. IRC Channel #webroasting (on hackint).
  • NewsGrabber: Saving all news articles. Currently paused. IRC Channel #newsgrabber (on hackint).
  • Project Newsletter: Archiving e-newsletters, currently in development. IRC Channel #projectnewsletter (on hackint).
  • Quizlet: Flashcards and other learning tools. IRC Channel #quizletusin (on hackint).
  • Tumblr: Yahoo! considered killing it, now Yahoo has been acquired and Verizon declared war on NSFW blogs. Tumblr has since been sold to Automattic. IRC Channel #tumbledown (on hackint).
  • yuku: Lately yuku has been very unstable, and it hosts thousands of forums. Project currently paused. IRC Channel #archiveteam-bs (on hackint).

ArchiveTeam uses the hackint IRC network – ircs://irc.hackint.org:6697 (TLS required) – webchat: https://webirc.hackint.org/
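
For anyone who would rather script a connection than use the webchat, the connection details above could be turned into client settings roughly as follows. This is a minimal sketch, not an official ArchiveTeam tool: the function names and the nickname `at-visitor` are made up for illustration, and only the URL and the TLS requirement come from this page. The NICK/USER/JOIN lines follow the standard IRC client registration sequence; the network may impose further requirements not covered here.

```python
import socket
import ssl
from urllib.parse import urlsplit

HACKINT_URL = "ircs://irc.hackint.org:6697"  # from this page; TLS required


def parse_irc_url(url):
    """Split an irc:// or ircs:// URL into (host, port, use_tls)."""
    parts = urlsplit(url)
    if parts.scheme not in ("irc", "ircs"):
        raise ValueError(f"not an IRC URL: {url!r}")
    use_tls = parts.scheme == "ircs"
    # Conventional default ports: 6697 with TLS, 6667 without.
    port = parts.port or (6697 if use_tls else 6667)
    return parts.hostname, port, use_tls


def registration_lines(nick):
    """Raw IRC lines that register a client under the given nickname."""
    return [f"NICK {nick}\r\n", f"USER {nick} 0 * :{nick}\r\n"]


def join_line(channel):
    """Raw IRC line that joins a channel, e.g. '#archiveteam'."""
    return f"JOIN {channel}\r\n"


def connect(url=HACKINT_URL, nick="at-visitor"):
    """Open the connection and start registration (needs network access).

    The nickname is a hypothetical placeholder. A real client must go on to
    answer PING messages and wait for the server's welcome reply before
    sending join_line(...).
    """
    host, port, use_tls = parse_irc_url(url)
    sock = socket.create_connection((host, port), timeout=30)
    if use_tls:
        context = ssl.create_default_context()  # verifies the server certificate
        sock = context.wrap_socket(sock, server_hostname=host)
    for line in registration_lines(nick):
        sock.sendall(line.encode("utf-8"))
    return sock
```

In practice an existing IRC client pointed at the same host and port is the simpler route; the sketch only makes explicit what "TLS required" on port 6697 means for a hand-rolled connection.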

Warrior projects

ArchiveTeam's past, current and future Warrior projects with details, in table form.

Edit this list

Project IRC channel Status Began Finished Result Archive Location
Fotoalbum (script-only) #lookatthisfotograph (on hackint) Active
Google Sites (script-only) #nearlylostmygoogles (on hackint) Active
Github (script-only) #gitgud (on hackint) Active
Bitbucket (Mercurial repositories) #kickthebucket (on hackint) In Development
Reddit #shreddit (on hackint) In Development
Pastebin #pastalavista (on hackint) Active May 30, 2020
Google+ #googleminus (on EFnet) (abandoned) Downloads Finished March 5, 2019 April 2, 2019 Qualified Success archive
Flickr #flickrfckr (on hackint) Active January 9, 2019 archive
Tumblr #tumbledown (on hackint) Archive Posted December 8, 2018 December 17, 2018 Qualified Success archive
NUjij Archive Posted August 25, 2016 Success archive
Yahoo! Answers #noanswers (on hackint) Archive Posted August 21, 2016 archive
Orkut #throatkut (on EFnet) (abandoned) Archive Posted August 6, 2016 archive
Portalgraphics.net Archive Posted July 23, 2016 July 27, 2016 Success archive
DNS History #greatlookup (on EFnet) (abandoned) Aborted July 4, 2016 August 22, 2016 Failure
THOMAS Archive Posted July 3, 2016 July 5, 2016 Qualified Success archive
Coursera #cursera (on EFnet) (abandoned) Archive Posted June 26, 2016 June 30, 2016 Success archive
Olympe Downloads Finished June 5, 2016 June 6, 2016 Qualified Success
ZippCast Archive Posted June 3, 2016 June 10, 2016 Qualified Success archive
Arto Archive Posted May 8, 2016 June 29, 2016 Success archive
Bayimg Archive Posted April 28, 2016 archive
PDF 2016 #pdflush (on EFnet) (abandoned) Active April 8, 2016 archive
Virgin Media #virginsacrifice (on EFnet) (abandoned) Downloads Finished March 30, 2016 April 28, 2016 Qualified Success
LiveJournal #recordedjournal (on EFnet) (abandoned) Active March 12, 2016
GameTrailers #unhitchedtrailer (on EFnet) (abandoned) Archive Posted February 9, 2016 February 18, 2016 Qualified Success archive
Fotolog.com #fotologout (on EFnet) (abandoned) Active February 8, 2016 archive
Friends Reunited #friendsununited (on EFnet) (abandoned) Archive Posted February 5, 2016 February 26, 2016 Qualified Success archive
myVIP (script-only) #byevip (on EFnet) (abandoned) Archive Posted January 24, 2016 August 30, 2016 Success archive
MusicBrainz (external links) Archive Posted January 8, 2016 January 9, 2016 Success archive
OldFriends Archive Posted December 29, 2015 January 20, 2016 Success archive
Google Code #googlecodeblue (on EFnet) (abandoned) Active December 18, 2015 archive
Docstoc #docstop (on EFnet) (abandoned) Archive Posted November 24, 2015 December 1, 2015 Qualified Success archive
FTP (script-only) #effteepee (on hackint) Active November 30, 2015 archive
aDrive #bdrive (on EFnet) (abandoned) Archive Posted November 15, 2015 November 16, 2015 Qualified Success archive
Telenor personal websites #nohome (on EFnet) (abandoned) Archive Posted October 29, 2015 October 31, 2015 Qualified Success archive
WikiTeam (WARC format) #wikiteam (on hackint) Active October 26, 2015 archive
Yuku Active October 25, 2015 archive
GameFront #grillfront (on EFnet) (abandoned) Archive Posted October 20, 2015 April 29, 2016 Success archive
RuTracker #rutrasher (on EFnet) (abandoned) Archive Posted October 5, 2015 May 31, 2016 Success archive
Thingiverse Archive Posted September 23, 2015 January 24, 2016 Success archive
Skillfeed #skillessfeed (on EFnet) (abandoned) Archive Posted September 14, 2015 September 20, 2015 Success archive
Blingee #tragedee (on EFnet) (abandoned) Archive Posted August 16, 2015 October 8, 2015 Qualified Success archive
Google Moderator #moderhater (on EFnet) (abandoned) Archive Posted July 21, 2015 July 22, 2015 Success archive
Toshiba Support #toshibah (on EFnet) (abandoned) Archive Posted June 24, 2015 July 5, 2015 Success archive
Xfire Social Website #xfired (on EFnet) (abandoned) Archive Posted June 19, 2015 July 9, 2015 Qualified Success archive
Zoocasa #zoohouse (on EFnet) (abandoned) Archive Posted June 18, 2015 June 25, 2015 Success archive
SourceForge #coldstorage (on EFnet) (abandoned) Aborted June 17, 2015 June 19, 2015
Pomf.se #pomfret (on EFnet) (abandoned) Archive Posted June 9, 2015 June 17, 2015 Success archive
Google Baraza #bonanza (on EFnet) (abandoned) Archive Posted April 28, 2015 May 7, 2015 Success archive
Google Helpouts #helpus (on EFnet) (abandoned) Archive Posted April 16, 2015 April 21, 2015 Success archive
LayerVault #layersalt (on EFnet) (abandoned) Archive Posted April 6, 2015 April 11, 2015 Success archive
FriendFeed #humancentifeed (on EFnet) (abandoned) Archive Posted April 2, 2015 April 9, 2015 Qualified Success archive
Last.fm #lastchance.fm (on EFnet) (abandoned) Archive Posted March 30, 2015 August 28, 2015 Qualified Success archive
FurAffinity #iceking (on EFnet) (abandoned) Archive Posted March 26, 2015 June 15, 2015 Success archive
Madden GIFERATOR #jiferator (on EFnet) (abandoned) Archive Posted March 21, 2015 March 23, 2015 Success archive
RapidShare #rapidscare (on EFnet) (abandoned) Archive Posted March 20, 2015 March 29, 2015 Qualified Success archive
Trovebox #treasuretrove (on EFnet) (abandoned) Archive Posted March 14, 2015 June 27, 2015 Success archive
Google Business Sitebuilder #sitebreaker (on EFnet) (abandoned) Archive Posted March 9, 2015 March 10, 2015 Success archive
Blogger #frogger (on EFnet) (abandoned) Aborted February 25, 2015 May 6, 2015
TestFlight #crashed (on EFnet) (abandoned) Archive Posted February 13, 2015 February 25, 2015 Success archive
Cobook #cookbook (on EFnet) (abandoned) Archive Posted February 9, 2015 February 11, 2015 Success archive
Ovi Store #downlovi (on EFnet) (abandoned) Archive Posted February 3, 2015 February 15, 2015 Qualified Success archive
Inkblazers #inkerasers (on EFnet) (abandoned) Archive Posted January 18, 2015 January 31, 2015 Success archive
Brace.io #braceyourself (on EFnet) (abandoned) Archive Posted January 12, 2015 January 18, 2015 Success archive
Vstreamers #destreamers (on EFnet) (abandoned) Archive Posted January 6, 2015 January 10, 2015 Success archive
Nokia Memories #backtorubber (on EFnet) (abandoned) Archive Posted December 30, 2014 December 30, 2014 Success archive
Microsoft Clip Art #clipfart (on EFnet) (abandoned) Archive Posted December 23, 2014 December 29, 2014 Success archive
Roon #rooined (on EFnet) (abandoned) Archive Posted December 20, 2014 December 21, 2014 Success archive
ZipList #zipyourlips (on EFnet) (abandoned) Archive Posted December 2, 2014 December 4, 2014 Success archive
Viddy #viddiot (on EFnet) (abandoned) Archive Posted December 2, 2014 December 15, 2014 Success archive
Halo (Halo 2 & 3 stuff) #yolohalo (on EFnet) (abandoned) Archive Posted November 6, 2014 June 23, 2015 Success archive
GameMaker Sandbox Archive Posted October 15, 2014 October 19, 2014 Success archive
Qwiki #quickie (on EFnet) (abandoned) Archive Posted September 28, 2014 November 1, 2014 Qualified Success archive
Quizilla #fizzilla (on EFnet) (abandoned) Archive Posted September 4, 2014 October 1, 2014 Success archive
Ancestry.com #ancienthistory (on EFnet) (abandoned) Archive Posted September 19, 2014 November 5, 2014 Success archive
TwitPic #quitpic (on EFnet) (abandoned) Archive Posted September 4, 2014 January 2, 2015 Qualified Success archive
Verizon Personal Web Space #verizoff (on EFnet) (abandoned) Archive Posted September 2, 2014 October 1, 2014 Qualified Success archive
Swipnet #swiped (on EFnet) (abandoned) Archive Posted August 19, 2014 September 1, 2014 Success archive
Canv.as #canvas (on EFnet) (abandoned) Archive Posted August 11, 2014 August 12, 2014 Success archive
Twitch.tv #burnthetwitch (on EFnet) (abandoned) Archive Posted August 9, 2014 August 24, 2014 Qualified Success archive
Fotopedia #fotofinished (on EFnet) (abandoned) Archive Posted August 5, 2014 August 7, 2014 Success archive
Yahoo! Voices #shutup (on EFnet) (abandoned) Archive Posted July 28, 2014 July 31, 2014 Success archive
Justin.tv #justouttv (on EFnet) (abandoned) Archive Posted June 5, 2014 June 15, 2014 Success archive
Viddler #fiddler (on EFnet) (abandoned) Cancelled February 21, 2014 February 27, 2014 Qualified Success archive
Bebo #cockandballs (on EFnet) (abandoned) Hiatus February 18, 2014 archive
My Opera #fatlady (on EFnet) (abandoned) Archive Posted February 16, 2014 March 3, 2014 Success archive
Dogster #rawdogster (on EFnet) (abandoned) Archive Posted February 7, 2014 February 16, 2014 Success archive
Wretch & Yahoo! Blog #shipwretched (on EFnet) (abandoned) Archive Posted December 17, 2013 January 9, 2014 Qualified Success archives: Wretch, Yahoo Blog
Hyves #angerthehyve (on EFnet) (abandoned) Archive Posted November 10, 2013 December 2, 2013 Success archive
Blip.tv #blooper.tv (on EFnet) (abandoned) Archive Posted October 11, 2013 August 27, 2015 Qualified Success archive 1 archive 2
Zapd #crapd (on EFnet) (abandoned) Archive Posted October 1, 2013 October 8, 2013 Success archive
Xanga #jenga (on EFnet) (abandoned) Downloads Paused June 21, 2013 August 31, 2013 archive
Streetfiles.org #streetsoffire (on EFnet) (abandoned) Archive Posted April 28, 2013 April 30, 2013 Qualified Success archive
Yahoo! Upcoming #outgong (on EFnet) (abandoned) Archive Posted April 20, 2013 April 25, 2013 archive
Formspring #firespring (on EFnet) (abandoned) Archive Posted March 24, 2013 September 19, 2013 Success archive
Yahoo! Messages #BurnTheMessenger (on EFnet) (abandoned) Archive Posted March 20, 2013 March 31, 2013 archive
Storylane Archive Posted March 8, 2013 March 15, 2013 archive
Posterous #preposterous (on EFnet) (abandoned) Archive Posted February 23, 2013 June 29, 2013 archive
Xanga #jenga (on EFnet) (abandoned) Downloads Paused January 22, 2013 February 16, 2013 archive, user lookup, user list
Punchfork Archive Posted January 11, 2013 March 6, 2013 archive, user lookup
URLTeam #urlteam (on hackint) Active all releases
weblog.nl Archive Posted January 19, 2013 February 2, 2013 archive, user lookup
Yahoo! Blog #yahooblah (on EFnet) (abandoned) Archive Posted January 8, 2013 January 19, 2013 archive
GitHub Downloads Archive Posted December 13, 2012 December 17, 2012 Success archive, index
Daily Booth Archive Posted November 19, 2012 December 29, 2012 archive, user lookup
BT Internet Archive Posted October 10, 2012 November 2, 2012 Success archive
Webshots #webshots (on EFnet) (abandoned) Archive Posted October 4, 2012 November 18, 2012 archive, user lookup
City of Heroes Archive Posted September 3, 2012 December 1, 2012 Success archive
Cinch.FM Archive Posted August 20, 2012 August 22, 2012 Success archive
Tumblr (test project) Archive Posted August 9, 2012 August 19, 2012 archive (tar), archive (warc)
Picplz Archive Posted June 3, 2012 June 15, 2012 archive, user lookup, index
Tabblo Archive Posted May 23, 2012 May 26, 2012 Success archive, user lookup
FortuneCity #fortuneshitty (on EFnet) (abandoned) Archive Posted April 4, 2012 April 11, 2012 Qualified Success archive, user lookup
MobileMe Archive Posted April 3, 2012 Aug 8, 2012 Success archive, user lookup, index

Status

In Development
a future project
Active
start up a Warrior and join the fun; this one is in progress right now
Active (paused)
not running currently but stay tuned!
On Hold
project suspended indefinitely but not given up
Downloads Finished
we've finished downloading the data
Archived
the collected data has been properly archived
Archive Posted
the archive is available for download

Result

Success
downloaded all of the data and posted the archive publicly
Qualified Success
either we couldn't get all of the data, or the archive can't be made public
Failure
the site closed before we could download anything

Manual projects

Difficult, discussion-intensive, human-resource-intensive and audit projects.

Edit this list

Project IRC channel Description Status Started Finished Archives/Results
Yahoogroups-joiner #yahoosucks (on hackint) Filling out captchas to archive Yahoo Groups Active 2019-10-19 leaderboard
Project Newsletter #projectnewsletter (on hackint) Archiving all the email newsletters Active 2015-03-27
Woohoo #woohoo (on EFnet) (abandoned) Doing a census of all of Yahoo!'s products Active 2015-03-13 result
Froogle #froogle (on EFnet) (abandoned) Doing a census of all of Google's products Active 2015-03-13 result
INTERNETARCHIVE.BAK #internetarchive.bak (on hackint) Backing up the Internet Archive Active 2015-03-02 stats
ISP Hosting #webroasting (on hackint) Finding ISP web hosting services before the Grim Reaper finds them. Active 2014-12-30 see there
Project Valhalla #huntinggrounds (on hackint) Discussing where and how to store archives that are too big for the Internet Archive at the moment. Active 2014-09-18 see there
Audit2014 #auditteam (on hackint) We've uploaded a bunch of stuff. Let's go through the list and make sure it's categorized, has decent metadata, etc. Active 2014-07-16 list, the content
ArchiveBot #archivebot (on hackint) IRC bot designed to automate the archival of smaller websites Active 2013-09-06 archives, search
AOL #aohell (on hackint) Archiving the original AOL, not AOL's current website Active 2013-01-28 [1]
WikiTeam #wikiteam (on hackint) Exporting Mediawiki databases in XML dumps Active 2011-04-05 [2]
FTP #effteepee (on hackint) Downloading all the FTP sites Active e.g. [3]

Small projects

List of smaller website rescuing projects, usually done by single individuals.

Edit this list

See also what's been crawled by ArchiveBot: browse here.

For Hungarian websites, see bzc6p's userpage.

You should also try searching http://archive.org with the keyword archiveteam, or browse directly in the Wayback Machine.

Website Site status Closure date Archiving status Archived by Started Finished Archives
Wikispot Closed 2015-07-27 Partially saved bzc6p 2015-06-30 2015-07-31 [4]
Pastebin Online In progress... joepie91 2014-09-09
TechNet Closing 2014-03-28 Partially saved Arkiver, Mithrandir, Darkstar
Widgetbox Closed 2014-09-30 Saved Arkiver 2013-12-19
Quick.io Closed 2013-12-31 Saved Arkiver 2013-12-13 2013-12-13
winamp.com Saved 2013-11 2013-11 [5]

Early projects

List of ArchiveTeam's early endeavours, kept for historical interest and no longer edited.

Edit this list

Historical content

This page or section is not really edited any more, probably because the project got abandoned, information is collected somewhere else in a different form etc.

However, this is a good and important record of ArchiveTeam's ancient times and must therefore be preserved, but merging it into another article would be difficult and/or some pieces of information are missing for a new form.

So feel free to read this, but there is probably nothing to add to it now. However, if you resurrect the project or find a way to move this data to a fresh place, you can remove this template.


Look at the Archive Team Collection at the Internet Archive too.

Some archives available for downloading, by Archive Team or by other volunteers or groups.



Available for download

Title/Download link Description Size
Geocities - The PATCHED Torrent (IA) The popular web hosting service founded in 1994. It was closed by Yahoo! in 2009 641.4 GB
URL Shortener Backup Torrent v4 URLTeam compressed backups of various URL shorteners (README) 75 GB
URL Shortener Backup Torrent v3 outdated, use v4 URLTeam compressed backups of various URL shorteners (README) 50 GB
URL Shortener Backup Torrent v2 outdated, use v4 URLTeam compressed backups of various URL shorteners (README) 48 GB
URL Shortener Backup Torrent v1 outdated, use v4 URLTeam compressed backups of various URL shorteners (README) 41.1 GB
Papers from Philosophical Transactions of the Royal Society This archive contains 18,592 scientific publications totaling 33GiB, all from Philosophical Transactions of the Royal Society and which should be available to everyone at no cost, but most have previously only been made available at high prices through paywall gatekeepers like JSTOR. 32.48 GB
The May 2011 Calufa Twitter Scrape 90+ million tweets from more than 6 million users 14.9 GB
Internet Gopher Archive 2007 (IA) Archive of gopher sites 14.8 GB
Encyclopedia Dramatica January 2010 Mirror lulz 11.7 GB
The TEXTFILES.COM Time Capsule This collection comprises all the major text-based sets of the TEXTFILES.COM site 11 GB
Salon Table Talk Threads of this talk site +6.0 GB
Usenet Archive of UTZOO Tapes Collection of .TGZ files of very early USENET posted data 2.0 GB
Quux.org Gopher Mirror Collection 2006 (IA) This is a collection of mirrors maintained by gopher.quux.org. These mirrors were taken offline in 2006 due to bandwidth constraints 1.5 GB
full-history-linux.git.tar GIT repository of Linux Kernel from 1991 to 2010 (details) 594 MB
Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape Almost 10 million tweets 425 MB
The 2010 Reddit Research Project Dataset on affinities of 60,000+ Reddit users, recorded in 2010 ~360 MB
Archive Team Starwars.Yahoo.Com Panic Download This is a panic download of the starwars.yahoo.com forums and profiles, done before the closure of same by Yahoo on December 15, 2009. This includes as many messages, profiles, and pages related to the site as could be easily brought in. ~250 MB
Social Structure of Facebook Networks Facebook Data Scrape Facebook data scrape related to paper "The Social Structure of Facebook Networks", by Amanda L. Traud, Peter J. Mucha, Mason A. Porter 197 MB
Archive Team's Etherpad Time Capsule This archive contains roughly 6,400 Etherpads, in their final state 125 MB
WikiTeam archives Archives about wikis. See WikiTeam +100 MB
Archive Team Archive Team.org Site Rip from August 03, 2011 75 MB
Boing Boing Posts Archive (2000-2011) Two collections of Boing Boing postings provided by the cultural website boingboing.net on its 5th and 11th anniversaries 42 MB
Archive Team Quotes Database Backup Amusing snatches of conversation from IRC and other online gathering places 5 MB
Mirror of Revelation Passage Series Website wget of a small author's website. ~500 KB
Archive Team Powerblogs Shutdown Snapshot This is a 108-blog snapshot of the final month of Powerblogs, before their shutdown ?
BBC Closing Panic Archives Some BBC sites ?
stillflying.net A Firefly fan fiction site that made PDF scripts for the rest of season 1 and for season 2, imagining what would have been if Firefly hadn't been canceled. 408.1 MB
Google Reader Text for 46M feeds, per-feed statistics, Reader Directory search results ~8800 GB
Earbits Website, ~130,000 MP3s and metadata. ~650 GB
SciMag 38 million scientific articles ~28 TB
Google Video
Yahoo! Video

Archived but not available




The following three sections have been moved here without modification from the old Projects page.

Finished projects

This is a list of completed projects which do not have their own page on this wiki.

See Category:Rescued Sites for projects which do have their own page on this wiki.


  • (mirror | 4.5MB archive) The infoAnarchy wiki was archived by Scott.
    • infoAnarchy was down for several months in the first part of 2011, but is back up as of May 2011. There is now very little content updating on the site. As of 2014-06-02, infoAnarchy has a "Revive infoanarchy.org blog & wiki" notice and a request for donations, suggesting it may not have a future. As of 2014-06-02, a "database is locked" message will be given to logged-in users.
    • If there are future updates to that archive, they may be found at http://sdboyd56.com/archives/
    • FIXME - This archive has non-relative links, requiring it to be in /infoanarchy. It needs to be redone or edited to have relative links.
    • FIXME - This archive does not include the complete history, which is absolutely essential in this case, as significant editing history exists.
  • (mirror) The Cyberpunk Project was archived by Scott
    • Note that this wiki does not allow the Russian TLD, so the URL will have to be edited to be visited.
    • Most pages haven't been changed since 2007. It hasn't been updated or changed since April 2010.
    • FIXME - this mirror is incomplete, or its links are pointing to the live website.
  • (archive) Kasabi's data was retrieved and uploaded to archive.org by Edsu.
  • (archive) FoxyTunes was archived by Start
    • (it's less than 1MB!)
  • (archive) Emulation Zone was archived by Start
    • FIXME - vgaa.emulationzone.org-2014-0708.warc.gz got interrupted by a crash and needs to be re-archived

Other projects

Dead projects



Some more

You'll find traces of some other old projects on the historical IRC channel list: IRC/Old.




Archive Team
Current events

Alive... OR ARE THEY · Deathwatch · Projects

Archiving projects

APKMirror · Archive.is · BetaArchive · Government Backup (#datarefuge · ftp-gov) · Gmane · Internet Archive · It Died · Megalodon.jp · OldApps.com · OldVersion.com · OSBetaArchive · TEXTFILES.COM · The Dead, the Dying & The Damned · The Mail Archive · UK Web Archive · WebCite · Vaporwave.me

Blogging

Blog.pl · Blogger · Blogster · Blogter.hu · Freeblog.hu · Fuelmyblog · Jux · LINE BLOG · LiveJournal · My Opera · Nolblog.hu · Open Diary · ownlog.com · Posterous · Powerblogs · Proust · Roon · Splinder · Tumblr · Vox · Weblog.nl · Windows Live Spaces · Wordpress.com · Xanga · Yahoo! Blog · Zapd

Cloud hosting/file sharing

aDrive · AnyHub · Box · Dropbox · Docstoc · Fast.io · Google Drive · Google Groups Files · iCloud · Fileplanet · LayerVault · MediaCrush · MediaFire · Mega · MegaUpload · MobileMe · OneDrive · Pomf.se · RapidShare · Ubuntu One · Yahoo! Briefcase

Corporations

Apple · IBM · Google · Loblaw · Lycos Europe · Microsoft · Yahoo!

Events

Arab Spring · Great Ape-Snake War · Spanish Revolution

Font Repos

DaFont · Google Web Fonts · GNU FreeFont · Fontspace

Forums/Message boards

4chan · Captain Luffy Forums · College Confidential · Discourse · DSLReports · ESPN Forums · Facepunch Forums · forums.starwars.com · HeavenGames · JamiiForums · Invisionfree · NeoGAF · Textream · The Classic Horror Film Board · Yahoo! Messages · Yahoo! Neighbors · Yuku.com · Zetaboards

Gaming

Atomicgamer · Bazaar.tf · City of Heroes · Club Nintendo · Clutch · Counter-Strike: Global Offensive · CS:GO Lounge · Desura · Dota 2 · Dota 2 Lounge · Emulation Zone · ESEA · GameBanana · GameMaker Sandbox · GameTrailers · Halo · Heroes of Newerth · HLTV.org · HQ Trivia · Infinite Crisis · joinDOTA · League of Legends · Liquipedia · Minecraft.net · Player.me · Playfire · Raptr · SingStar · Steam · SteamDB · SteamGridDB · Team Fortress 2 · TF2 Outpost · Warhammer · Xfire

Image hosting

500px · AOL Pictures · Blipfoto · Blingee · Canv.as · Camera+ · Cameroid · DailyBooth · Degree Confluence Project · DeviantART · Demotivalo.net · Flickr · Fotoalbum.hu · Fotolog.com · Fotopedia · Frontback · Geograph Britain and Ireland · Giphy · GTF Képhost · ImageShack · Imgh.us · Imgur · Inkblazers · Instagram · Kepfeltoltes.hu · Kephost.com · Kephost.hu · Kepkezelo.com · Keptarad.hu · Madden GIFERATOR · MLKSHK · Microsoft Clip Art · Microsoft Photosynth · Nokia Memories · noob.hu · Odysee · Panoramio · Photobucket · Picasa · Picplz · Pixiv · Portalgraphics.net · PSharing · Ptch · puu.sh · Rawporter · Relay.im · ScreenshotsDatabase.com · Sketch · Smack Jeeves · Snapjoy · Streetfiles · Tabblo · Tinypic · Trovebox · TwitPic · Wallbase · Wallhaven · Webshots · Wikimedia Commons

Knowledge/Wikis

arXiv · Citizendium · Clipboard.com · Deletionpedia · EditThis · Encyclopedia Dramatica · Etherpad · Everything2 · infoAnarchy · GeoNames · GNUPedia · Google Books (Google Books Ngram) · Horror Movie Database · Insurgency Wiki · Knol · Lost Media Wiki · Neoseeker.com · Notepad.cc · Nupedia · OpenCourseWare · OpenStreetMap · Orain · Pastebin · Patch.com · Project Gutenberg · Puella Magi · Referata · Resedagboken · SongMeanings · ShoutWiki · The Internet Movie Database · TropicalWikis · Uncyclopedia · Urban Dictionary · Urban Exploration Resource · Webmonkey · Wikia · Wikidot · WikiHow · Wikkii · WikiLeaks · Wikipedia (Simple English Wikipedia) · Wikispaces · Wikispot · Wik.is · Wiki-Site · WikiTravel · Word Count Journal

Magazines/Blogs/News

Cyberpunkreview.com · Game Developer Magazine · Gigaom · Hardware Canucks · Helium · JPG Magazine · Make Magazine · The Escapist · Polygamia.pl · San Francisco Bay Guardian · Scoop · Regretsy · Yahoo! Voices

Microblogging

Heello · Identi.ca · Jaiku · Mommo.hu · Plurk · Sina Weibo · Tencent Weibo · Twitter · TwitLonger

Music/Audio

8tracks · AOL Music · Audimated.com · Cinch · digCCmixter · Dogmazic.net · Earbits · exfm · Free Music Archive · Gogoyoko · Indaba Music · Instacast · Instaudio · Jamendo · Last.fm · Music Unlimited · MOG · PureVolume · Reverbnation · ShareTheMusic · SoundCloud · Soundpedia · Spotify · This Is My Jam · TuneWiki · Twaud.io · WinAmp

People

Aaron Swartz · Michael S. Hart · Steve Jobs · Mark Pilgrim · Dennis Ritchie · Len Sassaman Project

Protocols/Infrastructure

FTP · Gopher · IRC · Usenet · World Wide Web · BitTorrent DHT

Q&A

Askville · Answerbag · Answers.com · Ask.com · Askalo · Baidu Knows · Blurtit · ChaCha · Experts Exchange · Formspring · GirlsAskGuys · Google Answers · Google Baraza · JustAnswer · MetaFilter · Quora · Retrospring · StackExchange · The AnswerBank · The Internet Oracle · Uclue · WikiAnswers · Yahoo! Answers

Recipes/Food

Allrecipes · Epicurious · Food.com · Foodily · Food Network · Punchfork · ZipList

Social bookmarking

Addinto · Backflip · Balatarin · BibSonomy · Bkmrx · Blinklist · BlogMarks · BookmarkSync · CiteULike · Connotea · Delicious · Designer News · Digg · Diigo · Dir.eccion.es · Evernote · Excite Bookmark · Faves · Favilous · folkd · Freelish · Getboo · GiveALink.org · Gnolia · Google Bookmarks · Hacker News · HeyStaks · IndianPad · Kippt · Knowledge Plaza · Licorize · Linkwad · Menéame · Microsoft Developer Network · myVIP · Mister Wong · My Web · Mylink Vault · Newsvine · Oneview · Pearltrees · Pinboard · Pocket · Propeller.com · Reddit · sabros.us · Scloog · Scuttle · Simpy · SiteBar · Slashdot · Squidoo · StumbleUpon · Twine · Voat · Vizited · Yummymarks · Xmarks · Yahoo! Buzz · Zootool · Zotero

Social networks

Bebo · BlackPlanet · Classmates.com · Cyworld · Dogster · Dopplr · douban · Ello · Facebook · Flixster · FriendFeed · Friendster · Friends Reunited · Gaia Online · Google+ · Habbo · hi5 · Hyves · iWiW · LinkedIn · Miiverse · mixi · MyHeritage · MyLife · Myspace · myVIP · Netlog · Odnoklassniki · Orkut · Plaxo · Qzone · Renren · Skyrock · Sonico.com · Storylane · Tagged · tvtag · Upcoming · Viadeo · Vine · VK · WeeWorld · Weibo · Wretch · Xuite · Yahoo! Groups · Yahoo! Stars India · Yahoo! Upcoming · more sites...

Shopping/Retail

Alibaba · AliExpress · Amazon · Apple Store · Barnes & Noble · DirectCanada · eBay · Kmart · NCIX · Printfection · RadioShack · Sears · Sears Canada · Target · The Book Depository · ThinkGeek · Toys "R" Us · Walmart

Software/code hosting

Android Development · Alioth · Assembla · BerliOS · Betavine · Bitbucket · BountySource · Codecademy · CodePlex · Freepository · Free Software Foundation · GNU Savannah · GitHost · GitHub · GitHub Downloads · Gitorious · Gna! · Google Code · ibiblio · java.net · JavaForge · KnowledgeForge · Launchpad · LuaForge · Maemo · mozdev · OSOR.eu · OW2 Consortium · Openmoko · OpenSolaris · Ourproject.org · Ovi Store · Project Kenai · RubyForge · SEUL.org · SourceForge · Stypi · TestFlight · tigris.org · Transifex · TuxFamily · Yahoo! Downloads

Television/Radio

ABC · Austin City Limits · BBC · CBC · CBS · Computer Chronicles · CTV · Fox · G4 · Global TV · Jeopardy! · NBC · NHK · PBS · Penn & Teller: Bullshit! · The Howard Stern Show · TV News Archive (Understanding 9/11)

Torrenting/Piracy

ExtraTorrent · EZTV · isoHunt · KickassTorrents · The Pirate Bay · Torrentz · Library Genesis

Video hosting

Academic Earth · Bambuser · Blip.tv · Epic · Freshlive · Google Video · Justin.tv · Mixer · Niconico · Nokia Trailers · Oddshot.tv · Periscope · Plays.tv · Qwiki · Skillfeed · Stickam · TED Talks · Ticker.tv · Twitch.tv · Ustream · Videoplayer.hu · Viddler · Viddy · Vidme · Vimeo · Vine · Vstreamers · Yahoo! Video · YouTube · Famous Internet videos (Me at the zoo)

Web hosting

Angelfire · Brace.io · BT Internet · CableAmerica Personal Web Space · Claranet Netherlands Personal Web Pages · Comcast Personal Web Pages · Extra.hu · FortuneCity · Free ProHosting · GeoCities (patch) · Google Business Sitebuilder · Google Sites · Internet Centrum · MBinternet · MSN TV · Nifty · Nwnyet · Parodius Networking · Prodigy.net · Saunalahti Iso G · Swipnet · Telenor · Tripod · University of Michigan personal webpages · Verizon Mysite · Verizon Personal Web Space · Webs · Webzdarma · Virgin Media

Web applications

Mailman · MediaWiki · phpBB · Simple Machines Forum · vBulletin

Information

A Million Ways to Die on the Web · Backup Tips · Cheap storage · Collecting items randomly · Data compression algorithms and tools · Dev · Discovery Data · DOS Floppies · Fortress of Solitude · Keywords · Naughty List · Nightmare Projects · Rescuing floppy disks · Rescuing optical media · Site exploration · The WARC Ecosystem · Working with ARCHIVE.ORG

Projects

ArchiveCorps · Audit2014 · Emularity · Faceoff · FlickrFckr · Froogle · INTERNETARCHIVE.BAK (Internet Archive Census) · IRC Quotes · JSMESS · JSVLC · Just Solve the Problem · NewsGrabber · Project Newsletter · Valhalla · Web Roasting (ISP Hosting · University Web Hosting) · Woohoo

Tools

ArchiveBot · ArchiveTeam Warrior (Tracker) · Google Takeout · HTTrack · Video downloaders · Wget (Lua · WARC)

Teams

Bibliotheca Anonoma · LibreTeam · URLTeam · Yahoo Video Warroom · WikiTeam

Other

800notes · AOL · Akoha · Ancestry.com · April Fools' Day · Amplicate · AutoAdmit · Bre.ad · Circavie · Cobook · Co.mments · Countdown · Discourse · Distill · Dmoz · Easel · Eircode · Electronic Frontier Foundation · FanFiction.Net · Feedly · Ficlets · Forrst · FunnyExam.com · FurAffinity · Google Helpouts · Google Moderator · Google Poly · Google Reader · ICQmail · IFTTT · Jajah · JuniorNet · Lulu Poetry · Mobile Phone Applications · Mochi Media · Mozilla Firefox · MyBlogLog · NBII · Newgrounds · Neopets · Quantcast · Quizilla · Salon Table Talk · Shutdownify · Slidecast · Stack Overflow · SOPA blackout pages · starwars.yahoo.com · TechNet · Toshiba Support · USA-Gov · Volán · Widgetbox · Windows Technical Preview · Wunderlist · YTMND · Zoocasa

About Archive Team

Introduction · Philosophy · Who We Are · Our stance on robots.txt · Why Back Up? · Software · Formats · Storage Media · Recommended Reading · Films and documentaries about archiving · Talks · In The Media · FAQ