Difference between revisions of "Early projects"

From Archiveteam
Edit summary: +table talk (minor edit)
Line 19:
  | [http://www.archive.org/details/textfiles-dot-com-2011 The TEXTFILES.COM Time Capsule] || This collection comprises all the major text-based sets of the [[TEXTFILES.COM]] site || 11 GB
  |-
− | [http://www.archive.org/details/archiveteam-tabletalk-panic the archives Salon Table Talk] || Threads of this talk site || +6.0 GB
+ | [http://www.archive.org/details/archiveteam-tabletalk-panic Salon Table Talk] || Threads of this talk site || +6.0 GB
  |-
  | [http://www.archive.org/details/utzoo-wiseman-usenet-archive Usenet Archive of UTZOO Tapes] || Collection of .TGZ files of very early USENET posted data || 2.0 GB

Revision as of 20:11, 6 July 2011

Some archives available for download, made by Archive Team or by other volunteers or groups. Sorted by size, largest first.

Available for download

{|
! Title/Download link !! Description !! Size
|-
| Geocities - The PATCHED Torrent (IA) || The popular web hosting service founded in 1994. It was closed by Yahoo! in 2009 || 641.4 GB
|-
| The May 2011 Calufa Twitter Scrape || 90+ million tweets from more than 6 million users || 14.9 GB
|-
| Internet Gopher Archive 2007 (IA) || Archive of gopher sites || 14.8 GB
|-
| Encyclopedia Dramatica January 2010 Mirror || lulz || 11.7 GB
|-
| The TEXTFILES.COM Time Capsule || This collection comprises all the major text-based sets of the TEXTFILES.COM site || 11 GB
|-
| Salon Table Talk || Threads of this talk site || +6.0 GB
|-
| Usenet Archive of UTZOO Tapes || Collection of .TGZ files of very early USENET posted data || 2.0 GB
|-
| Quux.org Gopher Mirror Collection 2006 (IA) || This is a collection of mirrors maintained by gopher.quux.org. These mirrors were taken offline in 2006 due to bandwidth constraints || 1.5 GB
|-
| Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape || Almost 10 million tweets || 425 MB
|-
| The 2010 Reddit Research Project || Dataset on affinities of 60,000+ Reddit users, recorded in 2010 || ~360 MB
|-
| Archive Team Starwars.Yahoo.Com Panic Download || This is a panic download of the starwars.yahoo.com forums and profiles, done before the closure of same by Yahoo on December 15, 2009. This includes as many messages, profiles, and pages related to the site as could be easily brought in. || ~250 MB
|-
| Social Structure of Facebook Networks Facebook Data Scrape || Facebook data scrape related to paper "The Social Structure of Facebook Networks", by Amanda L. Traud, Peter J. Mucha, Mason A. Porter || 197 MB
|-
| Archive Team's Etherpad Time Capsule || This archive contains roughly 6,400 Etherpads, in their final state || 125 MB
|-
| WikiTeam archives || Archives about wikis. See WikiTeam || +100 MB
|-
| Boing Boing Posts Archive (2000-2011) || Two collections of Boing Boing postings provided by the cultural website boingboing.net on its 5th and 11th anniversaries || 42 MB
|-
| Archive Team Quotes Database Backup || Amusing snatches of conversation from IRC and other online gathering places || 5 MB
|-
| Archive Team Powerblogs Shutdown Snapshot || This is a 108-blog snapshot of the final month of Powerblogs, before their shutdown || ?
|-
| BBC Closing Panic Archives || Some BBC sites || ?
|-
| Total size || || ~692 GB
|}

Archived but not available

See also

External links