Difference between revisions of "Main Page/Current Projects"

From Archiveteam
Jump to navigation Jump to search
m
 
(607 intermediate revisions by 55 users not shown)
Line 1: Line 1:
__NOTOC__
__NOTOC__
== Archive Team recruiting ==
== Archive Team recruiting ==
* [[Dev|Want to code for Archive Team? Here's a starting point.]]
* Help us: '''[[ArchiveTeam_Warrior|☞ Download and run your warrior ☜]]'''. What's on: [https://tracker.archiveteam.org/ online tracker].
* '''[[Donate]]''' to keep our projects going.
* Anything shutting down? Put it on the '''[[Deathwatch]]''' or tell us on '''[[Archiveteam:IRC|IRC]]'''!
* Want to code for Archive Team? [[Dev|Here's a starting point.]]


== Warrior based projects ==
== Warrior-based projects ==
* [[Blingee]]: >130 million images and 9 years of user content <strike>will be deleted on August 25, 2015.</strike> '''IRC Channel {{IRC|tragedee}}'''.
{{:CurrentWarriorProject}}
* [[Last.fm]]: Switching codebases in the first two weeks of April, some user-generated content might be lost. '''IRC Channel {{IRC|lastchance.fm}}'''.
 
* [[SourceForge]]: Old, ad supported, adware supported. '''IRC Channel {{IRC|coldstorage}}'''.
=== Short-term, urgent projects ===
<!-- Projects with strong deadline, deadline is in the future -->
<!-- sorted by deadline (soonest on top) -->
* [[Tistory]]: Will delete inactive blogs on {{datetime|2025-09-22}} '''IRC Channel {{IRC|tatteredstory}}'''
 
=== Medium-term projects ===
<!-- Projects for which the deadline has passed, deadline is unclear, but there is a moment they are "finished" -->
<!-- sorted alphabetically -->
* [[Meta Ad Library]]: Database for advertisements for Facebook and other products by Meta. '''IRC Channel {{IRC|fads}}'''
* [[Peing]]: A Japanese question/answer service, was slated to be shutdown on {{datetime|2025-08-29}}. '''IRC Channel {{IRC|peingpong}}'''
* [[US Government]]: Archiving the US government. '''IRC Channel {{IRC|UncleSamsArchive}}'''
** [[Radio Free Asia]]: Non-profit media organization owned by USAGM.
** [[Radio Free Europe|Radio Free Europe/Radio Liberty]]: Non-profit media organization owned by USAGM.
** [[Voice of America]]: An internationally-broadcasting state media network at risk of closure.
 
=== Long-term projects ===
<!-- Ongoing projects. No deadline, no moment of "finishing" -->
<!-- sorted alphabetically -->
* [[Telegram]]: Archiving public messages in various newsworthy and/or otherwise notable Telegram channels. '''IRC Channel {{IRC|telegrab}}'''.
* [[Twitch]]: Archiving metadata and select videos. '''IRC Channel {{IRC|burnthetwitch}}'''
* [[URLTeam]]: URL shorteners were a fucking awful idea. '''IRC Channel {{IRC|urlteam}}'''.
* [[URLTeam]]: URL shorteners were a fucking awful idea. '''IRC Channel {{IRC|urlteam}}'''.
* [[Halo]]: Downloading all the read-only data from Halo. '''IRC Channel {{IRC|yolohalo}}'''. (Rate-limited, generally has enough volunteers.)
* [[URLs]]: A random collection of stuff. '''IRC Channel {{IRC|//}}'''.
* [[Blogger]]: Google was to restrict public access to adult blogs, but they backed down. We're backing up Blogger anyway. '''IRC Channel {{IRC|frogger}}'''. (Content grab coming soon.)
* [[YouTube]]: Archiving [[YouTube#Scope|selected videos]]. '''IRC Channel {{IRC|down-the-tube}}'''.


Help us: '''[[warrior|☞ Download and run your warrior ☜]]'''.<br>
=== Long-term, slower-paced projects ===
What's on: [http://tracker.archiveteam.org/ online tracker].<br>
These are projects that are actively running but generally only have small numbers of items available to complete at a time.
Combined project activity graphs [http://zeppelin.xrtc.net/corp.xrtc.net/shilling.corp.xrtc.net/project_items.html here].
<!-- sorted alphabetically -->
* [[Blogger]]: Grabbing inactive Blogger blogs since Google began a mass purge of inactive Google accounts on or after {{datetime|2023-12-01}}. '''IRC Channel {{IRC|frogger}}'''.
* [[GitHub]]: Embraced-uh, I mean, bought by Microsoft. '''IRC Channel {{IRC|gitgud}}'''.
* [[Imgur]]: Unregistered users' "old" and "inactive" images will be purged, and all NSFW content is being shown the door on {{datetime|2023-05-15}}. '''IRC Channel {{IRC|imgone}}'''.
* [[MediaFire]]: [https://twitter.com/textfiles/status/1349516443654758401 Not 'at-risk' but grabbing speculatively to save historic files] '''IRC Channel {{IRC|mediaonfire}}'''.
* [[Microsoft Update]]: Removal of legacy Windows drivers announced. '''IRC Channel {{IRC|windowfixer}}'''
* [[Pastebin]]: Archiving the pastas. '''IRC Channel {{IRC|pastalavista}}'''.


== Manual projects ==
== Manual projects ==
* [[AOL]]: Climbing into the decaying walled garden. '''IRC Channel {{IRC|aohell}}'''
* [[ArchiveBot]]: For those with lots of disk space, bandwidth and long-term commitment. '''IRC Channel {{IRC|archivebot}}'''.
* [[ArchiveBot]]: For those with lots of disk space, bandwidth and long-term commitment. '''IRC Channel {{IRC|archivebot}}'''.
* [[Audit2014|Audit 2014]]: It's time to verify our shit. '''IRC Channel {{IRC|auditteam}}'''.
* [[Codearchiver]]: Dumping and archival of source code repositories and associated version control systems. '''IRC Channel {{IRC|codearchiver}}'''.
* [[FTP]]: Download all the FTP sites! '''IRC Channel {{IRC|effteepee}}'''.
* Dead people: When people die, their webpages and/or social media might go "Poof!" due to fees and other knick-knack. '''IRC Channel {{IRC|archiveteam}}'''
* [[Froogle]]: Let's do a census of all of Google's products. '''IRC Channel {{IRC|froogle}}'''.
* [[Wikibot]] and [[WikiTeam]]: Saving wikis dumps (XML). And their external links for the Wayback Machine (WARC) as well as exporting MediaWiki databases. Permanent effort, [https://github.com/WikiTeam/wikiteam/wiki/Tutorial#I_have_no_shell_access_to_server everyone can help] (you choose the size of your downloads). '''IRC Channels {{IRC|wikibot}} {{IRC|wikiteam}}'''.
* [[INTERNETARCHIVE.BAK]]: How do we archive an archive? '''IRC Channel {{IRC|internetarchive.bak}}'''.
* [[Formats|File Formats]] and [[Just Solve the Problem 2012|Just Solve]]: Let's Document all the File Formats! also has contents on the likes of subdomains, e.g. [http://fileformats.archiveteam.org fileformats.archiveteam.org] and [http://justsolve.archiveteam.org justsolve.archiveteam.org]. '''IRC Channels {{IRC|justsolve}}'''
* [[ISP Hosting]]: Finding ISP web hosting services before the Grim Reaper finds them. '''IRC Channel {{IRC|webroasting}}'''.
 
* [[Project Newsletter]]: Archiving e-newsletters, currently in development. '''IRC Channel {{IRC|projectnewsletter}}'''.
== Recently finished projects ==
* [[Valhalla]]: Where to store what even the [[Internet Archive]] doesn't have space for? '''IRC Channel {{IRC|huntinggrounds}}'''.
<!-- projects that have finished in the last 30 days go here in reverse-chronogical order to be found easily and showcase recent work. additionally, keep projects here that are still in the tracker but not yet deleted so it won't confuse people. -->
* [[WikiTeam]]: permanent effort, [https://github.com/WikiTeam/wikiteam/wiki/Tutorial#I_have_no_shell_access_to_server everyone can help] (you choose the size of your downloads). '''IRC Channel {{IRC|wikiteam}}'''.
* [[Typepad]]: A blogging service ceased to exist by the end of September 2025. '''IRC Channel {{IRC|typebad}}'''
* [[Woohoo]]: Yahoo is untrustworthy, let's do a census of all their products. '''IRC Channel {{IRC|woohoo}}'''.


== Upcoming projects ==
== Upcoming & proposed projects ==
<!-- Websites you would like to have archived. Please create a wikipage about the project with information about the website (shutting down? (when), why should it be archived, etc.). -->
<!-- Top priority: could disappear anytime now -->
<!-- Top priority: could disappear anytime now -->
* [[RadioShack]]: RadioShack is going bankrupt. '''IRC Channel {{IRC|unshackled}}'''.
<!-- Shutting down, definite deadline given -->
<!-- Shutting down, definite deadline given -->
* [[Comcast Personal Web Pages]]: Comcast's web hosting, shutting down October 8, 2015.  '''IRC Channel {{IRC|comclose}}'''.
* [[Goo Blog]]: A blogging service closed on {{datetime|2025-11-25}} '''IRC Channel {{IRC|itsgoone}}'''
* [[Google Code]]: Google likes [[Github]] more. Shutting down on January 26, 2016. '''IRC Channel {{IRC|googlecodeblue}}'''.
* [[Chrome Web Store]]: Google has announced a timeline of policy changes that will lead to content being removed between {{datetime|2021-12-01}} and 2025. '''IRC Channel {{IRC|chromeweblore}}'''.
<!-- Shutting down, vague deadline given -->
<!-- Shutting down, vague deadline given -->
* [[Photobucket]]: Finally following through on over a year of email threats that free accounts are going to be mass deactivated if they don't pay up. '''IRC Channel {{IRC|photosucket}}'''.
* Abandoned iOS App Store & Google Play apps: Both Apple and Google are slimming down on abandoned apps, [https://www.cnet.com/tech/mobile/one-third-of-apple-and-google-apps-are-so-outdated-they-could-get-removed/ with an estimated ~1.5M of them at risk]. '''IRC Channel {{IRC|appocalypse}}'''.
* [[Twitter]]: General instability; deleting inactive accounts <s>{{datetime|2019-12-11}}</s> sometime. '''IRC Channel {{IRC|twitterdead|EFnet|abandoned}}'''.
<!-- Shutting down, no deadline given -->
<!-- Shutting down, no deadline given -->
* [[Panoramio]]: [[Google]] is migrating photos only (no metadata!) to Maps. '''IRC Channel {{IRC|paranormio}}'''.
<!-- Archiving the archives -->
<!-- Archiving the archives -->
* [[Orkut]]: Orkut got kut on September 30, 2014. It lives on as a public archive. '''IRC Channel {{IRC|throatkut}}'''.
<!-- Misc. projects (unmaintained sites, distrust in owners) -->
<!-- Misc. projects (unmaintained sites, distrust in owners) -->
* [[Picasa|Picasa Web Albums]]: Main page redirecting to Google Plus, future uncertain. '''IRC Channel {{IRC|picasso}}'''.
* [[Dailymotion]]: Archiving inactive videos. '''IRC Channel {{IRC|DailyDemotion}}'''
* [[Blipfoto]]: <s>Went into liquidation on March 11, 2015, future uncertain.</s> Acquired on March 25, 2015, we're still grabbing it anyway. '''IRC Channel {{IRC|fotofinish}}'''.
* [[VKontakte]]: A Russian equivalent of Facebook carries the risk of tumbling down under the weight of sanctions as a result of the government's invasion of Ukraine. '''IRC Channel {{IRC|lostkontakt}}'''.
* [[The Pirate Bay]]: Recently came back up, grabbing an archive for sanity's sake. '''IRC Channel {{IRC|yarharfiddlededee}}'''.
* [[JamiiForums]]: the Tanzanian government would like this gone. '''IRC Channel {{IRC|jammedforums|EFnet|abandoned}}'''.
* [[Ownlog]]: Ownlog is losing popularity and support from its owners. '''IRC Channel {{IRC|pwnlog}}'''.
* [[LiveJournal]]: Very old, widely regarded as in decline, and has a lot of important stuff buried in it. '''IRC Channel {{IRC|recordedjournal|EFnet|abandoned}}'''.
* [[The Pirate Bay]]: Recently came back up, grabbing an archive for sanity's sake. '''IRC Channel {{IRC|yarharfiddlededee|EFnet|abandoned}}'''.
* [[Valhalla]]: Where to store what even the [[Internet Archive]] doesn't have space for? '''IRC Channel {{IRC|huntinggrounds}}'''.
* [[Giphy]]: Bought by <s>Facebook</s>Shutterstock, to be "integrated" (assimilated) into <s>Instagram</s> https://news.knowyourmeme.com/news/facebook-to-buy-giphy


== Proposed projects ==
== On hiatus ==
<!-- Websites you would like to have archived. Please create a wikipage about the project with information about the website (shutting down? (when), why should it be archived, etc.). -->
* [[Angelfire]]: Angelfire is a web hosting service that contains big chunks of early WWW history and has no proper backup. '''IRC Channel {{IRC|angelonfire}}'''.
 
* [[Audit2014|Audit 2014]]: It's time to verify our shit. '''IRC Channel {{IRC|auditteam}}'''.
== Recently finished ==
* [[Flickr]]: <s>[[Yahoo!]]</s> SmugMug decided to kill it after finding Yahoo!'s plans to do so before they were bought by Verizon. '''IRC Channel {{IRC|flickrfckr}}'''.
<!-- put projects here that are still in the tracker but not yet deleted so it won't confuse people -->
* [[FTP]]: Help us find and download all FTP sites! '''IRC Channel {{IRC|effteepee}}'''.
* [[Club Nintendo]]: Ran out of lives. Game over for North America on July 30, 2015, September 30, 2015 for Europe and Japan. '''IRC Channel {{IRC|clubnintendont}}'''.
* [[Google Drive]]: Same as MediaFire. '''IRC Channel {{IRC|googlecrash}}'''. Currently on hiatus.
* [[Blip.tv]]: Disney/Maker Studios is killing Blip on August 20, 2015. '''IRC Channel {{IRC|blooper.tv}}'''.
* [[Google Groups]]: "Gone within a year" ([[User:Jscott|SketchCow]], {{datetime|2016-06-07}}).
* [[Skillfeed]]: Shutterstock is closing skillfeed.com on September 30, 2015. Grabbing instructional videos and metadata. '''IRC Channel {{IRC|skillessfeed}}'''
* [[Google News Archive]]: Let's store all newspapers at Google, WCGW? '''IRC Channel {{IRC|papersplease}}'''.
 
* [[INTERNETARCHIVE.BAK]]: Grab a slice of the big cake of [[Internet Archive|The Archive]]! '''IRC Channel {{IRC|internetarchive.bak}}'''.
<!-- == Hiatus / Missed the Mark == -->
* [[ISP Hosting]]: Finding ISP web hosting services before the Grim Reaper finds them. '''IRC Channel {{IRC|webroasting}}'''.
* [[Miraheze]]: <s>Shutting down sometime between {{datetime|2023-09-01}} and {{datetime|2023-10-31}}.</s> Rescued by new volunteers!
* [[Project Newsletter]]: Archiving e-newsletters, currently in development. '''IRC Channel {{IRC|projectnewsletter}}'''.
* [[Quizlet]]: Flashcards and other learning tools '''IRC Channel {{IRC|quizletusin}}'''.
* [[Tinkercad]]: Autodesk announced its intent to put designs from inactive OAuth accounts back into minds around {{datetime|2021-05-24}}. '''IRC Channel {{IRC|tinkerhad}}'''.
* [[Tumblr]]: [[Yahoo!]] considered killing it, now Yahoo has been acquired and Verizon declared war on NSFW blogs. Tumblr has since been sold to Automattic. '''IRC Channel {{IRC|tumbledown}}'''.
* [[Reddit]]: Banning communities that generate bad PR for Reddit Inc. Restricted access to APIs and data on {{datetime|2023-06-19}}.  '''IRC Channel {{IRC|shreddit}}'''.


<small>ArchiveTeam uses the EFnet IRC network – irc://irc.efnet.org – webchat: http://chat.efnet.org:9090 – [[IRC|More info]]</small>
<small>ArchiveTeam uses the hackint IRC network – ircs://irc.hackint.org:6697 (TLS required) – webchat: https://chat.hackint.org/#/connect?join=archiveteam-bs – [[Archiveteam:IRC|More info]]</small>

Latest revision as of 07:43, 1 October 2025

Archive Team recruiting

Warrior-based projects

ArchiveTeam's Choice: Typepad

Short-term, urgent projects

Medium-term projects

Long-term projects

Long-term, slower-paced projects

These are projects that are actively running but generally only have small numbers of items available to complete at a time.

Manual projects

Recently finished projects

  • Typepad: A blogging service ceased to exist by the end of September 2025. IRC Channel #typebad (on hackint)

Upcoming & proposed projects

On hiatus

  • Angelfire: Angelfire is a web hosting service that contains big chunks of early WWW history and has no proper backup. IRC Channel #angelonfire (on hackint).
  • Audit 2014: It's time to verify our shit. IRC Channel #auditteam (on hackint).
  • Flickr: Yahoo! SmugMug decided to kill it after finding Yahoo!'s plans to do so before they were bought by Verizon. IRC Channel #flickrfckr (on hackint).
  • FTP: Help us find and download all FTP sites! IRC Channel #effteepee (on hackint).
  • Google Drive: Same as MediaFire. IRC Channel #googlecrash (on hackint). Currently on hiatus.
  • Google Groups: "Gone within a year" (SketchCow, 2016-06-07).
  • Google News Archive: Let's store all newspapers at Google, WCGW? IRC Channel #papersplease (on hackint).
  • INTERNETARCHIVE.BAK: Grab a slice of the big cake of The Archive! IRC Channel #internetarchive.bak (on hackint).
  • ISP Hosting: Finding ISP web hosting services before the Grim Reaper finds them. IRC Channel #webroasting (on hackint).
  • Miraheze: Shutting down sometime between 2023-09-01 and 2023-10-31. Rescued by new volunteers!
  • Project Newsletter: Archiving e-newsletters, currently in development. IRC Channel #projectnewsletter (on hackint).
  • Quizlet: Flashcards and other learning tools IRC Channel #quizletusin (on hackint).
  • Tinkercad: Autodesk announced its intent to put designs from inactive OAuth accounts back into minds around 2021-05-24. IRC Channel #tinkerhad (on hackint).
  • Tumblr: Yahoo! considered killing it, now Yahoo has been acquired and Verizon declared war on NSFW blogs. Tumblr has since been sold to Automattic. IRC Channel #tumbledown (on hackint).
  • Reddit: Banning communities that generate bad PR for Reddit Inc. Restricted access to APIs and data on 2023-06-19. IRC Channel #shreddit (on hackint).

ArchiveTeam uses the hackint IRC network – ircs://irc.hackint.org:6697 (TLS required) – webchat: https://chat.hackint.org/#/connect?join=archiveteam-bsMore info