Difference between revisions of "Main Page/Current Projects"

From Archiveteam
(We're not saving frontback anymore, it's staying and it looks like they won't delete anything.)
 
(609 intermediate revisions by 55 users not shown)
Line 1: Line 1:
__NOTOC__
== Archive Team recruiting ==
* [[Dev|Want to code for Archive Team? Here's a starting point.]]
* Help us: '''[[ArchiveTeam_Warrior|☞ Download and run your warrior ☜]]'''. What's on: [https://tracker.archiveteam.org/ online tracker].
* '''[[Donate]]''' to keep our projects going.
* Anything shutting down? Put it on the '''[[Deathwatch]]''' or tell us on '''[[Archiveteam:IRC|IRC]]'''!
* Want to code for Archive Team? [[Dev|Here's a starting point.]]


== Warrior based projects ==
== Warrior-based projects ==
* [[Blingee]]: >130 million images and 9 years of user content <strike>will be deleted on August 25, 2015.</strike> '''IRC Channel {{IRC|tragedee}}'''.
{{:CurrentWarriorProject}}
* [[Last.fm]]: Switching codebases in the first two weeks of April, some user-generated content might be lost. '''IRC Channel {{IRC|lastchance.fm}}'''.
 
* [[SourceForge]]: Old, ad supported, adware supported. '''IRC Channel {{IRC|coldstorage}}'''.
=== Short-term, urgent projects ===
<!-- Projects with strong deadline, deadline is in the future -->
<!-- sorted by deadline (soonest on top) -->
* [[Tistory]]: Will delete inactive blogs on {{datetime|2025-09-22}} '''IRC Channel {{IRC|tatteredstory}}'''
 
=== Medium-term projects ===
<!-- Projects for which the deadline has passed, deadline is unclear, but there is a moment they are "finished" -->
<!-- sorted alphabetically -->
* [[Meta Ad Library]]: Database for advertisements for Facebook and other products by Meta. '''IRC Channel {{IRC|fads}}'''
* [[Peing]]: A Japanese question/answer service that was slated to shut down on {{datetime|2025-08-29}}. '''IRC Channel {{IRC|peingpong}}'''
* [[US Government]]: Archiving the US government. '''IRC Channel {{IRC|UncleSamsArchive}}'''
** [[Radio Free Asia]]: Non-profit media organization owned by USAGM.
** [[Radio Free Europe|Radio Free Europe/Radio Liberty]]: Non-profit media organization owned by USAGM.
** [[Voice of America]]: An internationally-broadcasting state media network at risk of closure.
 
=== Long-term projects ===
<!-- Ongoing projects. No deadline, no moment of "finishing" -->
<!-- sorted alphabetically -->
* [[Telegram]]: Archiving public messages in various newsworthy and/or otherwise notable Telegram channels. '''IRC Channel {{IRC|telegrab}}'''.
* [[Twitch]]: Archiving metadata and select videos. '''IRC Channel {{IRC|burnthetwitch}}'''
* [[URLTeam]]: URL shorteners were a fucking awful idea. '''IRC Channel {{IRC|urlteam}}'''.
* [[Halo]]: Downloading all the read-only data from Halo. '''IRC Channel {{IRC|yolohalo}}'''. (Rate-limited, generally has enough volunteers.)
* [[URLs]]: A random collection of stuff. '''IRC Channel {{IRC|//}}'''.
* [[Blogger]]: Google was to restrict public access to adult blogs, but they backed down. We're backing up Blogger anyway. '''IRC Channel {{IRC|frogger}}'''. (Content grab coming soon.)
* [[YouTube]]: Archiving [[YouTube#Scope|selected videos]]. '''IRC Channel {{IRC|down-the-tube}}'''.


Help us: '''[[warrior|☞ Download and run your warrior ☜]]'''.<br>
=== Long-term, slower-paced projects ===
What's on: [http://tracker.archiveteam.org/ online tracker].<br>
These are projects that are actively running but generally only have small numbers of items available to complete at a time.
Combined project activity graphs [http://zeppelin.xrtc.net/corp.xrtc.net/shilling.corp.xrtc.net/project_items.html here].
<!-- sorted alphabetically -->
* [[Blogger]]: Grabbing inactive Blogger blogs since Google began a mass purge of inactive Google accounts on or after {{datetime|2023-12-01}}. '''IRC Channel {{IRC|frogger}}'''.
* [[GitHub]]: Embraced-uh, I mean, bought by Microsoft. '''IRC Channel {{IRC|gitgud}}'''.
* [[Imgur]]: Unregistered users' "old" and "inactive" images will be purged, and all NSFW content is being shown the door on {{datetime|2023-05-15}}. '''IRC Channel {{IRC|imgone}}'''.
* [[MediaFire]]: [https://twitter.com/textfiles/status/1349516443654758401 Not 'at-risk' but grabbing speculatively to save historic files] '''IRC Channel {{IRC|mediaonfire}}'''.
* [[Microsoft Update]]: Removal of legacy Windows drivers announced. '''IRC Channel {{IRC|windowfixer}}'''
* [[Pastebin]]: Archiving the pastas. '''IRC Channel {{IRC|pastalavista}}'''.


== Manual projects ==
* [[AOL]]: Climbing into the decaying walled garden. '''IRC Channel {{IRC|aohell}}'''
* [[ArchiveBot]]: For those with lots of disk space, bandwidth and long-term commitment. '''IRC Channel {{IRC|archivebot}}'''.
* [[Audit2014|Audit 2014]]: It's time to verify our shit. '''IRC Channel {{IRC|auditteam}}'''.
* [[Codearchiver]]: Dumping and archival of source code repositories and associated version control systems. '''IRC Channel {{IRC|codearchiver}}'''.
* [[FTP]]: Download all the FTP sites! '''IRC Channel {{IRC|effteepee}}'''.
* Dead people: When people die, their webpages and/or social media might go "Poof!" due to fees and other knick-knack. '''IRC Channel {{IRC|archiveteam}}'''
* [[Froogle]]: Let's do a census of all of Google's products. '''IRC Channel {{IRC|froogle}}'''.
* [[Wikibot]] and [[WikiTeam]]: Saving wikis dumps (XML). And their external links for the Wayback Machine (WARC) as well as exporting MediaWiki databases. Permanent effort, [https://github.com/WikiTeam/wikiteam/wiki/Tutorial#I_have_no_shell_access_to_server everyone can help] (you choose the size of your downloads). '''IRC Channels {{IRC|wikibot}} {{IRC|wikiteam}}'''.
* [[INTERNETARCHIVE.BAK]]: How do we archive an archive? '''IRC Channel {{IRC|internetarchive.bak}}'''.
* [[Formats|File Formats]] and [[Just Solve the Problem 2012|Just Solve]]: Let's Document all the File Formats! also has contents on the likes of subdomains, e.g. [http://fileformats.archiveteam.org fileformats.archiveteam.org] and [http://justsolve.archiveteam.org justsolve.archiveteam.org]. '''IRC Channels {{IRC|justsolve}}'''
* [[ISP Hosting]]: Finding ISP web hosting services before the Grim Reaper finds them. '''IRC Channel {{IRC|webroasting}}'''.
 
* [[Project Newsletter]]: Archiving e-newsletters, currently in development. '''IRC Channel {{IRC|projectnewsletter}}'''.
== Recently finished projects ==
* [[Valhalla]]: Where to store what even the [[Internet Archive]] doesn't have space for? '''IRC Channel {{IRC|huntinggrounds}}'''.
<!-- Projects that have finished in the last 30 days go here in reverse-chronological order so they are easy to find and showcase recent work. Also keep projects here that are still in the tracker but not yet deleted, so it won't confuse people. -->
* [[WikiTeam]]: permanent effort, [https://github.com/WikiTeam/wikiteam/wiki/Tutorial#I_have_no_shell_access_to_server everyone can help] (you choose the size of your downloads). '''IRC Channel {{IRC|wikiteam}}'''.
* [[Typepad]]: A blogging service that ceased to exist at the end of September 2025. '''IRC Channel {{IRC|typebad}}'''
* [[Woohoo]]: Yahoo is untrustworthy, let's do a census of all their products. '''IRC Channel {{IRC|woohoo}}'''.


== Upcoming projects ==
== Upcoming & proposed projects ==
<!-- Websites you would like to have archived. Please create a wikipage about the project with information about the website (shutting down? (when), why should it be archived, etc.). -->
<!-- Top priority: could disappear anytime now -->
* [[RadioShack]]: RadioShack is going bankrupt. '''IRC Channel {{IRC|unshackled}}'''.
<!-- Shutting down, definite deadline given -->
* [[Comcast Personal Web Pages]]: Comcast's web hosting, shutting down October 8, 2015.  '''IRC Channel {{IRC|comclose}}'''.
* [[Goo Blog]]: A blogging service closed on {{datetime|2025-11-25}} '''IRC Channel {{IRC|itsgoone}}'''
* [[Club Nintendo]]: Ran out of lives. Game over for North America on July 30, 2015, September 30, 2015 for Europe and Japan. '''IRC Channel {{IRC|clubnintendont}}'''.
* [[Chrome Web Store]]: Google has announced a timeline of policy changes that will lead to content being removed between {{datetime|2021-12-01}} and 2025. '''IRC Channel {{IRC|chromeweblore}}'''.
* [[Google Code]]: Google likes [[Github]] more. Shutting down on January 26, 2016. '''IRC Channel {{IRC|googlecodeblue}}'''.
<!-- Shutting down, vague deadline given -->
* [[Photobucket]]: Finally following through on over a year of email threats that free accounts are going to be mass deactivated if they don't pay up. '''IRC Channel {{IRC|photosucket}}'''.
* Abandoned iOS App Store & Google Play apps: Both Apple and Google are slimming down on abandoned apps, [https://www.cnet.com/tech/mobile/one-third-of-apple-and-google-apps-are-so-outdated-they-could-get-removed/ with an estimated ~1.5M of them at risk]. '''IRC Channel {{IRC|appocalypse}}'''.
* [[Twitter]]: General instability; deleting inactive accounts <s>{{datetime|2019-12-11}}</s> sometime. '''IRC Channel {{IRC|twitterdead|EFnet|abandoned}}'''.
<!-- Shutting down, no deadline given -->
* [[Panoramio]]: [[Google]] is migrating photos only (no metadata!) to Maps. '''IRC Channel {{IRC|paranormio}}'''.
<!-- Archiving the archives -->
* [[Orkut]]: Orkut got kut on September 30, 2014. It lives on as a public archive. '''IRC Channel {{IRC|throatkut}}'''.
<!-- Misc. projects (unmaintained sites, distrust in owners) -->
* [[Picasa|Picasa Web Albums]]: Main page redirecting to Google Plus, future uncertain. '''IRC Channel {{IRC|picasso}}'''.
* [[Dailymotion]]: Archiving inactive videos. '''IRC Channel {{IRC|DailyDemotion}}'''
* [[Blipfoto]]: <s>Went into liquidation on March 11, 2015, future uncertain.</s> Acquired on March 25, 2015, we're still grabbing it anyway. '''IRC Channel {{IRC|fotofinish}}'''.
* [[VKontakte]]: A Russian equivalent of Facebook carries the risk of tumbling down under the weight of sanctions as a result of the government's invasion of Ukraine. '''IRC Channel {{IRC|lostkontakt}}'''.
* [[The Pirate Bay]]: Recently came back up, grabbing an archive for sanity's sake. '''IRC Channel {{IRC|yarharfiddlededee}}'''.
* [[JamiiForums]]: the Tanzanian government would like this gone. '''IRC Channel {{IRC|jammedforums|EFnet|abandoned}}'''.
* [[Ownlog]]: Ownlog is losing popularity and support from its owners. '''IRC Channel {{IRC|pwnlog}}'''.
* [[LiveJournal]]: Very old, widely regarded as in decline, and has a lot of important stuff buried in it. '''IRC Channel {{IRC|recordedjournal|EFnet|abandoned}}'''.
* [[The Pirate Bay]]: Recently came back up, grabbing an archive for sanity's sake. '''IRC Channel {{IRC|yarharfiddlededee|EFnet|abandoned}}'''.
* [[Valhalla]]: Where to store what even the [[Internet Archive]] doesn't have space for? '''IRC Channel {{IRC|huntinggrounds}}'''.
* [[Giphy]]: Bought by <s>Facebook</s>Shutterstock, to be "integrated" (assimilated) into <s>Instagram</s> https://news.knowyourmeme.com/news/facebook-to-buy-giphy


== Recently finished ==
== On hiatus ==
<!-- put projects here that are still in the tracker but not yet deleted so it won't confuse people -->
* [[Angelfire]]: Angelfire is a web hosting service that contains big chunks of early WWW history and has no proper backup. '''IRC Channel {{IRC|angelonfire}}'''.
* [[Blip.tv]]: Disney/Maker Studios is killing Blip on August 20, 2015. '''IRC Channel {{IRC|blooper.tv}}'''.
* [[Audit2014|Audit 2014]]: It's time to verify our shit. '''IRC Channel {{IRC|auditteam}}'''.
* [[Skillfeed]]: Shutterstock is closing skillfeed.com on September 30, 2015. Grabbing instructional videos and metadata. '''IRC Channel {{IRC|skillessfeed}}'''
* [[Flickr]]: <s>[[Yahoo!]]</s> SmugMug decided to kill it after finding Yahoo!'s plans to do so before they were bought by Verizon. '''IRC Channel {{IRC|flickrfckr}}'''.
 
* [[FTP]]: Help us find and download all FTP sites! '''IRC Channel {{IRC|effteepee}}'''.
<!-- == Hiatus / Missed the Mark == -->
* [[Google Drive]]: Same as MediaFire. '''IRC Channel {{IRC|googlecrash}}'''. Currently on hiatus.
* [[Google Groups]]: "Gone within a year" ([[User:Jscott|SketchCow]], {{datetime|2016-06-07}}).
* [[Google News Archive]]: Let's store all newspapers at Google, WCGW? '''IRC Channel {{IRC|papersplease}}'''.
* [[INTERNETARCHIVE.BAK]]: Grab a slice of the big cake of [[Internet Archive|The Archive]]! '''IRC Channel {{IRC|internetarchive.bak}}'''.
* [[ISP Hosting]]: Finding ISP web hosting services before the Grim Reaper finds them. '''IRC Channel {{IRC|webroasting}}'''.
* [[Miraheze]]: <s>Shutting down sometime between {{datetime|2023-09-01}} and {{datetime|2023-10-31}}.</s> Rescued by new volunteers!
* [[Project Newsletter]]: Archiving e-newsletters, currently in development. '''IRC Channel {{IRC|projectnewsletter}}'''.
* [[Quizlet]]: Flashcards and other learning tools. '''IRC Channel {{IRC|quizletusin}}'''.
* [[Tinkercad]]: Autodesk announced its intent to put designs from inactive OAuth accounts back into minds around {{datetime|2021-05-24}}. '''IRC Channel {{IRC|tinkerhad}}'''.
* [[Tumblr]]: [[Yahoo!]] considered killing it, now Yahoo has been acquired and Verizon declared war on NSFW blogs. Tumblr has since been sold to Automattic. '''IRC Channel {{IRC|tumbledown}}'''.
* [[Reddit]]: Banning communities that generate bad PR for Reddit Inc. Restricted access to APIs and data on {{datetime|2023-06-19}}.  '''IRC Channel {{IRC|shreddit}}'''.


<small>ArchiveTeam uses the EFnet IRC network – irc://irc.efnet.org – webchat: http://chat.efnet.org:9090 – [[IRC|More info]]</small>
<small>ArchiveTeam uses the hackint IRC network – ircs://irc.hackint.org:6697 (TLS required) – webchat: https://chat.hackint.org/#/connect?join=archiveteam-bs – [[Archiveteam:IRC|More info]]</small>

Latest revision as of 07:43, 1 October 2025

Archive Team recruiting

Warrior-based projects

ArchiveTeam's Choice: Typepad

Short-term, urgent projects

Medium-term projects

Long-term projects

Long-term, slower-paced projects

These are projects that are actively running but generally only have small numbers of items available to complete at a time.

Manual projects

Recently finished projects

  • Typepad: A blogging service that ceased to exist at the end of September 2025. IRC Channel #typebad (on hackint)

Upcoming & proposed projects

On hiatus

  • Angelfire: Angelfire is a web hosting service that contains big chunks of early WWW history and has no proper backup. IRC Channel #angelonfire (on hackint).
  • Audit 2014: It's time to verify our shit. IRC Channel #auditteam (on hackint).
  • Flickr: <s>Yahoo!</s> SmugMug decided to kill it after finding Yahoo!'s plans to do so before they were bought by Verizon. IRC Channel #flickrfckr (on hackint).
  • FTP: Help us find and download all FTP sites! IRC Channel #effteepee (on hackint).
  • Google Drive: Same as MediaFire. IRC Channel #googlecrash (on hackint). Currently on hiatus.
  • Google Groups: "Gone within a year" (SketchCow, 2016-06-07).
  • Google News Archive: Let's store all newspapers at Google, WCGW? IRC Channel #papersplease (on hackint).
  • INTERNETARCHIVE.BAK: Grab a slice of the big cake of The Archive! IRC Channel #internetarchive.bak (on hackint).
  • ISP Hosting: Finding ISP web hosting services before the Grim Reaper finds them. IRC Channel #webroasting (on hackint).
  • Miraheze: <s>Shutting down sometime between 2023-09-01 and 2023-10-31.</s> Rescued by new volunteers!
  • Project Newsletter: Archiving e-newsletters, currently in development. IRC Channel #projectnewsletter (on hackint).
  • Quizlet: Flashcards and other learning tools. IRC Channel #quizletusin (on hackint).
  • Tinkercad: Autodesk announced its intent to put designs from inactive OAuth accounts back into minds around 2021-05-24. IRC Channel #tinkerhad (on hackint).
  • Tumblr: Yahoo! considered killing it, now Yahoo has been acquired and Verizon declared war on NSFW blogs. Tumblr has since been sold to Automattic. IRC Channel #tumbledown (on hackint).
  • Reddit: Banning communities that generate bad PR for Reddit Inc. Restricted access to APIs and data on 2023-06-19. IRC Channel #shreddit (on hackint).

ArchiveTeam uses the hackint IRC network – ircs://irc.hackint.org:6697 (TLS required) – webchat: https://chat.hackint.org/#/connect?join=archiveteam-bs – More info