Difference between revisions of "Main Page/Current Projects"

From Archiveteam
Jump to navigation Jump to search
(move justintv to active)
(🟒)
Β 
(865 intermediate revisions by 65 users not shown)
Line 1: Line 1:
__NOTOC__
__NOTOC__
== Archive Team recruiting ==
* Help us: '''[[ArchiveTeam_Warrior|☞ Download and run your warrior ☜]]'''. What's on: [https://tracker.archiveteam.org/ online tracker].
* '''[[Donate]]''' to keep our projects going.
* Anything shutting down? Put it on the '''[[Deathwatch]]''' or tell us on '''[[IRC]]'''!
* Want to code for Archive Team? [[Dev|Here's a starting point.]]


== Archive Team recruiting ==
== Warrior-based projects ==
* [[Dev|Want to code for Archive Team? Here's a starting point.]]
{{:CurrentWarriorProject}}
Β 
=== Short-term, urgent projects ===
<!-- Projects with strong deadline, deadline is in the future -->
<!-- sorted by deadline (soonest on top) -->
* [[Tistory]]: Will delete inactive blogs on {{datetime|2025-09-22}} '''IRC Channel {{IRC|tatteredstory}}'''
* [[Typepad]]: A blogging service ceased to exist by the end of September 2025. '''IRC Channel {{IRC|typebad}}'''
Β 
=== Medium-term projects ===
<!-- Projects for which the deadline has passed, deadline is unclear, but there is a moment they are "finished" -->
<!-- sorted alphabetically -->
* [[Meta Ad Library]]: Database for advertisements for Facebook and other products by Meta. '''IRC Channel {{IRC|fads}}'''
* [[Peing]]: A Japanese question/answer service, was slated to be shutdown on {{datetime|2025-08-29}}. '''IRC Channel {{IRC|peingpong}}'''
* [[US Government]]: Archiving the US government. '''IRC Channel {{IRC|UncleSamsArchive}}'''
** [[Radio Free Asia]]: Non-profit media organization owned by USAGM.
** [[Radio Free Europe|Radio Free Europe/Radio Liberty]]: Non-profit media organization owned by USAGM.
** [[Voice of America]]: An internationally-broadcasting state media network at risk of closure.


== Warrior based projects ==
=== Long-term projects ===
* [[URLTeam]]: URL shorteners were a fucking awful idea. IRC channel '''#urlteam'''. ''(Currently broken, coders wanted!)''
<!-- Ongoing projects. No deadline, no moment of "finishing" -->
* [[Justin.tv]]: Deleting all archived videos on June 8, 2014. IRC channel '''#justouttv'''.
<!-- sorted alphabetically -->
* [[Microsoft Update]]: Removal of legacy Windows drivers announced. '''IRC Channel {{IRC|windowfixer}}'''
* [[Telegram]]: Archiving public messages in various newsworthy and/or otherwise notable Telegram channels. '''IRC Channel {{IRC|telegrab}}'''.
* [[Twitch]]: Archiving metadata and select videos. '''IRC Channel {{IRC|burnthetwitch}}'''
* [[URLTeam]]: URL shorteners were a fucking awful idea. '''IRC Channel {{IRC|urlteam}}'''.
* [[URLs]]: A random collection of stuff. '''IRC Channel {{IRC|//}}'''.
* [[YouTube]]: Archiving [[YouTube#Scope|selected videos]]. '''IRC Channel {{IRC|down-the-tube}}'''.


Help us: '''[[warrior|☞ Download and run your warrior ☜]]'''.
=== Long-term, slower-paced projects ===
What's on: [http://tracker.archiveteam.org/ online tracker].
These are projects that are actively running but generally only have small numbers of items available to complete at a time.
<!-- sorted alphabetically -->
* [[Blogger]]: Grabbing inactive Blogger blogs since Google began a mass purge of inactive Google accounts on or after {{datetime|2023-12-01}}. '''IRC Channel {{IRC|frogger}}'''.
* [[GitHub]]: Embraced-uh, I mean, bought by Microsoft. '''IRC Channel {{IRC|gitgud}}'''.
* [[Imgur]]: Unregistered users' "old" and "inactive" images will be purged, and all NSFW content is being shown the door on {{datetime|2023-05-15}}. '''IRC Channel {{IRC|imgone}}'''.
* [[MediaFire]]: [https://twitter.com/textfiles/status/1349516443654758401 Not 'at-risk' but grabbing speculatively to save historic files] '''IRC Channel {{IRC|mediaonfire}}'''.
* [[Pastebin]]: Archiving the pastas. '''IRC Channel {{IRC|pastalavista}}'''.


== Manual projects ==
== Manual projects ==
* [[FTP]]: Download all the FTP sites!
* [[ArchiveBot]]: For those with lots of disk space, bandwidth and long-term commitment. '''IRC Channel {{IRC|archivebot}}'''.
* [[WikiTeam]]: permanent effort, [http://code.google.com/p/wikiteam/wiki/NewTutorial#I_have_no_shell_access_to_server everyone can help] (you choose the size of your downloads).
* [[Codearchiver]]: Dumping and archival of source code repositories and associated version control systems. '''IRC Channel {{IRC|codearchiver}}'''.
* [[Puu.sh]]: Expiring inactive files after 1 month; now in continuous mode. IRC Channel '''#pushharder'''.
* Dead people: When people die, their webpages and/or social media might go "Poof!" due to fees and other knick-knack. '''IRC Channel {{IRC|archiveteam}}'''
* [[Wikibot]] and [[WikiTeam]]: Saving wikis dumps (XML). And their external links for the Wayback Machine (WARC) as well as exporting MediaWiki databases. Permanent effort, [https://github.com/WikiTeam/wikiteam/wiki/Tutorial#I_have_no_shell_access_to_server everyone can help] (you choose the size of your downloads). '''IRC Channels {{IRC|wikibot}} {{IRC|wikiteam}}'''.
* [[Formats|File Formats]] and [[Just Solve the Problem 2012|Just Solve]]: Let's Document all the File Formats! also has contents on the likes of subdomains, e.g. [http://fileformats.archiveteam.org fileformats.archiveteam.org] and [http://justsolve.archiveteam.org justsolve.archiveteam.org]. '''IRC Channels {{IRC|justsolve}}'''
Β 
== Recently finished projects ==
<!-- projects that have finished in the last 30 days go here in reverse-chronogical order to be found easily and showcase recent work. additionally, keep projects here that are still in the tracker but not yet deleted so it won't confuse people. -->
* [[Oshiete! Goo]]: A Q&A service closed on September 17, 2025. '''IRC Channel {{IRC|itsgoone}}'''


== Upcoming projects ==
== Upcoming & proposed projects ==
* Saving Verizon customer pages, [http://www.verizon.com/support/residential/internet/fiosinternet/general+support/essentials+and+extras/questionsone/85372.htm shutting down] on September 2014.
<!-- Websites you would like to have archived. Please create a wikipage about the project with information about the website (shutting down? (when), why should it be archived, etc.). -->
* [[MLKSHK]]: Shutting down September 1, 2014. '''IRC Channel #totheyard'''.
<!-- Top priority: could disappear anytime now -->
* [[Helium]]: 1 million articles to be deleted on December 15, 2014.
<!-- Shutting down, definite deadline given -->
* [[Goo Blog]]: A blogging service closed on {{datetime|2025-11-25}} '''IRC Channel {{IRC|itsgoone}}'''
* [[Chrome Web Store]]: Google has announced a timeline of policy changes that will lead to content being removed between {{datetime|2021-12-01}} and 2025. '''IRC Channel {{IRC|chromeweblore}}'''.
<!-- Shutting down, vague deadline given -->
* [[Photobucket]]: Finally following through on over a year of email threats that free accounts are going to be mass deactivated if they don't pay up. '''IRC Channel {{IRC|photosucket}}'''.
* Abandoned iOS App Store & Google Play apps: Both Apple and Google are slimming down on abandoned apps, [https://www.cnet.com/tech/mobile/one-third-of-apple-and-google-apps-are-so-outdated-they-could-get-removed/ with an estimated ~1.5M of them at risk]. '''IRC Channel {{IRC|appocalypse}}'''.
* [[Twitter]]: General instability; deleting inactive accounts <s>{{datetime|2019-12-11}}</s> sometime. '''IRC Channel {{IRC|twitterdead|EFnet|abandoned}}'''.
<!-- Shutting down, no deadline given -->
<!-- Archiving the archives -->
<!-- Misc. projects (unmaintained sites, distrust in owners) -->
* [[Dailymotion]]: Archiving inactive videos. '''IRC Channel {{IRC|DailyDemotion}}'''
* [[VKontakte]]: A Russian equivalent of Facebook carries the risk of tumbling down under the weight of sanctions as a result of the government's invasion of Ukraine. '''IRC Channel {{IRC|lostkontakt}}'''.
* [[JamiiForums]]: the Tanzanian government would like this gone. '''IRC Channel {{IRC|jammedforums|EFnet|abandoned}}'''.
* [[LiveJournal]]: Very old, widely regarded as in decline, and has a lot of important stuff buried in it. '''IRC Channel {{IRC|recordedjournal|EFnet|abandoned}}'''.
* [[The Pirate Bay]]: Recently came back up, grabbing an archive for sanity's sake. '''IRC Channel {{IRC|yarharfiddlededee|EFnet|abandoned}}'''.
* [[Valhalla]]: Where to store what even the [[Internet Archive]] doesn't have space for? '''IRC Channel {{IRC|huntinggrounds}}'''.
* [[Giphy]]: Bought by <s>Facebook</s>Shutterstock, to be "integrated" (assimilated) into <s>Instagram</s> https://news.knowyourmeme.com/news/facebook-to-buy-giphy


== Recently finished ==
== On hiatus ==
<!-- put projects here that are still in the tracker but not yet deleted so it won't confuse people -->
* [[Angelfire]]: Angelfire is a web hosting service that contains big chunks of early WWW history and has no proper backup. '''IRC Channel {{IRC|angelonfire}}'''.
* [[Canv.as]]: Saving the images before they go offline. IRC Channel '''#canvas'''.
* [[Audit2014|Audit 2014]]: It's time to verify our shit. '''IRC Channel {{IRC|auditteam}}'''.
* [[Mochi Media]]: Goodbye Flash games. Shanda-acquiree forced to shut down on March 31, 2014. IRC Channel '''#mochibaibai'''.
* [[Flickr]]: <s>[[Yahoo!]]</s> SmugMug decided to kill it after finding Yahoo!'s plans to do so before they were bought by Verizon. '''IRC Channel {{IRC|flickrfckr}}'''.
* [[Dogster|Catster & Dogster]]: Won't be putting communities to sleep on March 3, 2014, but we got a copy anyway. IRC Channel '''#rawdogster'''.
* [[FTP]]: Help us find and download all FTP sites! '''IRC Channel {{IRC|effteepee}}'''.
* [[Viddler]]: Won't be deleting personal and non-free account videos permanently. IRC Channel '''#fiddler'''.
* [[Google Drive]]: Same as MediaFire. '''IRC Channel {{IRC|googlecrash}}'''. Currently on hiatus.
* [[My Opera]]: It's all over after 2014-03-03. IRC Channel '''#fatlady'''.
* [[Google Groups]]: "Gone within a year" ([[User:Jscott|SketchCow]], {{datetime|2016-06-07}}).
* [[Google News Archive]]: Let's store all newspapers at Google, WCGW? '''IRC Channel {{IRC|papersplease}}'''.
* [[INTERNETARCHIVE.BAK]]: Grab a slice of the big cake of [[Internet Archive|The Archive]]! '''IRC Channel {{IRC|internetarchive.bak}}'''.
* [[ISP Hosting]]: Finding ISP web hosting services before the Grim Reaper finds them. '''IRC Channel {{IRC|webroasting}}'''.
* [[Livestream]]: A video stream site merging with Vimeo in {{datetime|2025-01}}. '''IRC Channel {{IRC|deadtrickle}}'''
* [[Miraheze]]: <s>Shutting down sometime between {{datetime|2023-09-01}} and {{datetime|2023-10-31}}.</s> Rescued by new volunteers!
* [[Project Newsletter]]: Archiving e-newsletters, currently in development. '''IRC Channel {{IRC|projectnewsletter}}'''.
* [[Quizlet]]: Flashcards and other learning tools '''IRC Channel {{IRC|quizletusin}}'''.
* [[Tinkercad]]: Autodesk announced its intent to put designs from inactive OAuth accounts back into minds around {{datetime|2021-05-24}}. '''IRC Channel {{IRC|tinkerhad}}'''.
* [[Tumblr]]: [[Yahoo!]] considered killing it, now Yahoo has been acquired and Verizon declared war on NSFW blogs. Tumblr has since been sold to Automattic. '''IRC Channel {{IRC|tumbledown}}'''.
* [[Reddit]]: Banning communities that generate bad PR for Reddit Inc. Restricted access to APIs and data on {{datetime|2023-06-19}}. '''IRC Channel {{IRC|shreddit}}'''.


=== Hiatus / Missed the Mark ===
<small>ArchiveTeam uses the hackint IRC network – ircs://irc.hackint.org:6697 (TLS required) – webchat: https://chat.hackint.org/#/connect?join=archiveteam-bs – [[Archiveteam:IRC|More info]]</small>
* [[Bebo]]: Trashed by [[AOL]] and Criterion Capital Partners. Saving the remains. IRC Channel '''#cockandballs'''.
* Saving [[BerliOS]].
* Bolt is imploding, and announced [http://boltagain.ning.com/ the death of their domain and a month left to live.]
* [[Slidecast]] has announced it is going read only at the end of February 2014 and Slidecasts will become Slideshares on April 30, 2014.

Latest revision as of 14:42, 20 September 2025

Archive Team recruiting

Warrior-based projects

ArchiveTeam's Choice: Tistory

Short-term, urgent projects

  • Tistory: Will delete inactive blogs on 2025-09-22 IRC Channel #tatteredstory (on hackint)
  • Typepad: A blogging service ceased to exist by the end of September 2025. IRC Channel #typebad (on hackint)

Medium-term projects

Long-term projects

Long-term, slower-paced projects

These are projects that are actively running but generally only have small numbers of items available to complete at a time.

Manual projects

Recently finished projects

Upcoming & proposed projects

On hiatus

  • Angelfire: Angelfire is a web hosting service that contains big chunks of early WWW history and has no proper backup. IRC Channel #angelonfire (on hackint).
  • Audit 2014: It's time to verify our shit. IRC Channel #auditteam (on hackint).
  • Flickr: Yahoo! SmugMug decided to kill it after finding Yahoo!'s plans to do so before they were bought by Verizon. IRC Channel #flickrfckr (on hackint).
  • FTP: Help us find and download all FTP sites! IRC Channel #effteepee (on hackint).
  • Google Drive: Same as MediaFire. IRC Channel #googlecrash (on hackint). Currently on hiatus.
  • Google Groups: "Gone within a year" (SketchCow, 2016-06-07).
  • Google News Archive: Let's store all newspapers at Google, WCGW? IRC Channel #papersplease (on hackint).
  • INTERNETARCHIVE.BAK: Grab a slice of the big cake of The Archive! IRC Channel #internetarchive.bak (on hackint).
  • ISP Hosting: Finding ISP web hosting services before the Grim Reaper finds them. IRC Channel #webroasting (on hackint).
  • Livestream: A video stream site merging with Vimeo in 2025-01. IRC Channel #deadtrickle (on hackint)
  • Miraheze: Shutting down sometime between 2023-09-01 and 2023-10-31. Rescued by new volunteers!
  • Project Newsletter: Archiving e-newsletters, currently in development. IRC Channel #projectnewsletter (on hackint).
  • Quizlet: Flashcards and other learning tools IRC Channel #quizletusin (on hackint).
  • Tinkercad: Autodesk announced its intent to put designs from inactive OAuth accounts back into minds around 2021-05-24. IRC Channel #tinkerhad (on hackint).
  • Tumblr: Yahoo! considered killing it, now Yahoo has been acquired and Verizon declared war on NSFW blogs. Tumblr has since been sold to Automattic. IRC Channel #tumbledown (on hackint).
  • Reddit: Banning communities that generate bad PR for Reddit Inc. Restricted access to APIs and data on 2023-06-19. IRC Channel #shreddit (on hackint).

ArchiveTeam uses the hackint IRC network – ircs://irc.hackint.org:6697 (TLS required) – webchat: https://chat.hackint.org/#/connect?join=archiveteam-bs – More info