Difference between revisions of "Main Page/Current Projects"
Jump to navigation
Jump to search
(Google Drive to warrior-based) |
|||
(214 intermediate revisions by 23 users not shown) | |||
Line 1: | Line 1: | ||
__NOTOC__ | __NOTOC__ | ||
== Archive Team recruiting == | == Archive Team recruiting == | ||
* Help us: '''[[ArchiveTeam_Warrior|☞ Download and run your warrior ☜]]'''. | |||
* What's on: [https://tracker.archiveteam.org/ online tracker]. | |||
* [[Donate|Donate to keep our projects going]]. | |||
* [[Dev|Want to code for Archive Team? Here's a starting point.]] | * [[Dev|Want to code for Archive Team? Here's a starting point.]] | ||
== Warrior-based projects == | == Warrior-based projects == | ||
{{:CurrentWarriorProject}} | {{:CurrentWarriorProject}} | ||
=== Short-term, urgent projects === | |||
<!-- Projects with strong deadline, deadline is in the future --> | |||
<!-- sorted by deadline (soonest on top) --> | |||
<!-- | * [[Typepad]]: A blogging service ceased to exist by the end of September 2025. '''IRC Channel {{IRC|typebad}}''' | ||
<!-- | |||
* [[ | |||
''An | === Medium-term projects === | ||
<!-- Projects for which the deadline has passed, deadline is unclear, but there is a moment they are "finished" --> | |||
=== | <!-- sorted alphabetically --> | ||
* [[Meta Ad Library]]: Database for advertisements for Facebook and other products by Meta. '''IRC Channel {{IRC|fads}}''' | |||
* [[Peing]]: A Japanese question/answer service, was slated to be shutdown on {{datetime|2025-08-29}}. '''IRC Channel {{IRC|peingpong}}''' | |||
* [[US Government]]: Archiving the US government. '''IRC Channel {{IRC|UncleSamsArchive}}''' | |||
** [[Radio Free Asia]]: Non-profit media organization owned by USAGM. | |||
** [[Radio Free Europe|Radio Free Europe/Radio Liberty]]: Non-profit media organization owned by USAGM. | |||
** [[Voice of America]]: An internationally-broadcasting state media network at risk of closure. | |||
=== Long-term projects === | |||
<!-- Ongoing projects. No deadline, no moment of "finishing" --> | |||
<!-- sorted alphabetically --> | |||
* [[Microsoft Update]]: Removal of legacy Windows drivers announced. '''IRC Channel {{IRC|windowfixer}}''' | |||
* [[Telegram]]: Archiving public messages in various newsworthy and/or otherwise notable Telegram channels. '''IRC Channel {{IRC|telegrab}}'''. | |||
* [[Twitch]]: Archiving metadata and select videos. '''IRC Channel {{IRC|burnthetwitch}}''' | |||
* [[URLTeam]]: URL shorteners were a fucking awful idea. '''IRC Channel {{IRC|urlteam}}'''. | |||
* [[URLs]]: A random collection of stuff. '''IRC Channel {{IRC|//}}'''. | |||
* [[YouTube]]: Archiving [[YouTube#Scope|selected videos]]. '''IRC Channel {{IRC|down-the-tube}}'''. | |||
=== Long-term, slower-paced projects === | |||
These are projects that are actively running but generally only have small numbers of items available to complete at a time. | |||
<!-- sorted alphabetically --> | |||
* [[Blogger]]: Grabbing inactive Blogger blogs since Google began a mass purge of inactive Google accounts on or after {{datetime|2023-12-01}}. '''IRC Channel {{IRC|frogger}}'''. | |||
* [[GitHub]]: Embraced-uh, I mean, bought by Microsoft. '''IRC Channel {{IRC|gitgud}}'''. | |||
* [[Imgur]]: Unregistered users' "old" and "inactive" images will be purged, and all NSFW content is being shown the door on {{datetime|2023-05-15}}. '''IRC Channel {{IRC|imgone}}'''. | |||
* [[MediaFire]]: [https://twitter.com/textfiles/status/1349516443654758401 Not 'at-risk' but grabbing speculatively to save historic files] '''IRC Channel {{IRC|mediaonfire}}'''. | |||
* [[Pastebin]]: Archiving the pastas. '''IRC Channel {{IRC|pastalavista}}'''. | |||
== Manual projects == | == Manual projects == | ||
* [[ | * [[ArchiveBot]]: For those with lots of disk space, bandwidth and long-term commitment. '''IRC Channel {{IRC|archivebot}}'''. | ||
* [[ | * [[Codearchiver]]: Dumping and archival of source code repositories and associated version control systems. '''IRC Channel {{IRC|codearchiver}}'''. | ||
* [[WikiTeam]]: Saving wikis dumps (XML). And their external links for the Wayback Machine (WARC) as well as exporting MediaWiki databases. Permanent effort, [https://github.com/WikiTeam/wikiteam/wiki/Tutorial#I_have_no_shell_access_to_server everyone can help] (you choose the size of your downloads). '''IRC | * Dead people: When people die, their webpages and/or social media might go "Poof!" due to fees and other knick-knack. '''IRC Channel {{IRC|archiveteam}}''' | ||
* [[Wikibot]] and [[WikiTeam]]: Saving wikis dumps (XML). And their external links for the Wayback Machine (WARC) as well as exporting MediaWiki databases. Permanent effort, [https://github.com/WikiTeam/wikiteam/wiki/Tutorial#I_have_no_shell_access_to_server everyone can help] (you choose the size of your downloads). '''IRC Channels {{IRC|wikibot}} {{IRC|wikiteam}}'''. | |||
* [[Formats|File Formats]] and [[Just Solve the Problem 2012|Just Solve]]: Let's Document all the File Formats! also has contents on the likes of subdomains, e.g. [http://fileformats.archiveteam.org fileformats.archiveteam.org] and [http://justsolve.archiveteam.org justsolve.archiveteam.org]. '''IRC Channels {{IRC|justsolve}}''' | |||
== Recently finished projects == | |||
<!-- projects that have finished in the last 30 days go here in reverse-chronogical order to be found easily and showcase recent work. additionally, keep projects here that are still in the tracker but not yet deleted so it won't confuse people. --> | |||
* [[Glitch]]: Hobbyist web hosting. Due to be inaccessible {{datetime|2025-07-08}} and expected to fully shutdown on the end of 2025. '''IRC Channel {{IRC|ditched}}'''. | |||
* [[Goo.gl]]: Google's URL shortener will shut down on {{datetime|2025-08-25}}, excluding links that were active in late 2024. '''IRC Channel {{IRC|urlteamwasright}}''' | |||
== Upcoming & proposed projects == | == Upcoming & proposed projects == | ||
Line 37: | Line 59: | ||
<!-- Top priority: could disappear anytime now --> | <!-- Top priority: could disappear anytime now --> | ||
<!-- Shutting down, definite deadline given --> | <!-- Shutting down, definite deadline given --> | ||
* [[Chrome Web Store]]: Google has announced a timeline of policy changes that will lead to content being removed between | * [[Dailymotion]]: Archiving inactive videos. '''IRC Channel {{IRC|DailyDemotion}}''' | ||
* [[Chrome Web Store]]: Google has announced a timeline of policy changes that will lead to content being removed between {{datetime|2021-12-01}} and 2025. '''IRC Channel {{IRC|chromeweblore}}'''. | |||
<!-- Shutting down, vague deadline given --> | <!-- Shutting down, vague deadline given --> | ||
* [[ | * [[Photobucket]]: Finally following through on over a year of email threats that free accounts are going to be mass deactivated if they don't pay up. '''IRC Channel {{IRC|photosucket}}'''. | ||
* [[Twitter]]: | * Abandoned iOS App Store & Google Play apps: Both Apple and Google are slimming down on abandoned apps, [https://www.cnet.com/tech/mobile/one-third-of-apple-and-google-apps-are-so-outdated-they-could-get-removed/ with an estimated ~1.5M of them at risk]. '''IRC Channel {{IRC|appocalypse}}'''. | ||
* [[Twitter]]: General instability; deleting inactive accounts <s>{{datetime|2019-12-11}}</s> sometime. '''IRC Channel {{IRC|twitterdead|EFnet|abandoned}}'''. | |||
<!-- Shutting down, no deadline given --> | <!-- Shutting down, no deadline given --> | ||
<!-- Archiving the archives --> | <!-- Archiving the archives --> | ||
<!-- Misc. projects (unmaintained sites, distrust in owners) --> | <!-- Misc. projects (unmaintained sites, distrust in owners) --> | ||
* [[ | * [[VKontakte]]: A Russian equivalent of Facebook carries the risk of tumbling down under the weight of sanctions as a result of the government's invasion of Ukraine. '''IRC Channel {{IRC|lostkontakt}}'''. | ||
* [[JamiiForums]]: the Tanzanian government would like this gone. '''IRC Channel {{IRC|jammedforums|EFnet|abandoned}}'''. | |||
* [[JamiiForums]]: the Tanzanian government would like this gone. '''IRC Channel {{IRC|jammedforums}}'''. | * [[LiveJournal]]: Very old, widely regarded as in decline, and has a lot of important stuff buried in it. '''IRC Channel {{IRC|recordedjournal|EFnet|abandoned}}'''. | ||
* [[LiveJournal]]: Very old, widely regarded as in decline, and has a lot of important stuff buried in it. '''IRC Channel {{IRC|recordedjournal | * [[The Pirate Bay]]: Recently came back up, grabbing an archive for sanity's sake. '''IRC Channel {{IRC|yarharfiddlededee|EFnet|abandoned}}'''. | ||
* [[The Pirate Bay]]: Recently came back up, grabbing an archive for sanity's sake. '''IRC Channel {{IRC|yarharfiddlededee}}'''. | |||
* [[Valhalla]]: Where to store what even the [[Internet Archive]] doesn't have space for? '''IRC Channel {{IRC|huntinggrounds}}'''. | * [[Valhalla]]: Where to store what even the [[Internet Archive]] doesn't have space for? '''IRC Channel {{IRC|huntinggrounds}}'''. | ||
* [[Giphy]]: Bought by Facebook, to be "integrated" (assimilated) into Instagram https://news.knowyourmeme.com/news/facebook-to-buy-giphy | * [[Giphy]]: Bought by <s>Facebook</s>Shutterstock, to be "integrated" (assimilated) into <s>Instagram</s> https://news.knowyourmeme.com/news/facebook-to-buy-giphy | ||
== | == On hiatus == | ||
* [[Angelfire]]: Angelfire is a web hosting service that contains big chunks of early WWW history and has no proper backup. '''IRC Channel {{IRC|angelonfire}}'''. | * [[Angelfire]]: Angelfire is a web hosting service that contains big chunks of early WWW history and has no proper backup. '''IRC Channel {{IRC|angelonfire}}'''. | ||
* [[Audit2014|Audit 2014]]: It's time to verify our shit. '''IRC Channel {{IRC|auditteam}}''' | * [[Audit2014|Audit 2014]]: It's time to verify our shit. '''IRC Channel {{IRC|auditteam}}'''. | ||
* [[Flickr]]: <s>[[Yahoo!]]</s> SmugMug decided to kill it after finding Yahoo!'s plans to do so before they were bought by Verizon. '''IRC Channel {{IRC|flickrfckr | * [[Flickr]]: <s>[[Yahoo!]]</s> SmugMug decided to kill it after finding Yahoo!'s plans to do so before they were bought by Verizon. '''IRC Channel {{IRC|flickrfckr}}'''. | ||
* [[FTP]]: Help us find and download all FTP sites! '''IRC Channel {{IRC|effteepee| | * [[FTP]]: Help us find and download all FTP sites! '''IRC Channel {{IRC|effteepee}}'''. | ||
* [[Google Groups]]: "Gone within a year" ([[User:Jscott|SketchCow]], 2016-06-07). | * [[Google Drive]]: Same as MediaFire. '''IRC Channel {{IRC|googlecrash}}'''. Currently on hiatus. | ||
* [[Google Groups]]: "Gone within a year" ([[User:Jscott|SketchCow]], {{datetime|2016-06-07}}). | |||
* [[Google News Archive]]: Let's store all newspapers at Google, WCGW? '''IRC Channel {{IRC|papersplease}}'''. | * [[Google News Archive]]: Let's store all newspapers at Google, WCGW? '''IRC Channel {{IRC|papersplease}}'''. | ||
* [[INTERNETARCHIVE.BAK]]: Grab a slice of the big cake of [[Internet Archive|The Archive]]! '''IRC Channel {{IRC|internetarchive.bak}}'''. | * [[INTERNETARCHIVE.BAK]]: Grab a slice of the big cake of [[Internet Archive|The Archive]]! '''IRC Channel {{IRC|internetarchive.bak}}'''. | ||
* [[ISP Hosting]]: Finding ISP web hosting services before the Grim Reaper finds them. '''IRC Channel {{IRC|webroasting | * [[ISP Hosting]]: Finding ISP web hosting services before the Grim Reaper finds them. '''IRC Channel {{IRC|webroasting}}'''. | ||
* [[ | * [[Livestream]]: A video stream site merging with Vimeo in {{datetime|2025-01}}. '''IRC Channel {{IRC|deadtrickle}}''' | ||
* [[Miraheze]]: <s>Shutting down sometime between {{datetime|2023-09-01}} and {{datetime|2023-10-31}}.</s> Rescued by new volunteers! | |||
* [[Project Newsletter]]: Archiving e-newsletters, currently in development. '''IRC Channel {{IRC|projectnewsletter}}'''. | * [[Project Newsletter]]: Archiving e-newsletters, currently in development. '''IRC Channel {{IRC|projectnewsletter}}'''. | ||
* [[Quizlet]]: Flashcards and other learning tools '''IRC Channel {{IRC|quizletusin}}'''. | * [[Quizlet]]: Flashcards and other learning tools '''IRC Channel {{IRC|quizletusin}}'''. | ||
* [[Tumblr]]: [[Yahoo!]] considered killing it, now Yahoo has been acquired and Verizon declared war on NSFW blogs. '''IRC Channel {{IRC|tumbledown | * [[Tinkercad]]: Autodesk announced its intent to put designs from inactive OAuth accounts back into minds around {{datetime|2021-05-24}}. '''IRC Channel {{IRC|tinkerhad}}'''. | ||
* [[ | * [[Tumblr]]: [[Yahoo!]] considered killing it, now Yahoo has been acquired and Verizon declared war on NSFW blogs. Tumblr has since been sold to Automattic. '''IRC Channel {{IRC|tumbledown}}'''. | ||
* [[Reddit]]: Banning communities that generate bad PR for Reddit Inc. Restricted access to APIs and data on {{datetime|2023-06-19}}. '''IRC Channel {{IRC|shreddit}}'''. | |||
<small>ArchiveTeam | <small>ArchiveTeam uses the hackint IRC network – ircs://irc.hackint.org:6697 (TLS required) – webchat: https://chat.hackint.org/#/connect?join=archiveteam-bs – [[Archiveteam:IRC|More info]]</small> | ||
Latest revision as of 23:36, 8 September 2025
Archive Team recruiting
- Help us: ☞ Download and run your warrior ☜.
- What's on: online tracker.
- Donate to keep our projects going.
- Want to code for Archive Team? Here's a starting point.
Warrior-based projects
Short-term, urgent projects
- Typepad: A blogging service ceased to exist by the end of September 2025. IRC Channel #typebad (on hackint)
Medium-term projects
- Meta Ad Library: Database for advertisements for Facebook and other products by Meta. IRC Channel #fads (on hackint)
- Peing: A Japanese question/answer service, was slated to be shutdown on 2025-08-29. IRC Channel #peingpong (on hackint)
- US Government: Archiving the US government. IRC Channel #UncleSamsArchive (on hackint)
- Radio Free Asia: Non-profit media organization owned by USAGM.
- Radio Free Europe/Radio Liberty: Non-profit media organization owned by USAGM.
- Voice of America: An internationally-broadcasting state media network at risk of closure.
Long-term projects
- Microsoft Update: Removal of legacy Windows drivers announced. IRC Channel #windowfixer (on hackint)
- Telegram: Archiving public messages in various newsworthy and/or otherwise notable Telegram channels. IRC Channel #telegrab (on hackint).
- Twitch: Archiving metadata and select videos. IRC Channel #burnthetwitch (on hackint)
- URLTeam: URL shorteners were a fucking awful idea. IRC Channel #urlteam (on hackint).
- URLs: A random collection of stuff. IRC Channel #// (on hackint).
- YouTube: Archiving selected videos. IRC Channel #down-the-tube (on hackint).
Long-term, slower-paced projects
These are projects that are actively running but generally only have small numbers of items available to complete at a time.
- Blogger: Grabbing inactive Blogger blogs since Google began a mass purge of inactive Google accounts on or after 2023-12-01. IRC Channel #frogger (on hackint).
- GitHub: Embraced-uh, I mean, bought by Microsoft. IRC Channel #gitgud (on hackint).
- Imgur: Unregistered users' "old" and "inactive" images will be purged, and all NSFW content is being shown the door on 2023-05-15. IRC Channel #imgone (on hackint).
- MediaFire: Not 'at-risk' but grabbing speculatively to save historic files IRC Channel #mediaonfire (on hackint).
- Pastebin: Archiving the pastas. IRC Channel #pastalavista (on hackint).
Manual projects
- ArchiveBot: For those with lots of disk space, bandwidth and long-term commitment. IRC Channel #archivebot (on hackint).
- Codearchiver: Dumping and archival of source code repositories and associated version control systems. IRC Channel #codearchiver (on hackint).
- Dead people: When people die, their webpages and/or social media might go "Poof!" due to fees and other knick-knack. IRC Channel #archiveteam (on hackint)
- Wikibot and WikiTeam: Saving wikis dumps (XML). And their external links for the Wayback Machine (WARC) as well as exporting MediaWiki databases. Permanent effort, everyone can help (you choose the size of your downloads). IRC Channels #wikibot (on hackint) #wikiteam (on hackint).
- File Formats and Just Solve: Let's Document all the File Formats! also has contents on the likes of subdomains, e.g. fileformats.archiveteam.org and justsolve.archiveteam.org. IRC Channels #justsolve (on hackint)
Recently finished projects
- Glitch: Hobbyist web hosting. Due to be inaccessible 2025-07-08 and expected to fully shutdown on the end of 2025. IRC Channel #ditched (on hackint).
- Goo.gl: Google's URL shortener will shut down on 2025-08-25, excluding links that were active in late 2024. IRC Channel #urlteamwasright (on hackint)
Upcoming & proposed projects
- Dailymotion: Archiving inactive videos. IRC Channel #DailyDemotion (on hackint)
- Chrome Web Store: Google has announced a timeline of policy changes that will lead to content being removed between 2021-12-01 and 2025. IRC Channel #chromeweblore (on hackint).
- Photobucket: Finally following through on over a year of email threats that free accounts are going to be mass deactivated if they don't pay up. IRC Channel #photosucket (on hackint).
- Abandoned iOS App Store & Google Play apps: Both Apple and Google are slimming down on abandoned apps, with an estimated ~1.5M of them at risk. IRC Channel #appocalypse (on hackint).
- Twitter: General instability; deleting inactive accounts
2019-12-11sometime. IRC Channel #archiveteam-bs (on hackint). - VKontakte: A Russian equivalent of Facebook carries the risk of tumbling down under the weight of sanctions as a result of the government's invasion of Ukraine. IRC Channel #lostkontakt (on hackint).
- JamiiForums: the Tanzanian government would like this gone. IRC Channel #archiveteam-bs (on hackint).
- LiveJournal: Very old, widely regarded as in decline, and has a lot of important stuff buried in it. IRC Channel #archiveteam-bs (on hackint).
- The Pirate Bay: Recently came back up, grabbing an archive for sanity's sake. IRC Channel #archiveteam-bs (on hackint).
- Valhalla: Where to store what even the Internet Archive doesn't have space for? IRC Channel #huntinggrounds (on hackint).
- Giphy: Bought by
FacebookShutterstock, to be "integrated" (assimilated) intoInstagramhttps://news.knowyourmeme.com/news/facebook-to-buy-giphy
On hiatus
- Angelfire: Angelfire is a web hosting service that contains big chunks of early WWW history and has no proper backup. IRC Channel #angelonfire (on hackint).
- Audit 2014: It's time to verify our shit. IRC Channel #auditteam (on hackint).
- Flickr:
Yahoo!SmugMug decided to kill it after finding Yahoo!'s plans to do so before they were bought by Verizon. IRC Channel #flickrfckr (on hackint). - FTP: Help us find and download all FTP sites! IRC Channel #effteepee (on hackint).
- Google Drive: Same as MediaFire. IRC Channel #googlecrash (on hackint). Currently on hiatus.
- Google Groups: "Gone within a year" (SketchCow, 2016-06-07).
- Google News Archive: Let's store all newspapers at Google, WCGW? IRC Channel #papersplease (on hackint).
- INTERNETARCHIVE.BAK: Grab a slice of the big cake of The Archive! IRC Channel #internetarchive.bak (on hackint).
- ISP Hosting: Finding ISP web hosting services before the Grim Reaper finds them. IRC Channel #webroasting (on hackint).
- Livestream: A video stream site merging with Vimeo in 2025-01. IRC Channel #deadtrickle (on hackint)
- Miraheze:
Shutting down sometime between 2023-09-01 and 2023-10-31.Rescued by new volunteers! - Project Newsletter: Archiving e-newsletters, currently in development. IRC Channel #projectnewsletter (on hackint).
- Quizlet: Flashcards and other learning tools IRC Channel #quizletusin (on hackint).
- Tinkercad: Autodesk announced its intent to put designs from inactive OAuth accounts back into minds around 2021-05-24. IRC Channel #tinkerhad (on hackint).
- Tumblr: Yahoo! considered killing it, now Yahoo has been acquired and Verizon declared war on NSFW blogs. Tumblr has since been sold to Automattic. IRC Channel #tumbledown (on hackint).
- Reddit: Banning communities that generate bad PR for Reddit Inc. Restricted access to APIs and data on 2023-06-19. IRC Channel #shreddit (on hackint).
ArchiveTeam uses the hackint IRC network – ircs://irc.hackint.org:6697 (TLS required) – webchat: https://chat.hackint.org/#/connect?join=archiveteam-bs – More info