Current Projects

From Archiveteam
Revision as of 03:14, 10 July 2023 by FireonLive (talk | contribs) (let's keep recently finished around a bit longer)

Warrior-based projects

ArchiveTeam's Choice: Telegram
  • 半次元: The Chinese ACG community is about to leave millions of fans behind on July 12, 2023. IRC Channel #wuciyuan (on hackint).
  • DPReview: Amazon will be throwing this digital photography resource into the shredder on April 10, 2023. IRC Channel #dprived (on hackint).
  • Imgur: Unregistered users' "old" and "inactive" images will be purged, and all NSFW content is being shown the door on May 15, 2023. IRC Channel #imgone (on hackint).
  • Issuu: Interactive flipbook repository is clamping down on free users' upload limits and plans to make existing uploads falling foul of its new limits inaccessible to others. IRC Channel #wetakeissuu (on hackint).
  • Reddit: Banning communities that generate bad PR for Reddit Inc. Restricting access to APIs and data on June 19, 2023. IRC Channel #shreddit (on hackint).
  • Ukraine/Russian invasion: Archiving various .ua sites in the wake of the Russian government's invasion. IRC Channel #ucryne (on hackint).
  • Telegram: Archiving public messages in various newsworthy and/or otherwise notable Telegram channels. IRC Channel #telegrab (on hackint).
  • GitHub: Embraced-uh, I mean, bought by Microsoft. IRC Channel #gitgud (on hackint).
  • MediaFire: Not 'at-risk', but grabbing speculatively to save historic files. IRC Channel #mediaonfire (on hackint).
  • Google Drive: Same as MediaFire. IRC Channel #googlecrash (on hackint).
  • URLs: A random collection of stuff. IRC Channel #// (on hackint).
  • URLTeam: URL shorteners were a fucking awful idea. IRC Channel #urlteam (on hackint).

An updated Warrior virtual appliance (v3.2) is now available with better support for newer projects that utilize wget-at. Please download it using the link above.

Manual projects

  • 2019-202? coronavirus outbreak: Documenting and preserving data, events, and impacts of the virus on society. IRC Channel #coronarchive (on hackint).
  • ArchiveBot: For those with lots of disk space, bandwidth and long-term commitment. IRC Channel #archivebot (on hackint).
  • Codearchiver: Dumping and archival of source code repositories and associated version control systems. IRC Channel #codearchiver (on hackint).
  • WikiTeam: Saving wiki dumps (XML) and their external links for the Wayback Machine (WARC), as well as exporting MediaWiki databases. Permanent effort, everyone can help (you choose the size of your downloads). IRC Channel #wikiteam (on hackint).
  • Dead people: When people die, their webpages and/or social media might go "Poof!" due to fees and other knick-knack. IRC Channel #archiveteam (on hackint).

Upcoming & proposed projects

  • Skyblog: Skyrock will close its blogging service on August 21, 2023 to "comply with legislation on personal data". IRC Channel #bowlofpetunias (on hackint).
  • Xuite: The largest ISP in Taiwan has decided to give up its blogging business on August 31, 2023. IRC Channel #archiveteam-bs (on hackint).
  • Gfycat: A once-popular .webm host is pressing the stop button on its short videos for good on September 1, 2023. IRC Channel #deadcat (on hackint).
  • Chrome Web Store: Google has announced a timeline of policy changes that will lead to content being removed between December 1, 2020 and 2023. IRC Channel #chromeweblore (on hackint).
  • Photobucket: Finally following through on over a year of email threats that free accounts are going to be mass deactivated if they don't pay up. IRC Channel #photosucket (on hackint).
  • Abandoned iOS App Store & Google Play apps: Both Apple and Google are slimming down on abandoned apps, with an estimated ~1.5M of them at risk. IRC Channel #appocalypse (on hackint).
  • Kinja: Deleting all user pages, maybe? IRC Channel #gokinjagokinjago (on hackint).
  • Twitter: Deleting inactive accounts sometime (previously 2019-12-11). IRC Channel #archiveteam-bs (on hackint).
  • Miraheze: Shutting down sometime between September and October 2023.
  • VKontakte: A Russian equivalent of Facebook carries the risk of tumbling down under the weight of sanctions as a result of the government's invasion of Ukraine. IRC Channel #lostkontakt (on hackint).
  • YouTube: Archiving all YouTube metadata and selected videos afterwards soon. IRC Channel #down-the-tube (on hackint).
  • JamiiForums: The Tanzanian government would like this gone. IRC Channel #archiveteam-bs (on hackint).
  • LiveJournal: Very old, widely regarded as in decline, and has a lot of important stuff buried in it. IRC Channel #archiveteam-bs (on hackint).
  • Ownlog: Ownlog is losing popularity and support from its owners. IRC Channel #archiveteam-bs (on hackint).
  • The Pirate Bay: Recently came back up, grabbing an archive for sanity's sake. IRC Channel #archiveteam-bs (on hackint).
  • Valhalla: Where to store what even the Internet Archive doesn't have space for? IRC Channel #huntinggrounds (on hackint).
  • Giphy: Bought by Facebook, to be "integrated" (assimilated) into Instagram. https://news.knowyourmeme.com/news/facebook-to-buy-giphy

Recently finished projects

  • ЯRUS: ЯRUS announced it would be closing its doors on June 30, 2023 at 12:00Z. IRC Channel #norus (on hackint).
  • LINE BLOG: LINE has dropped a series of services ahead of its merger with Yahoo! Japan, and blogs were deleted on June 29, 2023. IRC Channel #holdtheline (on hackint).
  • Tiki: Tiki is an Indian short video hosting service. It shut down on June 28, 2023. IRC Channel #tiki-kacka (on hackint).
  • Egloos: The blogging platform shut down on June 16, 2023. IRC Channel #eggos (on hackint).
  • Classic Google Sites: Making more sites inaccessible to the public starting January 30, 2023 (previously September 1, 2021) with Workspace accounts. IRC Channel #nearlylostmygoogles (on hackint).

Hiatus / Missed the Mark

  • Webs: Vistaprint is killing off the Freewebs you knew from the 2000s sometime (previously March 31, then June 30, 2021), unless you pay up. IRC Channel #webbed (on hackint).
  • Tinkercad: Autodesk announced its intent to put designs from inactive OAuth accounts back into minds around May 24, 2021. IRC Channel #tinkerhad (on hackint).
  • Angelfire: Angelfire is a web hosting service that contains big chunks of early WWW history and has no proper backup. IRC Channel #angelonfire (on hackint).
  • Audit 2014: It's time to verify our shit. IRC Channel #auditteam (on hackint). THIS PROJECT IS ON HIATUS AND WILL BE RETURNED TO AS AUDIT2018.
  • Flickr: SmugMug (previously Yahoo!) decided to kill it after finding Yahoo!'s plans to do so before they were bought by Verizon. IRC Channel #flickrfckr (on hackint).
  • FTP: Help us find and download all FTP sites! IRC Channel #effteepee (on hackint).
  • Google Groups: "Gone within a year" (SketchCow, 2016-06-07).
  • Google News Archive: Let's store all newspapers at Google, WCGW? IRC Channel #papersplease (on hackint).
  • DevPort: This portfolio SaaS provider has reportedly been having infrastructure issues, and removed their social media accounts. Possible impending shutdown.
  • INTERNETARCHIVE.BAK: Grab a slice of the big cake of The Archive! IRC Channel #internetarchive.bak (on hackint).
  • ISP Hosting: Finding ISP web hosting services before the Grim Reaper finds them. IRC Channel #webroasting (on hackint).
  • NewsGrabber: Saving all news articles. Currently paused. IRC Channel #newsgrabber (on hackint).
  • Project Newsletter: Archiving e-newsletters, currently in development. IRC Channel #projectnewsletter (on hackint).
  • Quizlet: Flashcards and other learning tools. IRC Channel #quizletusin (on hackint).
  • Tumblr: Yahoo! considered killing it, now Yahoo has been acquired and Verizon declared war on NSFW blogs. Tumblr has since been sold to Automattic. IRC Channel #tumbledown (on hackint).
  • yuku: Lately yuku is very unstable and hosting thousands of forums. Project currently paused. IRC Channel #archiveteam-bs (on hackint).

ArchiveTeam uses the hackint IRC network – ircs://irc.hackint.org:6697 (TLS required) – webchat: https://webirc.hackint.org/
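
For those who would rather script the connection than use the webchat, here is a minimal Python sketch of joining one of the channels listed above over TLS. It is not ArchiveTeam tooling: the function names (parse_irc_url, registration_lines, join_channel) and the nickname are assumptions made up for illustration, and a real IRC client handles far more (nick collisions, reconnects, SASL). Only the URL and the channel name come from this page.

```python
import socket
import ssl
from urllib.parse import urlsplit

HACKINT_URL = "ircs://irc.hackint.org:6697"  # taken from the line above


def parse_irc_url(url):
    """Return (host, port, use_tls) for an irc:// or ircs:// URL."""
    parts = urlsplit(url)
    if parts.scheme not in ("irc", "ircs"):
        raise ValueError(f"not an IRC URL: {url!r}")
    use_tls = parts.scheme == "ircs"
    # Conventional default ports, used only when the URL names none.
    port = parts.port or (6697 if use_tls else 6667)
    return parts.hostname, port, use_tls


def registration_lines(nick):
    """Build the NICK/USER lines that open an IRC session."""
    return [f"NICK {nick}\r\n", f"USER {nick} 0 * :{nick}\r\n"]


def join_channel(url, nick, channel):
    """Connect over TLS, register, and join `channel` after the 001 welcome.

    Hypothetical helper: returns (socket, line reader) for further use.
    """
    host, port, use_tls = parse_irc_url(url)
    if not use_tls:
        raise ValueError("hackint requires TLS; use an ircs:// URL")
    context = ssl.create_default_context()  # verifies the server certificate
    raw = socket.create_connection((host, port), timeout=30)
    conn = context.wrap_socket(raw, server_hostname=host)
    for line in registration_lines(nick):
        conn.sendall(line.encode("utf-8"))
    reader = conn.makefile("r", encoding="utf-8", errors="replace")
    for line in reader:
        line = line.rstrip("\r\n")
        if line.startswith("PING"):
            # Answer keepalives, echoing the server's token back.
            conn.sendall(("PONG" + line[4:] + "\r\n").encode("utf-8"))
            continue
        fields = line.split(" ")
        if len(fields) > 1 and fields[1] == "001":
            # Registration is complete; now it is safe to join.
            conn.sendall(f"JOIN {channel}\r\n".encode("utf-8"))
            return conn, reader
    raise ConnectionError("server closed the connection before registration")


if __name__ == "__main__":
    # "warrior-example" is a placeholder nickname, not a real account.
    conn, reader = join_channel(HACKINT_URL, "warrior-example", "#archiveteam-bs")
    conn.close()
```

The URL parsing and line building are kept separate from the network code so they can be checked without connecting to anything.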