Difference between revisions of "Main Page/Current Projects"

From Archiveteam
Jump to navigation Jump to search
m
(add Manga Library Z, fix order)
Line 12: Line 12:
<!-- Urgent projects -->
<!-- Urgent projects -->
<!-- sorted by deadline (soonest on top) -->
<!-- sorted by deadline (soonest on top) -->
* [[Veoh]]: A video hosting site shutting down on {{datetime|2024-11-11}}. '''IRC Channel {{IRC|veohnah}}'''
* [[マンガ図書館Z|マンガ図書館Z (Manga Library Z)]]: A site that distributed old and out-of-print manga is shutting down on {{datetime|2024-11-26}}. '''IRC Channel {{IRC|mangoes}}'''
* [[Cohost]]: A social media site shutting down at the end of {{datetime|2024}}. '''IRC Channel {{IRC|nohost}}'''
* [[Cohost]]: A social media site shutting down at the end of {{datetime|2024}}. '''IRC Channel {{IRC|nohost}}'''
* [[Veoh]]: A video hosting site shutting down on {{datetime|2024-11-11}}. '''IRC Channel {{IRC|veohnah}}'''
* [[Nhentai]]: A hoster of hentai (pornographic manga) has repeatedly been in legal trouble and is being sued in the US. '''IRC Channel {{IRC|177013}}'''
* [[Nhentai]]: A hoster of hentai (pornographic manga) has repeatedly been in legal trouble and is being sued in the US. '''IRC Channel {{IRC|177013}}'''



Revision as of 22:05, 8 November 2024

Archive Team recruiting

Warrior-based projects

ArchiveTeam's Choice: Typepad

Short-term, urgent projects

  • Veoh: A video hosting site shutting down on 2024-11-11. IRC Channel #veohnah (on hackint)
  • マンガ図書館Z (Manga Library Z): A site that distributed old and out-of-print manga is shutting down on 2024-11-26. IRC Channel #mangoes (on hackint)
  • Cohost: A social media site shutting down at the end of 2024. IRC Channel #nohost (on hackint)
  • Nhentai: A hoster of hentai (pornographic manga) has repeatedly been in legal trouble and is being sued in the US. IRC Channel #177013 (on hackint)

Medium-term projects

(none currently)

Long-term projects

  • Telegram: Archiving public messages in various newsworthy and/or otherwise notable Telegram channels. IRC Channel #telegrab (on hackint).
  • Tor URLs: A random collection of stuff, but from Tor .onion URLs. IRC Channel #// (on hackint).
  • URLTeam: URL shorteners were a fucking awful idea. IRC Channel #urlteam (on hackint).
  • URLs: A random collection of stuff. IRC Channel #// (on hackint).
  • YouTube: Archiving selected videos. IRC Channel #down-the-tube (on hackint).

Long-term, slower-paced projects

These are projects that are actively running but generally only have small numbers of items available to complete at a time.


An updated Warrior virtual appliance (v3.2, v4.0) is now available with better support for newer projects that utilize wget-at.

Manual projects

  • ArchiveBot: For those with lots of disk space, bandwidth and long-term commitment. IRC Channel #archivebot (on hackint).
  • Codearchiver: Dumping and archival of source code repositories and associated version control systems. IRC Channel #codearchiver (on hackint).
  • Dead people: When people die, their webpages and/or social media might go "Poof!" due to fees and other knick-knack. IRC Channel #archiveteam (on hackint)
  • WikiTeam: Saving wikis dumps (XML). And their external links for the Wayback Machine (WARC) as well as exporting MediaWiki databases. Permanent effort, everyone can help (you choose the size of your downloads). IRC Channel #wikiteam (on hackint).

Upcoming & proposed projects

Recently finished projects

On Hiatus

  • Angelfire: Angelfire is a web hosting service that contains big chunks of early WWW history and has no proper backup. IRC Channel #angelonfire (on hackint).
  • Audit 2014: It's time to verify our shit. IRC Channel #auditteam (on hackint).
  • Flickr: Yahoo! SmugMug decided to kill it after finding Yahoo!'s plans to do so before they were bought by Verizon. IRC Channel #flickrfckr (on hackint).
  • FTP: Help us find and download all FTP sites! IRC Channel #effteepee (on hackint).
  • Google Drive: Same as MediaFire. IRC Channel #googlecrash (on hackint). Currently on hiatus.
  • Google Groups: "Gone within a year" (SketchCow, 2016-06-07).
  • Google News Archive: Let's store all newspapers at Google, WCGW? IRC Channel #papersplease (on hackint).
  • INTERNETARCHIVE.BAK: Grab a slice of the big cake of The Archive! IRC Channel #internetarchive.bak (on hackint).
  • ISP Hosting: Finding ISP web hosting services before the Grim Reaper finds them. IRC Channel #webroasting (on hackint).
  • Miraheze: Shutting down sometime between 2023-09-01 and 2023-10-31. Rescued by new volunteers!
  • Project Newsletter: Archiving e-newsletters, currently in development. IRC Channel #projectnewsletter (on hackint).
  • Quizlet: Flashcards and other learning tools IRC Channel #quizletusin (on hackint).
  • Tinkercad: Autodesk announced its intent to put designs from inactive OAuth accounts back into minds around 2021-05-24. IRC Channel #tinkerhad (on hackint).
  • Tumblr: Yahoo! considered killing it, now Yahoo has been acquired and Verizon declared war on NSFW blogs. Tumblr has since been sold to Automattic. IRC Channel #tumbledown (on hackint).
  • Reddit: Banning communities that generate bad PR for Reddit Inc. Restricted access to APIs and data on 2023-06-19. IRC Channel #shreddit (on hackint).

ArchiveTeam uses the hackint IRC network – ircs://irc.hackint.org:6697 (TLS required) – webchat: https://webirc.hackint.org/More info