Difference between revisions of "Current Projects"

From Archiveteam
Jump to navigation Jump to search
m (HTTPS)
(Halo v2 and Endomondo to finished, and a note about the Warrior being a Docker thing for the time being)
(27 intermediate revisions by 5 users not shown)
Line 8: Line 8:
== Warrior-based projects ==
== Warrior-based projects ==
{{:CurrentWarriorProject}}
{{:CurrentWarriorProject}}
* [[GitHub]]: Embraced-uh, I mean, bought by Microsoft. '''IRC Channel {{IRC|gitgud|network=hackint}}'''.
* [[Tencent Weibo]]: Chinese Twitter-clone miniblog shutting down on September 28, 2020. '''IRC Channel {{IRC|twocents|network=hackint}}'''.


<!-- Urgent projects -->
<!-- Urgent projects -->
<!-- Long-term projects -->
<!-- Long-term projects -->
* [[URLTeam]]: URL shorteners were a fucking awful idea. '''IRC Channel {{IRC|urlteam|network=hackint}}'''.
* [[URLTeam]]: URL shorteners were a fucking awful idea. '''IRC Channel {{IRC|urlteam|network=hackint}}'''.
''Newer projects utilize wget-at which the existing Warrior virtual appliance isn't able to run. To be able to run these projects, use a Docker container. See [[Running Archive Team Projects with Docker]] for instructions.''


=== Scripts only ===
=== Scripts only ===
* Classic [[Google Sites]]: Making sites inaccessible to the public starting November 1, 2020. '''IRC Channel {{IRC|nearlylostmygoogles|network=hackint}}'''.
* [[MediaFire]]: [https://twitter.com/textfiles/status/1349516443654758401 Not 'at-risk' but grabbing speculatively to save historic files] '''IRC Channel {{IRC|mediaonfire|network=hackint}}'''.
* Classic [[Google Sites]]: Making more sites inaccessible to the public starting September 1, 2021. '''IRC Channel {{IRC|nearlylostmygoogles|network=hackint}}'''.
* [[Reddit]]: Banning communities that generate bad PR for Reddit Inc. Currently grabbing ''new'' material. '''IRC Channel {{IRC|shreddit|network=hackint}}'''.
* [[GitHub]]: Embraced-uh, I mean, bought by Microsoft. '''IRC Channel {{IRC|gitgud|network=hackint}}'''.
* [[URLs]]: A random collection of stuff. '''IRC Channel {{IRC|//|network=hackint}}'''.


== Manual projects ==
== Manual projects ==
* [[Coronavirus|2019-2020 coronavirus outbreak]]: Documenting and preserving data, events, and impacts of the virus on society. '''IRC Channel {{IRC|coronarchive|network=hackint}}'''
* [[Coronavirus|2019-2020 coronavirus outbreak]]: Documenting and preserving data, events, and impacts of the virus on society. '''IRC Channel {{IRC|coronarchive|network=hackint}}'''
* [[Yahoo! Groups]]: Years of internet threads, soon to go private-only. '''IRC Channel {{IRC|yahoosucks|network=hackint}}'''
* [[ArchiveBot]]: For those with lots of disk space, bandwidth and long-term commitment. '''IRC Channel {{IRC|archivebot|network=hackint}}'''.
* [[ArchiveBot]]: For those with lots of disk space, bandwidth and long-term commitment. '''IRC Channel {{IRC|archivebot|network=hackint}}'''.
* [[WikiTeam]]: Saving wikis dumps (XML). And their external links for the Wayback Machine (WARC) as well as exporting MediaWiki databases. Permanent effort, [https://github.com/WikiTeam/wikiteam/wiki/Tutorial#I_have_no_shell_access_to_server everyone can help] (you choose the size of your downloads). '''IRC Channel {{IRC|wikiteam|network=hackint}}'''.
* [[WikiTeam]]: Saving wikis dumps (XML). And their external links for the Wayback Machine (WARC) as well as exporting MediaWiki databases. Permanent effort, [https://github.com/WikiTeam/wikiteam/wiki/Tutorial#I_have_no_shell_access_to_server everyone can help] (you choose the size of your downloads). '''IRC Channel {{IRC|wikiteam|network=hackint}}'''.
Line 29: Line 32:
<!-- Top priority: could disappear anytime now -->
<!-- Top priority: could disappear anytime now -->
<!-- Shutting down, definite deadline given -->
<!-- Shutting down, definite deadline given -->
* [[Webs]]: Vistaprint is killing off the Freewebs you knew from the 2000s on March 31, 2021, unless you pay up. '''IRC Channel {{IRC|webbed|network=hackint}}'''.
* [[Periscope]]: Another Twitter acquisition, another shutdown. This time, its live-streamer gets to join Vine in the bin at the end of March. '''IRC Channel {{IRC|microscope|network=hackint}}'''.
* [[Google Poly]]: A 3D art repository that Google will send to the trash compactors on June 30, 2021. New uploads cease April 30. '''IRC Channel {{IRC|polygone|network=hackint}}'''.
* [[Chrome Web Store]]: Google has announced a timeline of policy changes that will lead to content being removed between December 1, 2020 and June 2022. '''IRC Channel {{IRC|chromeweblore|network=hackint}}'''.
<!-- Shutting down, vague deadline given -->
<!-- Shutting down, vague deadline given -->
* [[Kinja]]: Deleting all user pages, maybe? '''IRC Channel {{IRC|gokinjagokinjago|network=hackint}}'''.
* [[Kinja]]: Deleting all user pages, maybe? '''IRC Channel {{IRC|gokinjagokinjago|network=hackint}}'''.
Line 39: Line 46:
* [[LiveJournal]]: Very old, widely regarded as in decline, and has a lot of important stuff buried in it. '''IRC Channel {{IRC|recordedjournal}}'''.
* [[LiveJournal]]: Very old, widely regarded as in decline, and has a lot of important stuff buried in it. '''IRC Channel {{IRC|recordedjournal}}'''.
* [[Ownlog]]: Ownlog is losing popularity and support from its owners. '''IRC Channel {{IRC|pwnlog}}'''.
* [[Ownlog]]: Ownlog is losing popularity and support from its owners. '''IRC Channel {{IRC|pwnlog}}'''.
* [[Reddit]]: Banning communities that generate bad PR for Reddit Inc. '''IRC Channel {{IRC|shreddit|network=hackint}}'''.
* [[The Pirate Bay]]: Recently came back up, grabbing an archive for sanity's sake. '''IRC Channel {{IRC|yarharfiddlededee}}'''.
* [[The Pirate Bay]]: Recently came back up, grabbing an archive for sanity's sake. '''IRC Channel {{IRC|yarharfiddlededee}}'''.
* [[Valhalla]]: Where to store what even the [[Internet Archive]] doesn't have space for? '''IRC Channel {{IRC|huntinggrounds}}'''.
* [[Valhalla]]: Where to store what even the [[Internet Archive]] doesn't have space for? '''IRC Channel {{IRC|huntinggrounds}}'''.
Line 46: Line 52:
== Recently finished projects ==
== Recently finished projects ==
<!-- put projects here that are still in the tracker but not yet deleted so it won't confuse people -->
<!-- put projects here that are still in the tracker but not yet deleted so it won't confuse people -->
* [[Clutch]]: Game clips site that lost a clutch-or-kick bet, getting kicked on August 14 2020. '''IRC Channel {{IRC|pearls|network=hackint}}'''.
* [[Halo]]: Back to finishing off unfinished business before Bungie kills the original website on February 9, 2021. '''IRC Channel {{IRC|yolohalo|network=hackint}}'''.
* [[Bitbucket]]: Kicking the bucket on Mercurial repositories by July 1 2020 to worship Git instead. '''IRC Channel {{IRC|kickthebucket|network=hackint}}'''.
* [[Endomondo]]: GPS workout tracker with optional social networking features, shutting down 2020-12-31. '''IRC Channel {{IRC|findelmundo|network=hackint}}'''.
* [[Mixer]]: Video game streaming network shutting down 2020-07-23. '''IRC Channel {{IRC|mixdown|network=hackint}}'''
* [[.eu domains]]: The Brexit deal is done, and with that comes a purge of UK-based sites no longer eligible to use the .EU domain as of 2021. '''IRC Channel {{IRC|noteurdomain|network=hackint}}'''.
* [[Flash]] domains: An effort to preserve what remains of a storied legacy of websites hosting Adobe Flash Player content before the web industry takes it behind the shed. '''IRC Channel {{IRC|flashbang|network=hackint}}'''.


== Hiatus / Missed the Mark ==
== Hiatus / Missed the Mark ==
* [[Fast.io]]: A CDN for cloud storage services which will evaporate completely on 2021-01-15. '''IRC Channel {{IRC|slowio|network=hackint}}'''.
* [[Angelfire]]: Angelfire is a web hosting service that contains big chunks of early WWW history and has no proper backup. '''IRC Channel {{IRC|angelonfire}}'''.
* [[Angelfire]]: Angelfire is a web hosting service that contains big chunks of early WWW history and has no proper backup. '''IRC Channel {{IRC|angelonfire}}'''.
* [[Audit2014|Audit 2014]]: It's time to verify our shit. '''IRC Channel {{IRC|auditteam}}'''. THIS PROJECT IS ON HIATUS AND WILL BE RETURNED TO AS AUDIT2018.
* [[Audit2014|Audit 2014]]: It's time to verify our shit. '''IRC Channel {{IRC|auditteam}}'''. THIS PROJECT IS ON HIATUS AND WILL BE RETURNED TO AS AUDIT2018.
* [[Flickr]]: <s>[[Yahoo!]]</s> SmugMug decided to kill it after finding Yahoo!'s plans to do so before they were bought by Verizon. '''IRC Channel {{IRC|flickrfckr|network=hackint}}'''.
* [[Flickr]]: <s>[[Yahoo!]]</s> SmugMug decided to kill it after finding Yahoo!'s plans to do so before they were bought by Verizon. '''IRC Channel {{IRC|flickrfckr|network=hackint}}'''.
* [[Freeml]]: Japanese mailing list provider is sending its final email 2019-12-02. '''IRC Channel {{IRC|fml}}'''.
* [[FTP]]: Help us find and download all FTP sites! '''IRC Channel {{IRC|effteepee|network=hackint}}'''.
* [[FTP]]: Help us find and download all FTP sites! '''IRC Channel {{IRC|effteepee|network=hackint}}'''.
* [[Google Groups]]: "Gone within a year" ([[User:Jscott|SketchCow]], 2016-06-07).
* [[Google Groups]]: "Gone within a year" ([[User:Jscott|SketchCow]], 2016-06-07).

Revision as of 06:52, 15 February 2021

Archive Team recruiting

Warrior-based projects

ArchiveTeam's Choice: DeviantArt
  • URLTeam: URL shorteners were a fucking awful idea. IRC Channel #urlteam (on hackint).

Newer projects utilize wget-at which the existing Warrior virtual appliance isn't able to run. To be able to run these projects, use a Docker container. See Running Archive Team Projects with Docker for instructions.

Scripts only

Manual projects

  • 2019-2020 coronavirus outbreak: Documenting and preserving data, events, and impacts of the virus on society. IRC Channel #coronarchive (on hackint)
  • ArchiveBot: For those with lots of disk space, bandwidth and long-term commitment. IRC Channel #archivebot (on hackint).
  • WikiTeam: Saving wikis dumps (XML). And their external links for the Wayback Machine (WARC) as well as exporting MediaWiki databases. Permanent effort, everyone can help (you choose the size of your downloads). IRC Channel #wikiteam (on hackint).
  • MP3.com: Digging through the WayBack Machine's archives to build a database of all the DAM CDs made available through the site.

Upcoming & proposed projects

  • Webs: Vistaprint is killing off the Freewebs you knew from the 2000s on March 31, 2021, unless you pay up. IRC Channel #webbed (on hackint).
  • Periscope: Another Twitter acquisition, another shutdown. This time, its live-streamer gets to join Vine in the bin at the end of March. IRC Channel #microscope (on hackint).
  • Google Poly: A 3D art repository that Google will send to the trash compactors on June 30, 2021. New uploads cease April 30. IRC Channel #polygone (on hackint).
  • Chrome Web Store: Google has announced a timeline of policy changes that will lead to content being removed between December 1, 2020 and June 2022. IRC Channel #chromeweblore (on hackint).
  • Kinja: Deleting all user pages, maybe? IRC Channel #gokinjagokinjago (on hackint).
  • Twitter: Deleting inactive accounts 2019-12-11 sometime. IRC Channel #twitterdead (on hackint).
  • Imgur: Image hoster decided that using it for hosting images is not permitted. IRC Channel #imgone (on hackint).
  • JamiiForums: the Tanzanian government would like this gone. IRC Channel #jammedforums (on hackint).
  • LiveJournal: Very old, widely regarded as in decline, and has a lot of important stuff buried in it. IRC Channel #recordedjournal (on hackint).
  • Ownlog: Ownlog is losing popularity and support from its owners. IRC Channel #pwnlog (on hackint).
  • The Pirate Bay: Recently came back up, grabbing an archive for sanity's sake. IRC Channel #yarharfiddlededee (on hackint).
  • Valhalla: Where to store what even the Internet Archive doesn't have space for? IRC Channel #huntinggrounds (on hackint).
  • Giphy: Bought by Facebook, to be "integrated" (assimilated) into Instagram https://news.knowyourmeme.com/news/facebook-to-buy-giphy

Recently finished projects

  • Halo: Back to finishing off unfinished business before Bungie kills the original website on February 9, 2021. IRC Channel #yolohalo (on hackint).
  • Endomondo: GPS workout tracker with optional social networking features, shutting down 2020-12-31. IRC Channel #findelmundo (on hackint).
  • .eu domains: The Brexit deal is done, and with that comes a purge of UK-based sites no longer eligible to use the .EU domain as of 2021. IRC Channel #noteurdomain (on hackint).
  • Flash domains: An effort to preserve what remains of a storied legacy of websites hosting Adobe Flash Player content before the web industry takes it behind the shed. IRC Channel #flashbang (on hackint).

Hiatus / Missed the Mark

ArchiveTeam primarily uses the hackint IRC network – ircs://irc.hackint.org:6697 (TLS required) – webchat: https://webirc.hackint.org/More info ArchiveTeam also has some channels left on the EFnet IRC network – irc://irc.efnet.org – webchat: http://chat.efnet.org:9090More info