Difference between revisions of "Current Projects"

From Archiveteam
Jump to navigation Jump to search
(Halo's back, bitches)
(Google Drive to warrior-based)
(26 intermediate revisions by 4 users not shown)
Line 2: Line 2:
== Archive Team recruiting ==
== Archive Team recruiting ==
* [[Dev|Want to code for Archive Team? Here's a starting point.]]
* [[Dev|Want to code for Archive Team? Here's a starting point.]]
* Help us: '''[[warrior|☞ Download and run your warrior ☜]]'''.<br>
* Help us: '''[[ArchiveTeam_Warrior|☞ Download and run your warrior ☜]]'''.<br>
* What's on: [https://tracker.archiveteam.org/ online tracker].<br>
* What's on: [https://tracker.archiveteam.org/ online tracker].<br>
<!--Combined project activity graphs [http://zeppelin.xrtc.net/corp.xrtc.net/shilling.corp.xrtc.net/project_items.html here].-->
<!--Combined project activity graphs [http://zeppelin.xrtc.net/corp.xrtc.net/shilling.corp.xrtc.net/project_items.html here].-->
Line 10: Line 10:


<!-- Urgent projects -->
<!-- Urgent projects -->
* Afghanistan: Archiving the Afghan web due to recent events. '''IRC Channel {{IRC|afghansites|network=hackint}}'''.
* [[XTube]]: The shutdown on 5 September 2021 will surely leave a gaping hole in the web. '''IRC Channel {{IRC|nevermind|network=hackint}}'''.
<!-- Longer but finite projects -->
* [[Google Drive]]: Google will break millions of shared Drive links on 13 September 2021. '''IRC Channel {{IRC|googlecrash|network=hackint}}'''.
* Classic [[Google Sites]]: Making more sites inaccessible to the public starting September 1, 2021. '''IRC Channel {{IRC|nearlylostmygoogles|network=hackint}}'''.
* [[Periscope]]: Another Twitter acquisition, another shutdown. This time, its live-streamer gets to join Vine in the bin at the end of March. '''IRC Channel {{IRC|microscope|network=hackint}}'''.
* [[Webs]]: Vistaprint is killing off the Freewebs you knew from the 2000s on <s>March 31</s> June 30, 2021, unless you pay up. '''IRC Channel {{IRC|webbed|network=hackint}}'''.
<!-- Long-term projects -->
<!-- Long-term projects -->
* [[GitHub]]: Embraced-uh, I mean, bought by Microsoft. '''IRC Channel {{IRC|gitgud|network=hackint}}'''.
* [[MediaFire]]: [https://twitter.com/textfiles/status/1349516443654758401 Not 'at-risk' but grabbing speculatively to save historic files] '''IRC Channel {{IRC|mediaonfire|network=hackint}}'''.
* [[Reddit]]: Banning communities that generate bad PR for Reddit Inc. Currently grabbing ''new'' material. '''IRC Channel {{IRC|shreddit|network=hackint}}'''.
* [[URLs]]: A random collection of stuff. '''IRC Channel {{IRC|//|network=hackint}}'''.
* [[URLTeam]]: URL shorteners were a fucking awful idea. '''IRC Channel {{IRC|urlteam|network=hackint}}'''.
* [[URLTeam]]: URL shorteners were a fucking awful idea. '''IRC Channel {{IRC|urlteam|network=hackint}}'''.


''There will be fewer Warrior projects than usual due to the virtual appliance being unable to run many newer projects that utilize wget-at. It will take a little bit of time before an updated version is available that can run it.''
''An updated Warrior virtual appliance (v3.2) is now available with better support for newer projects that utilize wget-at. Please download it using the link above.''
 
<!--
=== Scripts only ===
=== Scripts only === -->
* .EU domains: The Brexit deal is done, and with that comes a purge of UK-based sites no longer eligible to use the .EU domain as of 2021. '''IRC Channel {{IRC|noteurdomain|network=hackint}}'''.
* [[Endomondo]]: GPS workout tracker with optional social networking features, shutting down 2020-12-31. '''IRC Channel {{IRC|findelmundo|network=hackint}}'''.
* [[Flash]] domains: An effort to preserve what remains of a storied legacy of websites hosting Adobe Flash Player content before the web industry takes it behind the shed. '''IRC Channel {{IRC|flashbang|network=hackint}}'''.
* Classic [[Google Sites]]: Making more sites inaccessible to the public starting September 1, 2021. '''IRC Channel {{IRC|nearlylostmygoogles|network=hackint}}'''.
* [[Reddit]]: Banning communities that generate bad PR for Reddit Inc. Currently grabbing ''new'' material. '''IRC Channel {{IRC|shreddit|network=hackint}}'''.
* [[GitHub]]: Embraced-uh, I mean, bought by Microsoft. '''IRC Channel {{IRC|gitgud|network=hackint}}'''.


== Manual projects ==
== Manual projects ==
* [[Coronavirus|2019-2020 coronavirus outbreak]]: Documenting and preserving data, events, and impacts of the virus on society. '''IRC Channel {{IRC|coronarchive|network=hackint}}'''
* [[Coronavirus|2019-2021 coronavirus outbreak]]: Documenting and preserving data, events, and impacts of the virus on society. '''IRC Channel {{IRC|coronarchive|network=hackint}}'''
* [[ArchiveBot]]: For those with lots of disk space, bandwidth and long-term commitment. '''IRC Channel {{IRC|archivebot|network=hackint}}'''.
* [[ArchiveBot]]: For those with lots of disk space, bandwidth and long-term commitment. '''IRC Channel {{IRC|archivebot|network=hackint}}'''.
* [[WikiTeam]]: Saving wikis dumps (XML). And their external links for the Wayback Machine (WARC) as well as exporting MediaWiki databases. Permanent effort, [https://github.com/WikiTeam/wikiteam/wiki/Tutorial#I_have_no_shell_access_to_server everyone can help] (you choose the size of your downloads). '''IRC Channel {{IRC|wikiteam|network=hackint}}'''.
* [[WikiTeam]]: Saving wikis dumps (XML). And their external links for the Wayback Machine (WARC) as well as exporting MediaWiki databases. Permanent effort, [https://github.com/WikiTeam/wikiteam/wiki/Tutorial#I_have_no_shell_access_to_server everyone can help] (you choose the size of your downloads). '''IRC Channel {{IRC|wikiteam|network=hackint}}'''.
* [[MP3.com]]: Digging through the WayBack Machine's archives to build a database of all the DAM CDs made available through the site.


== Upcoming & proposed projects ==
== Upcoming & proposed projects ==
Line 33: Line 37:
<!-- Top priority: could disappear anytime now -->
<!-- Top priority: could disappear anytime now -->
<!-- Shutting down, definite deadline given -->
<!-- Shutting down, definite deadline given -->
* [[Fast.io]]: A CDN for cloud storage services which will evaporate completely on 2021-01-15. '''IRC Channel {{IRC|slowio|network=hackint}}'''.
* [[Halo]]: Back to finishing off unfinished business before Bungie kills the original website on February 9, 2021. '''IRC Channel {{IRC|yolohalo|network=hackint}}'''.
* [[Webs]]: Vistaprint is killing off the Freewebs you knew from the 2000s on March 31, 2021, unless you pay up. '''IRC Channel {{IRC|webbed|network=hackint}}'''.
* [[Periscope]]: Another Twitter acquisition, another shutdown. This time, its live-streamer gets to join Vine in the bin at the end of March. '''IRC Channel {{IRC|microscope|network=hackint}}'''.
* [[Google Poly]]: A 3D art repository that Google will send to the trash compactors on June 30, 2021. New uploads cease April 30. '''IRC Channel {{IRC|polygone|network=hackint}}'''.
* [[Chrome Web Store]]: Google has announced a timeline of policy changes that will lead to content being removed between December 1, 2020 and June 2022. '''IRC Channel {{IRC|chromeweblore|network=hackint}}'''.
* [[Chrome Web Store]]: Google has announced a timeline of policy changes that will lead to content being removed between December 1, 2020 and June 2022. '''IRC Channel {{IRC|chromeweblore|network=hackint}}'''.
<!-- Shutting down, vague deadline given -->
<!-- Shutting down, vague deadline given -->
Line 44: Line 43:
<!-- Shutting down, no deadline given -->
<!-- Shutting down, no deadline given -->
<!-- Archiving the archives -->
<!-- Archiving the archives -->
* [[MediaFire]]: [https://twitter.com/textfiles/status/1339625133363912706 Setting fire to accounts and files it deems "abandoned" starting in January 2021.] '''IRC Channel {{IRC|mediaonfire|network=hackint}}'''.
<!-- Misc. projects (unmaintained sites, distrust in owners) -->
<!-- Misc. projects (unmaintained sites, distrust in owners) -->
* [[YouTube]]: Archiving all YouTube metadata and selected videos afterwards soon. '''IRC Channel {{IRC|down-the-tube|network=hackint}}'''.
* [[Imgur]]: Image hoster decided that using it for hosting images is not permitted. '''IRC Channel {{IRC|imgone}}'''.
* [[Imgur]]: Image hoster decided that using it for hosting images is not permitted. '''IRC Channel {{IRC|imgone}}'''.
* [[JamiiForums]]: the Tanzanian government would like this gone. '''IRC Channel {{IRC|jammedforums}}'''.
* [[JamiiForums]]: the Tanzanian government would like this gone. '''IRC Channel {{IRC|jammedforums}}'''.
Line 56: Line 55:
== Recently finished projects ==
== Recently finished projects ==
<!-- put projects here that are still in the tracker but not yet deleted so it won't confuse people -->
<!-- put projects here that are still in the tracker but not yet deleted so it won't confuse people -->
* [[SmackJeeves]]: Webcomics host being tossed into the incinerator on 2020-12-31. '''IRC Channel {{IRC|archiveteam-bs|network=hackint}}'''.
* [[CodePlex]]: Microsoft's self-archive will be permanently removed from its Recycle Bin after July 1, 2021. '''IRC Channel {{IRC|plexicode|network=hackint}}'''.
* [[Voat]]: A reddit competitor from the Ellen Pao days gives its users a Christmas present: it's fucking dead! '''IRC Channel {{IRC|scrapevoat|network=hackint}}'''.
* [[Google Poly]]: A 3D art repository that Google will send to the trash compactors on June 30, 2021. New uploads cease April 30. '''IRC Channel {{IRC|polygone|network=hackint}}'''.
* [[Bintray]]: JFrog is dismantling the software distribution platform used by numerous projects in May. '''IRC Channel {{IRC|binnedtray|network=hackint}}'''.


== Hiatus / Missed the Mark ==
== Hiatus / Missed the Mark ==
* [[Tinkercad]]: Autodesk announced its intent to put designs from inactive OAuth accounts back into minds around May 24, 2021. '''IRC Channel {{IRC|tinkerhad|network=hackint}}'''.
* [[Angelfire]]: Angelfire is a web hosting service that contains big chunks of early WWW history and has no proper backup. '''IRC Channel {{IRC|angelonfire}}'''.
* [[Angelfire]]: Angelfire is a web hosting service that contains big chunks of early WWW history and has no proper backup. '''IRC Channel {{IRC|angelonfire}}'''.
* [[Audit2014|Audit 2014]]: It's time to verify our shit. '''IRC Channel {{IRC|auditteam}}'''. THIS PROJECT IS ON HIATUS AND WILL BE RETURNED TO AS AUDIT2018.
* [[Audit2014|Audit 2014]]: It's time to verify our shit. '''IRC Channel {{IRC|auditteam}}'''. THIS PROJECT IS ON HIATUS AND WILL BE RETURNED TO AS AUDIT2018.

Revision as of 07:39, 8 September 2021

Archive Team recruiting

Warrior-based projects

ArchiveTeam's Choice: Telegram
  • Afghanistan: Archiving the Afghan web due to recent events. IRC Channel #afghansites (on hackint).
  • XTube: The shutdown on 5 September 2021 will surely leave a gaping hole in the web. IRC Channel #nevermind (on hackint).
  • Google Drive: Google will break millions of shared Drive links on 13 September 2021. IRC Channel #googlecrash (on hackint).
  • Classic Google Sites: Making more sites inaccessible to the public starting September 1, 2021. IRC Channel #nearlylostmygoogles (on hackint).
  • Periscope: Another Twitter acquisition, another shutdown. This time, its live-streamer gets to join Vine in the bin at the end of March. IRC Channel #microscope (on hackint).
  • Webs: Vistaprint is killing off the Freewebs you knew from the 2000s on March 31 June 30, 2021, unless you pay up. IRC Channel #webbed (on hackint).
  • GitHub: Embraced-uh, I mean, bought by Microsoft. IRC Channel #gitgud (on hackint).
  • MediaFire: Not 'at-risk' but grabbing speculatively to save historic files IRC Channel #mediaonfire (on hackint).
  • Reddit: Banning communities that generate bad PR for Reddit Inc. Currently grabbing new material. IRC Channel #shreddit (on hackint).
  • URLs: A random collection of stuff. IRC Channel #// (on hackint).
  • URLTeam: URL shorteners were a fucking awful idea. IRC Channel #urlteam (on hackint).

An updated Warrior virtual appliance (v3.2) is now available with better support for newer projects that utilize wget-at. Please download it using the link above.

Manual projects

  • 2019-2021 coronavirus outbreak: Documenting and preserving data, events, and impacts of the virus on society. IRC Channel #coronarchive (on hackint)
  • ArchiveBot: For those with lots of disk space, bandwidth and long-term commitment. IRC Channel #archivebot (on hackint).
  • WikiTeam: Saving wikis dumps (XML). And their external links for the Wayback Machine (WARC) as well as exporting MediaWiki databases. Permanent effort, everyone can help (you choose the size of your downloads). IRC Channel #wikiteam (on hackint).

Upcoming & proposed projects

Recently finished projects

  • CodePlex: Microsoft's self-archive will be permanently removed from its Recycle Bin after July 1, 2021. IRC Channel #plexicode (on hackint).
  • Google Poly: A 3D art repository that Google will send to the trash compactors on June 30, 2021. New uploads cease April 30. IRC Channel #polygone (on hackint).
  • Bintray: JFrog is dismantling the software distribution platform used by numerous projects in May. IRC Channel #binnedtray (on hackint).

Hiatus / Missed the Mark

ArchiveTeam primarily uses the hackint IRC network – ircs://irc.hackint.org:6697 (TLS required) – webchat: https://webirc.hackint.org/More info ArchiveTeam also has some channels left on the EFnet IRC network – irc://irc.efnet.org – webchat: http://chat.efnet.org:9090More info