Difference between revisions of "Current Projects"

From Archiveteam
(Remove Freeml from MtM)
(reddit to scripts-only)
Line 20: Line 20:
* [[Flash]] domains: An effort to preserve what remains of a storied legacy of websites hosting Adobe Flash Player content before the web industry takes it behind the shed. '''IRC Channel {{IRC|flashbang|network=hackint}}'''.
* Classic [[Google Sites]]: Making more sites inaccessible to the public starting September 1, 2021. '''IRC Channel {{IRC|nearlylostmygoogles|network=hackint}}'''.
* [[Reddit]]: Banning communities that generate bad PR for Reddit Inc. Currently grabbing ''new'' material. '''IRC Channel {{IRC|shreddit|network=hackint}}'''.
* [[GitHub]]: Embraced-uh, I mean, bought by Microsoft. '''IRC Channel {{IRC|gitgud|network=hackint}}'''.


Line 47: Line 48:
* [[LiveJournal]]: Very old, widely regarded as in decline, and has a lot of important stuff buried in it. '''IRC Channel {{IRC|recordedjournal}}'''.
* [[Ownlog]]: Ownlog is losing popularity and support from its owners. '''IRC Channel {{IRC|pwnlog}}'''.
* [[Reddit]]: Banning communities that generate bad PR for Reddit Inc. '''IRC Channel {{IRC|shreddit|network=hackint}}'''.
* [[The Pirate Bay]]: Recently came back up, grabbing an archive for sanity's sake. '''IRC Channel {{IRC|yarharfiddlededee}}'''.
* [[Valhalla]]: Where to store what even the [[Internet Archive]] doesn't have space for? '''IRC Channel {{IRC|huntinggrounds}}'''.

Revision as of 00:28, 9 January 2021

Archive Team recruiting

Warrior-based projects

ArchiveTeam's Choice: Telegram
  • URLTeam: URL shorteners were a fucking awful idea. IRC Channel #urlteam (on hackint).

There will be fewer Warrior projects than usual because the virtual appliance cannot run many newer projects that use wget-at. It will take some time before an updated version that can run them is available.

Scripts only

  • .EU domains: The Brexit deal is done, and with that comes a purge of UK-based sites no longer eligible to use the .EU domain as of 2021. IRC Channel #noteurdomain (on hackint).
  • Endomondo: GPS workout tracker with optional social networking features, shutting down 2020-12-31. IRC Channel #findelmundo (on hackint).
  • Flash domains: An effort to preserve what remains of a storied legacy of websites hosting Adobe Flash Player content before the web industry takes it behind the shed. IRC Channel #flashbang (on hackint).
  • Classic Google Sites: Making more sites inaccessible to the public starting September 1, 2021. IRC Channel #nearlylostmygoogles (on hackint).
  • Reddit: Banning communities that generate bad PR for Reddit Inc. Currently grabbing new material. IRC Channel #shreddit (on hackint).
  • GitHub: Embraced-uh, I mean, bought by Microsoft. IRC Channel #gitgud (on hackint).

Manual projects

  • 2019-2020 coronavirus outbreak: Documenting and preserving data, events, and impacts of the virus on society. IRC Channel #coronarchive (on hackint).
  • ArchiveBot: For those with lots of disk space, bandwidth and long-term commitment. IRC Channel #archivebot (on hackint).
  • WikiTeam: Saving wiki dumps (XML) and their external links for the Wayback Machine (WARC), as well as exporting MediaWiki databases. Permanent effort, everyone can help (you choose the size of your downloads). IRC Channel #wikiteam (on hackint).
  • MP3.com: Digging through the Wayback Machine's archives to build a database of all the DAM CDs made available through the site.

Upcoming & proposed projects

Recently finished projects

  • SmackJeeves: Webcomics host being tossed into the incinerator on 2020-12-31. IRC Channel #archiveteam-bs (on hackint).
  • Voat: A reddit competitor from the Ellen Pao days gives its users a Christmas present: it's fucking dead! IRC Channel #scrapevoat (on hackint).
  • Freshlive: A Japanese video host that's being closed to the public on December 18th. IRC Channel #archiveteam-bs (on hackint).
  • Tencent Weibo: Chinese Twitter-clone miniblog shutting down on September 28, 2020. IRC Channel #twocents (on hackint).

Hiatus / Missed the Mark

ArchiveTeam primarily uses the hackint IRC network – ircs://irc.hackint.org:6697 (TLS required) – webchat: https://webirc.hackint.org/

ArchiveTeam also has some channels left on the EFnet IRC network – irc://irc.efnet.org – webchat: http://chat.efnet.org:9090
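
For anyone scripting a connection, the two IRC URLs above can be broken into host, port, and TLS flag. This is only a rough sketch: the helper name `parse_irc_url` is made up for illustration, and the fallback ports (6667 for plain IRC, 6697 for TLS) are the conventional defaults, assumed here because the EFnet URL does not state a port.

```python
from urllib.parse import urlsplit

# Conventional default ports, assumed for URLs that omit one:
# 6667 for plain-text IRC, 6697 for IRC over TLS.
DEFAULT_PORTS = {"irc": 6667, "ircs": 6697}


def parse_irc_url(url):
    """Return (host, port, use_tls) for an irc:// or ircs:// URL.

    Hypothetical helper, not part of any ArchiveTeam tooling.
    """
    parts = urlsplit(url)
    if parts.scheme not in DEFAULT_PORTS:
        raise ValueError(f"not an IRC URL: {url!r}")
    # Fall back to the conventional port when the URL gives none.
    port = parts.port or DEFAULT_PORTS[parts.scheme]
    # The "ircs" scheme signals that the connection must use TLS.
    return parts.hostname, port, parts.scheme == "ircs"


if __name__ == "__main__":
    for url in ("ircs://irc.hackint.org:6697", "irc://irc.efnet.org"):
        print(parse_irc_url(url))
```

The returned tuple can be handed to whatever IRC client or library is in use; the TLS flag matters for hackint, which requires it.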