Archiveteam - User contributions [en]
https://wiki.archiveteam.org/api.php?action=feedcontributions&user=Joaquinito01&feedformat=atom
Updated 2024-03-29T09:34:36Z, MediaWiki 1.37.1
https://wiki.archiveteam.org/index.php?title=Current_Projects&diff=44903
Current Projects
2020-06-27T13:25:40Z
<p>Joaquinito01: </p>
<hr />
<div>__NOTOC__<br />
== Archive Team recruiting ==<br />
* [[Dev|Want to code for Archive Team? Here's a starting point.]]<br />
* Help us: '''[[warrior|☞ Download and run your warrior ☜]]'''.<br><br />
* What's on: [http://tracker.archiveteam.org/ online tracker].<br><br />
<!--Combined project activity graphs [http://zeppelin.xrtc.net/corp.xrtc.net/shilling.corp.xrtc.net/project_items.html here].--><br />
<br />
== Warrior-based projects ==<br />
{{:CurrentWarriorProject}}<br />
<br />
<!-- Urgent projects --><br />
<!-- Long-term projects --><br />
* [[URLTeam]]: URL shorteners were a fucking awful idea. '''IRC Channel {{IRC|urlteam}}'''.<br />
<br />
=== Scripts only ===<br />
* [[Bitbucket]]: Kicking the bucket on Mercurial repositories by July 1 2020 to worship Git instead. '''IRC Channel {{IRC|kickthebucket|network=hackint}}'''.<br />
<br />
== Manual projects ==<br />
* [[Coronavirus|2019-2020 coronavirus outbreak]]: Documenting and preserving data, events, and impacts of the virus on society. '''IRC Channel {{IRC|coronarchive}}'''<br />
* [[Yahoo! Groups]]: Years of internet threads, soon to go private-only. '''IRC Channel {{IRC|yahoosucks|network=hackint}}'''<br />
* [[ArchiveBot]]: For those with lots of disk space, bandwidth and long-term commitment. '''IRC Channel {{IRC|archivebot}}'''.<br />
* [[WikiTeam]]: Saving wiki dumps (XML) and their external links for the Wayback Machine (WARC), as well as exporting MediaWiki databases. Permanent effort, [https://github.com/WikiTeam/wikiteam/wiki/Tutorial#I_have_no_shell_access_to_server everyone can help] (you choose the size of your downloads). '''IRC Channel {{IRC|wikiteam}}'''.<br />
* [[MP3.com]]: Digging through the Wayback Machine's archives to build a database of all the DAM CDs made available through the site. '''IRC Channel {{IRC|mp3lose|network=hackint}}'''.<br />
<br />
== Upcoming & proposed projects ==<br />
<!-- Websites you would like to have archived. Please create a wikipage about the project with information about the website (shutting down? (when), why should it be archived, etc.). --><br />
<!-- Top priority: could disappear anytime now --><br />
<!-- Shutting down, definite deadline given --><br />
<!-- Shutting down, vague deadline given --><br />
* [[Kinja]]: Deleting all user pages, maybe? '''IRC Channel {{IRC|gokinjagokinjago}}'''.<br />
* [[Mixer]]: Video game streaming network shutting down 2020-07-23. '''IRC Channel {{IRC|mixdown|network=hackint}}'''<br />
* [[Twitter]]: Deleting inactive accounts <s>2019-12-11</s> sometime. '''IRC Channel {{IRC|twitterdead}}'''.<br />
<!-- Shutting down, no deadline given --><br />
<!-- Archiving the archives --><br />
<!-- Misc. projects (unmaintained sites, distrust in owners) --><br />
* [[GitHub]]: Embraced-uh, I mean, bought by Microsoft. '''IRC Channel {{IRC|gitgud|network=hackint}}'''.<br />
* [[Imgur]]: Image hoster decided that using it for hosting images is not permitted. '''IRC Channel {{IRC|imgone}}'''.<br />
* [[JamiiForums]]: the Tanzanian government would like this gone. '''IRC Channel {{IRC|jammedforums}}'''.<br />
* [[LiveJournal]]: Very old, widely regarded as in decline, and has a lot of important stuff buried in it. '''IRC Channel {{IRC|recordedjournal}}'''.<br />
* [[Ownlog]]: Ownlog is losing popularity and support from its owners. '''IRC Channel {{IRC|pwnlog}}'''.<br />
* [[Reddit]]: Banning communities that generate bad PR for Reddit Inc. '''IRC Channel {{IRC|shreddit|network=hackint}}'''.<br />
* [[The Pirate Bay]]: Recently came back up, grabbing an archive for sanity's sake. '''IRC Channel {{IRC|yarharfiddlededee}}'''.<br />
* [[Valhalla]]: Where to store what even the [[Internet Archive]] doesn't have space for? '''IRC Channel {{IRC|huntinggrounds}}'''.<br />
* [[Giphy]]: Bought by Facebook, to be "integrated" (assimilated) into Instagram https://news.knowyourmeme.com/news/facebook-to-buy-giphy<br />
<br />
== Recently finished projects ==<br />
<!-- put projects here that are still in the tracker but not yet deleted so it won't confuse people --><br />
* [[League of Legends#LoL_Boards_&_Forum|League of Legends Boards & Forum]]: Riot forcing Boards to surrender at 20 then smiting it and the forum archive on March 16th. '''IRC Channel {{IRC|archiveteam-bs}}'''.<br />
* [[8tracks]]: social network around audio streaming and creating playlists, was going to shut down 2019-12-31 but has been acquired and restored. '''IRC Channel {{IRC|8ball|network=hackint}}'''.<br />
* [[Plays.tv]]: Stopping.tv on 2019-12-15. '''IRC Channel {{IRC|stops.tv|network=hackint}}'''.<br />
* [[YouTube]]: Making playlists of liked videos private on 2019-12-05. '''IRC Channel {{IRC|down-the-tube|network=hackint}}'''.<br />
<br />
== Hiatus / Missed the Mark ==<br />
* [[Angelfire]]: Angelfire is a web hosting service that contains big chunks of early WWW history and has no proper backup. '''IRC Channel {{IRC|angelonfire}}'''.<br />
* [[Audit2014|Audit 2014]]: It's time to verify our shit. '''IRC Channel {{IRC|auditteam}}'''. THIS PROJECT IS ON HIATUS AND WILL BE RETURNED TO AS AUDIT2018.<br />
* [[Flickr]]: <s>[[Yahoo!]]</s> SmugMug decided to kill it after finding Yahoo!'s plans to do so before they were bought by Verizon. '''IRC Channel {{IRC|flickrfckr}}'''.<br />
* [[Freeml]]: Japanese mailing list provider is sending its final email 2019-12-02. '''IRC Channel {{IRC|fml}}'''.<br />
* [[FTP]]: Help us find and download all FTP sites! '''IRC Channel {{IRC|effteepee}}'''.<br />
* [[Google Groups]]: "Gone within a year" ([[User:Jscott|SketchCow]], 2016-06-07).<br />
* [[Google News Archive]]: Let's store all newspapers at Google, WCGW? '''IRC Channel {{IRC|papersplease}}'''.<br />
* [[DevPort]]: This [http://developerportfolio.com/ portfolio SaaS provider] has [http://www.lowendtalk.com/discussion/65135/need-some-help-saas-provider-is-dead-but-my-site-is-still-up-how-should-i-grab-it reportedly] been having infrastructure issues, and removed their social media accounts. Possible impending shutdown.<br />
* [[INTERNETARCHIVE.BAK]]: Grab a slice of the big cake of [[Internet Archive|The Archive]]! '''IRC Channel {{IRC|internetarchive.bak}}'''.<br />
* [[ISP Hosting]]: Finding ISP web hosting services before the Grim Reaper finds them. '''IRC Channel {{IRC|webroasting}}'''.<br />
* [[NewsGrabber]]: Saving all news articles. <!-- Help with server power or by finding more news sites.-->Currently paused. '''IRC Channel {{IRC|newsgrabber}}'''.<br />
* [[Project Newsletter]]: Archiving e-newsletters, currently in development. '''IRC Channel {{IRC|projectnewsletter}}'''.<br />
* [[Quizlet]]: Flashcards and other learning tools. '''IRC Channel {{IRC|quizletusin}}'''.<br />
* [[Tumblr]]: [[Yahoo!]] considered killing it, now Yahoo has been acquired and Verizon declared war on NSFW blogs. '''IRC Channel {{IRC|tumbledown}}'''.<br />
* [[yuku]]: Lately yuku is very unstable and hosting thousands of forums. Project currently paused. '''IRC Channel {{IRC|archiveteam}}'''.<br />
<br />
<small>ArchiveTeam primarily uses the EFnet IRC network – irc://irc.efnet.org – webchat: http://chat.efnet.org:9090 – [[Archiveteam:IRC|More info]]</small><br><br />
<small>ArchiveTeam also uses the hackint IRC network – ircs://irc.hackint.org:6697 (TLS required) – webchat: https://webirc.hackint.org/ – [[Archiveteam:IRC|More info]]</small></div>
Joaquinito01
https://wiki.archiveteam.org/index.php?title=Current_Projects&diff=44902
Current Projects
2020-06-27T13:24:58Z
<p>Joaquinito01: /* Manual projects */</p>
<hr />
<div>__NOTOC__<br />
== Archive Team recruiting ==<br />
* [[Dev|Want to code for Archive Team? Here's a starting point.]]<br />
* Help us: '''[[warrior|☞ Download and run your warrior ☜]]'''.<br><br />
* What's on: [http://tracker.archiveteam.org/ online tracker].<br><br />
<!--Combined project activity graphs [http://zeppelin.xrtc.net/corp.xrtc.net/shilling.corp.xrtc.net/project_items.html here].--><br />
<br />
== Warrior-based projects ==<br />
{{:CurrentWarriorProject}}<br />
<br />
<!-- Urgent projects --><br />
<!-- Long-term projects --><br />
* [[URLTeam]]: URL shorteners were a fucking awful idea. '''IRC Channel {{IRC|urlteam}}'''.<br />
<br />
=== Scripts only ===<br />
* [[Bitbucket]]: Kicking the bucket on Mercurial repositories by July 1 2020 to worship Git instead. '''IRC Channel {{IRC|kickthebucket|network=hackint}}'''.<br />
<br />
== Manual projects ==<br />
* [[Coronavirus|2019-2020 coronavirus outbreak]]: Documenting and preserving data, events, and impacts of the virus on society. '''IRC Channel {{IRC|coronarchive}}'''<br />
* [[Yahoo! Groups]]: Years of internet threads, soon to go private-only. '''IRC Channel {{IRC|yahoosucks|network=hackint}}'''<br />
* [[ArchiveBot]]: For those with lots of disk space, bandwidth and long-term commitment. '''IRC Channel {{IRC|archivebot}}'''.<br />
* [[WikiTeam]]: Saving wiki dumps (XML) and their external links for the Wayback Machine (WARC), as well as exporting MediaWiki databases. Permanent effort, [https://github.com/WikiTeam/wikiteam/wiki/Tutorial#I_have_no_shell_access_to_server everyone can help] (you choose the size of your downloads). '''IRC Channel {{IRC|wikiteam}}'''.<br />
* [[MP3.com]]: Digging through the Wayback Machine's archives to build a database of all the DAM CDs made available through the site. '''IRC Channel {{IRC|mp3lose}}'''.<br />
<br />
== Upcoming & proposed projects ==<br />
<!-- Websites you would like to have archived. Please create a wikipage about the project with information about the website (shutting down? (when), why should it be archived, etc.). --><br />
<!-- Top priority: could disappear anytime now --><br />
<!-- Shutting down, definite deadline given --><br />
<!-- Shutting down, vague deadline given --><br />
* [[Kinja]]: Deleting all user pages, maybe? '''IRC Channel {{IRC|gokinjagokinjago}}'''.<br />
* [[Mixer]]: Video game streaming network shutting down 2020-07-23. '''IRC Channel {{IRC|mixdown|network=hackint}}'''<br />
* [[Twitter]]: Deleting inactive accounts <s>2019-12-11</s> sometime. '''IRC Channel {{IRC|twitterdead}}'''.<br />
<!-- Shutting down, no deadline given --><br />
<!-- Archiving the archives --><br />
<!-- Misc. projects (unmaintained sites, distrust in owners) --><br />
* [[GitHub]]: Embraced-uh, I mean, bought by Microsoft. '''IRC Channel {{IRC|gitgud|network=hackint}}'''.<br />
* [[Imgur]]: Image hoster decided that using it for hosting images is not permitted. '''IRC Channel {{IRC|imgone}}'''.<br />
* [[JamiiForums]]: the Tanzanian government would like this gone. '''IRC Channel {{IRC|jammedforums}}'''.<br />
* [[LiveJournal]]: Very old, widely regarded as in decline, and has a lot of important stuff buried in it. '''IRC Channel {{IRC|recordedjournal}}'''.<br />
* [[Ownlog]]: Ownlog is losing popularity and support from its owners. '''IRC Channel {{IRC|pwnlog}}'''.<br />
* [[Reddit]]: Banning communities that generate bad PR for Reddit Inc. '''IRC Channel {{IRC|shreddit|network=hackint}}'''.<br />
* [[The Pirate Bay]]: Recently came back up, grabbing an archive for sanity's sake. '''IRC Channel {{IRC|yarharfiddlededee}}'''.<br />
* [[Valhalla]]: Where to store what even the [[Internet Archive]] doesn't have space for? '''IRC Channel {{IRC|huntinggrounds}}'''.<br />
* [[Giphy]]: Bought by Facebook, to be "integrated" (assimilated) into Instagram https://news.knowyourmeme.com/news/facebook-to-buy-giphy<br />
<br />
== Recently finished projects ==<br />
<!-- put projects here that are still in the tracker but not yet deleted so it won't confuse people --><br />
* [[League of Legends#LoL_Boards_&_Forum|League of Legends Boards & Forum]]: Riot forcing Boards to surrender at 20 then smiting it and the forum archive on March 16th. '''IRC Channel {{IRC|archiveteam-bs}}'''.<br />
* [[8tracks]]: social network around audio streaming and creating playlists, was going to shut down 2019-12-31 but has been acquired and restored. '''IRC Channel {{IRC|8ball|network=hackint}}'''.<br />
* [[Plays.tv]]: Stopping.tv on 2019-12-15. '''IRC Channel {{IRC|stops.tv|network=hackint}}'''.<br />
* [[YouTube]]: Making playlists of liked videos private on 2019-12-05. '''IRC Channel {{IRC|down-the-tube|network=hackint}}'''.<br />
<br />
== Hiatus / Missed the Mark ==<br />
* [[Angelfire]]: Angelfire is a web hosting service that contains big chunks of early WWW history and has no proper backup. '''IRC Channel {{IRC|angelonfire}}'''.<br />
* [[Audit2014|Audit 2014]]: It's time to verify our shit. '''IRC Channel {{IRC|auditteam}}'''. THIS PROJECT IS ON HIATUS AND WILL BE RETURNED TO AS AUDIT2018.<br />
* [[Flickr]]: <s>[[Yahoo!]]</s> SmugMug decided to kill it after finding Yahoo!'s plans to do so before they were bought by Verizon. '''IRC Channel {{IRC|flickrfckr}}'''.<br />
* [[Freeml]]: Japanese mailing list provider is sending its final email 2019-12-02. '''IRC Channel {{IRC|fml}}'''.<br />
* [[FTP]]: Help us find and download all FTP sites! '''IRC Channel {{IRC|effteepee}}'''.<br />
* [[Google Groups]]: "Gone within a year" ([[User:Jscott|SketchCow]], 2016-06-07).<br />
* [[Google News Archive]]: Let's store all newspapers at Google, WCGW? '''IRC Channel {{IRC|papersplease}}'''.<br />
* [[DevPort]]: This [http://developerportfolio.com/ portfolio SaaS provider] has [http://www.lowendtalk.com/discussion/65135/need-some-help-saas-provider-is-dead-but-my-site-is-still-up-how-should-i-grab-it reportedly] been having infrastructure issues, and removed their social media accounts. Possible impending shutdown.<br />
* [[INTERNETARCHIVE.BAK]]: Grab a slice of the big cake of [[Internet Archive|The Archive]]! '''IRC Channel {{IRC|internetarchive.bak}}'''.<br />
* [[ISP Hosting]]: Finding ISP web hosting services before the Grim Reaper finds them. '''IRC Channel {{IRC|webroasting}}'''.<br />
* [[NewsGrabber]]: Saving all news articles. <!-- Help with server power or by finding more news sites.-->Currently paused. '''IRC Channel {{IRC|newsgrabber}}'''.<br />
* [[Project Newsletter]]: Archiving e-newsletters, currently in development. '''IRC Channel {{IRC|projectnewsletter}}'''.<br />
* [[Quizlet]]: Flashcards and other learning tools. '''IRC Channel {{IRC|quizletusin}}'''.<br />
* [[Tumblr]]: [[Yahoo!]] considered killing it, now Yahoo has been acquired and Verizon declared war on NSFW blogs. '''IRC Channel {{IRC|tumbledown}}'''.<br />
* [[yuku]]: Lately yuku is very unstable and hosting thousands of forums. Project currently paused. '''IRC Channel {{IRC|archiveteam}}'''.<br />
<br />
<small>ArchiveTeam primarily uses the EFnet IRC network – irc://irc.efnet.org – webchat: http://chat.efnet.org:9090 – [[Archiveteam:IRC|More info]]</small><br><br />
<small>ArchiveTeam also uses the hackint IRC network – ircs://irc.hackint.org:6697 (TLS required) – webchat: https://webirc.hackint.org/ – [[Archiveteam:IRC|More info]]</small></div>
Joaquinito01
https://wiki.archiveteam.org/index.php?title=Giphy&diff=44855
Giphy
2020-06-24T09:54:17Z
<p>Joaquinito01: Upcoming...</p>
<hr />
<div>{{Infobox project<br />
| title = Giphy<br />
| URL = http://giphy.com<br />
| project_status = {{endangered}}<br />
| archiving_status = {{upcoming}}<br />
}}<br />
'''Giphy''' is a GIF sharing site. It was recently acquired by Facebook.<ref>https://news.knowyourmeme.com/news/facebook-to-buy-giphy</ref><ref>https://about.fb.com/news/2020/05/welcome-giphy/</ref><ref>https://medium.com/@giphy/giphy-to-join-facebook-as-part-of-the-instagram-team-e7ea8d32d7b6</ref><ref>https://reclaimthenet.org/facebook-giphy-sale-privacy/</ref> Apparently, it will be assimilated into Instagram.<br />
<br />
{{Navigation box}}</div>
Joaquinito01
https://wiki.archiveteam.org/index.php?title=Current_Projects&diff=44854
Current Projects
2020-06-24T09:50:28Z
<p>Joaquinito01: /* Upcoming & proposed projects */</p>
<hr />
<div>__NOTOC__<br />
== Archive Team recruiting ==<br />
* [[Dev|Want to code for Archive Team? Here's a starting point.]]<br />
* Help us: '''[[warrior|☞ Download and run your warrior ☜]]'''.<br><br />
* What's on: [http://tracker.archiveteam.org/ online tracker].<br><br />
<!--Combined project activity graphs [http://zeppelin.xrtc.net/corp.xrtc.net/shilling.corp.xrtc.net/project_items.html here].--><br />
<br />
== Warrior-based projects ==<br />
{{:CurrentWarriorProject}}<br />
<br />
<!-- Urgent projects --><br />
<!-- Long-term projects --><br />
* [[URLTeam]]: URL shorteners were a fucking awful idea. '''IRC Channel {{IRC|urlteam}}'''.<br />
<br />
=== Scripts only ===<br />
* [[Bitbucket]]: Kicking the bucket on Mercurial repositories by July 1 2020 to worship Git instead. '''IRC Channel {{IRC|kickthebucket|network=hackint}}'''.<br />
<br />
== Manual projects ==<br />
* [[Coronavirus|2019-2020 coronavirus outbreak]]: Documenting and preserving data, events, and impacts of the virus on society. '''IRC Channel {{IRC|coronarchive}}'''<br />
* [[Yahoo! Groups]]: Years of internet threads, soon to go private-only. '''IRC Channel {{IRC|yahoosucks|network=hackint}}'''<br />
* [[ArchiveBot]]: For those with lots of disk space, bandwidth and long-term commitment. '''IRC Channel {{IRC|archivebot}}'''.<br />
* [[WikiTeam]]: Saving wiki dumps (XML) and their external links for the Wayback Machine (WARC), as well as exporting MediaWiki databases. Permanent effort, [https://github.com/WikiTeam/wikiteam/wiki/Tutorial#I_have_no_shell_access_to_server everyone can help] (you choose the size of your downloads). '''IRC Channel {{IRC|wikiteam}}'''.<br />
* [[MP3.com]]: Digging through the Wayback Machine's archives to build a database of all the DAM CDs made available through the site.<br />
<br />
== Upcoming & proposed projects ==<br />
<!-- Websites you would like to have archived. Please create a wikipage about the project with information about the website (shutting down? (when), why should it be archived, etc.). --><br />
<!-- Top priority: could disappear anytime now --><br />
<!-- Shutting down, definite deadline given --><br />
<!-- Shutting down, vague deadline given --><br />
* [[Kinja]]: Deleting all user pages, maybe? '''IRC Channel {{IRC|gokinjagokinjago}}'''.<br />
* [[Mixer]]: Video game streaming network shutting down 2020-07-23. '''IRC Channel {{IRC|mixdown|network=hackint}}'''<br />
* [[Twitter]]: Deleting inactive accounts <s>2019-12-11</s> sometime. '''IRC Channel {{IRC|twitterdead}}'''.<br />
<!-- Shutting down, no deadline given --><br />
<!-- Archiving the archives --><br />
<!-- Misc. projects (unmaintained sites, distrust in owners) --><br />
* [[GitHub]]: Embraced-uh, I mean, bought by Microsoft. '''IRC Channel {{IRC|getgit}}'''.<br />
* [[Imgur]]: Image hoster decided that using it for hosting images is not permitted. '''IRC Channel {{IRC|imgone}}'''.<br />
* [[JamiiForums]]: the Tanzanian government would like this gone. '''IRC Channel {{IRC|jammedforums}}'''.<br />
* [[LiveJournal]]: Very old, widely regarded as in decline, and has a lot of important stuff buried in it. '''IRC Channel {{IRC|recordedjournal}}'''.<br />
* [[Ownlog]]: Ownlog is losing popularity and support from its owners. '''IRC Channel {{IRC|pwnlog}}'''.<br />
* [[Reddit]]: Banning communities that generate bad PR for Reddit Inc. '''IRC Channel {{IRC|shreddit|network=hackint}}'''.<br />
* [[The Pirate Bay]]: Recently came back up, grabbing an archive for sanity's sake. '''IRC Channel {{IRC|yarharfiddlededee}}'''.<br />
* [[Valhalla]]: Where to store what even the [[Internet Archive]] doesn't have space for? '''IRC Channel {{IRC|huntinggrounds}}'''.<br />
* [[Giphy]]: Bought by Facebook, to be "integrated" (assimilated) into Instagram https://news.knowyourmeme.com/news/facebook-to-buy-giphy '''IRC Channel {{IRC|giphyint|network=hackint}}'''<br />
<br />
== Recently finished projects ==<br />
<!-- put projects here that are still in the tracker but not yet deleted so it won't confuse people --><br />
* [[League of Legends#LoL_Boards_&_Forum|League of Legends Boards & Forum]]: Riot forcing Boards to surrender at 20 then smiting it and the forum archive on March 16th. '''IRC Channel {{IRC|archiveteam-bs}}'''.<br />
* [[8tracks]]: social network around audio streaming and creating playlists, was going to shut down 2019-12-31 but has been acquired and restored. '''IRC Channel {{IRC|8ball|network=hackint}}'''.<br />
* [[Plays.tv]]: Stopping.tv on 2019-12-15. '''IRC Channel {{IRC|stops.tv|network=hackint}}'''.<br />
* [[YouTube]]: Making playlists of liked videos private on 2019-12-05. '''IRC Channel {{IRC|down-the-tube|network=hackint}}'''.<br />
<br />
== Hiatus / Missed the Mark ==<br />
* [[Angelfire]]: Angelfire is a web hosting service that contains big chunks of early WWW history and has no proper backup. '''IRC Channel {{IRC|angelonfire}}'''.<br />
* [[Audit2014|Audit 2014]]: It's time to verify our shit. '''IRC Channel {{IRC|auditteam}}'''. THIS PROJECT IS ON HIATUS AND WILL BE RETURNED TO AS AUDIT2018.<br />
* [[Flickr]]: <s>[[Yahoo!]]</s> SmugMug decided to kill it after finding Yahoo!'s plans to do so before they were bought by Verizon. '''IRC Channel {{IRC|flickrfckr}}'''.<br />
* [[Freeml]]: Japanese mailing list provider is sending its final email 2019-12-02. '''IRC Channel {{IRC|fml}}'''.<br />
* [[FTP]]: Help us find and download all FTP sites! '''IRC Channel {{IRC|effteepee}}'''.<br />
* [[Google Groups]]: "Gone within a year" ([[User:Jscott|SketchCow]], 2016-06-07).<br />
* [[Google News Archive]]: Let's store all newspapers at Google, WCGW? '''IRC Channel {{IRC|papersplease}}'''.<br />
* [[DevPort]]: This [http://developerportfolio.com/ portfolio SaaS provider] has [http://www.lowendtalk.com/discussion/65135/need-some-help-saas-provider-is-dead-but-my-site-is-still-up-how-should-i-grab-it reportedly] been having infrastructure issues, and removed their social media accounts. Possible impending shutdown.<br />
* [[INTERNETARCHIVE.BAK]]: Grab a slice of the big cake of [[Internet Archive|The Archive]]! '''IRC Channel {{IRC|internetarchive.bak}}'''.<br />
* [[ISP Hosting]]: Finding ISP web hosting services before the Grim Reaper finds them. '''IRC Channel {{IRC|webroasting}}'''.<br />
* [[NewsGrabber]]: Saving all news articles. <!-- Help with server power or by finding more news sites.-->Currently paused. '''IRC Channel {{IRC|newsgrabber}}'''.<br />
* [[Project Newsletter]]: Archiving e-newsletters, currently in development. '''IRC Channel {{IRC|projectnewsletter}}'''.<br />
* [[Quizlet]]: Flashcards and other learning tools. '''IRC Channel {{IRC|quizletusin}}'''.<br />
* [[Tumblr]]: [[Yahoo!]] considered killing it, now Yahoo has been acquired and Verizon declared war on NSFW blogs. '''IRC Channel {{IRC|tumbledown}}'''.<br />
* [[yuku]]: Lately yuku is very unstable and hosting thousands of forums. Project currently paused. '''IRC Channel {{IRC|archiveteam}}'''.<br />
<br />
<small>ArchiveTeam primarily uses the EFnet IRC network – irc://irc.efnet.org – webchat: http://chat.efnet.org:9090 – [[Archiveteam:IRC|More info]]</small><br><br />
<small>ArchiveTeam also uses the hackint IRC network – ircs://irc.hackint.org:6697 (TLS required) – webchat: https://webirc.hackint.org/ – [[Archiveteam:IRC|More info]]</small></div>
Joaquinito01
https://wiki.archiveteam.org/index.php?title=Current_Projects&diff=44853
Current Projects
2020-06-24T09:50:00Z
<p>Joaquinito01: </p>
<hr />
<div>__NOTOC__<br />
== Archive Team recruiting ==<br />
* [[Dev|Want to code for Archive Team? Here's a starting point.]]<br />
* Help us: '''[[warrior|☞ Download and run your warrior ☜]]'''.<br><br />
* What's on: [http://tracker.archiveteam.org/ online tracker].<br><br />
<!--Combined project activity graphs [http://zeppelin.xrtc.net/corp.xrtc.net/shilling.corp.xrtc.net/project_items.html here].--><br />
<br />
== Warrior-based projects ==<br />
{{:CurrentWarriorProject}}<br />
<br />
<!-- Urgent projects --><br />
<!-- Long-term projects --><br />
* [[URLTeam]]: URL shorteners were a fucking awful idea. '''IRC Channel {{IRC|urlteam}}'''.<br />
<br />
=== Scripts only ===<br />
* [[Bitbucket]]: Kicking the bucket on Mercurial repositories by July 1 2020 to worship Git instead. '''IRC Channel {{IRC|kickthebucket|network=hackint}}'''.<br />
<br />
== Manual projects ==<br />
* [[Coronavirus|2019-2020 coronavirus outbreak]]: Documenting and preserving data, events, and impacts of the virus on society. '''IRC Channel {{IRC|coronarchive}}'''<br />
* [[Yahoo! Groups]]: Years of internet threads, soon to go private-only. '''IRC Channel {{IRC|yahoosucks|network=hackint}}'''<br />
* [[ArchiveBot]]: For those with lots of disk space, bandwidth and long-term commitment. '''IRC Channel {{IRC|archivebot}}'''.<br />
* [[WikiTeam]]: Saving wiki dumps (XML) and their external links for the Wayback Machine (WARC), as well as exporting MediaWiki databases. Permanent effort, [https://github.com/WikiTeam/wikiteam/wiki/Tutorial#I_have_no_shell_access_to_server everyone can help] (you choose the size of your downloads). '''IRC Channel {{IRC|wikiteam}}'''.<br />
* [[MP3.com]]: Digging through the Wayback Machine's archives to build a database of all the DAM CDs made available through the site.<br />
<br />
== Upcoming & proposed projects ==<br />
<!-- Websites you would like to have archived. Please create a wikipage about the project with information about the website (shutting down? (when), why should it be archived, etc.). --><br />
<!-- Top priority: could disappear anytime now --><br />
<!-- Shutting down, definite deadline given --><br />
<!-- Shutting down, vague deadline given --><br />
* [[Kinja]]: Deleting all user pages, maybe? '''IRC Channel {{IRC|gokinjagokinjago}}'''.<br />
* [[Mixer]]: Video game streaming network shutting down 2020-07-23. '''IRC Channel {{IRC|mixdown|network=hackint}}'''<br />
* [[Twitter]]: Deleting inactive accounts <s>2019-12-11</s> sometime. '''IRC Channel {{IRC|twitterdead}}'''.<br />
<!-- Shutting down, no deadline given --><br />
<!-- Archiving the archives --><br />
<!-- Misc. projects (unmaintained sites, distrust in owners) --><br />
* [[GitHub]]: Embraced-uh, I mean, bought by Microsoft. '''IRC Channel {{IRC|getgit}}'''.<br />
* [[Imgur]]: Image hoster decided that using it for hosting images is not permitted. '''IRC Channel {{IRC|imgone}}'''.<br />
* [[JamiiForums]]: the Tanzanian government would like this gone. '''IRC Channel {{IRC|jammedforums}}'''.<br />
* [[LiveJournal]]: Very old, widely regarded as in decline, and has a lot of important stuff buried in it. '''IRC Channel {{IRC|recordedjournal}}'''.<br />
* [[Ownlog]]: Ownlog is losing popularity and support from its owners. '''IRC Channel {{IRC|pwnlog}}'''.<br />
* [[Reddit]]: Banning communities that generate bad PR for Reddit Inc. '''IRC Channel {{IRC|shreddit|network=hackint}}'''.<br />
* [[The Pirate Bay]]: Recently came back up, grabbing an archive for sanity's sake. '''IRC Channel {{IRC|yarharfiddlededee}}'''.<br />
* [[Valhalla]]: Where to store what even the [[Internet Archive]] doesn't have space for? '''IRC Channel {{IRC|huntinggrounds}}'''.<br />
* [[Giphy]]: Bought by Facebook, to be "integrated" (assimilated) into Instagram https://news.knowyourmeme.com/news/facebook-to-buy-giphy '''IRC Channel {{IRC|giphyint|network=freenode}}'''<br />
<br />
== Recently finished projects ==<br />
<!-- put projects here that are still in the tracker but not yet deleted so it won't confuse people --><br />
* [[League of Legends#LoL_Boards_&_Forum|League of Legends Boards & Forum]]: Riot forcing Boards to surrender at 20 then smiting it and the forum archive on March 16th. '''IRC Channel {{IRC|archiveteam-bs}}'''.<br />
* [[8tracks]]: social network around audio streaming and creating playlists, was going to shut down 2019-12-31 but has been acquired and restored. '''IRC Channel {{IRC|8ball|network=hackint}}'''.<br />
* [[Plays.tv]]: Stopping.tv on 2019-12-15. '''IRC Channel {{IRC|stops.tv|network=hackint}}'''.<br />
* [[YouTube]]: Making playlists of liked videos private on 2019-12-05. '''IRC Channel {{IRC|down-the-tube|network=hackint}}'''.<br />
<br />
== Hiatus / Missed the Mark ==<br />
* [[Angelfire]]: Angelfire is a web hosting service that contains big chunks of early WWW history and has no proper backup. '''IRC Channel {{IRC|angelonfire}}'''.<br />
* [[Audit2014|Audit 2014]]: It's time to verify our shit. '''IRC Channel {{IRC|auditteam}}'''. THIS PROJECT IS ON HIATUS AND WILL BE RETURNED TO AS AUDIT2018.<br />
* [[Flickr]]: <s>[[Yahoo!]]</s> SmugMug decided to kill it after finding Yahoo!'s plans to do so before they were bought by Verizon. '''IRC Channel {{IRC|flickrfckr}}'''.<br />
* [[Freeml]]: Japanese mailing list provider is sending its final email 2019-12-02. '''IRC Channel {{IRC|fml}}'''.<br />
* [[FTP]]: Help us find and download all FTP sites! '''IRC Channel {{IRC|effteepee}}'''.<br />
* [[Google Groups]]: "Gone within a year" ([[User:Jscott|SketchCow]], 2016-06-07).<br />
* [[Google News Archive]]: Let's store all newspapers at Google, WCGW? '''IRC Channel {{IRC|papersplease}}'''.<br />
* [[DevPort]]: This [http://developerportfolio.com/ portfolio SaaS provider] has [http://www.lowendtalk.com/discussion/65135/need-some-help-saas-provider-is-dead-but-my-site-is-still-up-how-should-i-grab-it reportedly] been having infrastructure issues, and removed their social media accounts. Possible impending shutdown.<br />
* [[INTERNETARCHIVE.BAK]]: Grab a slice of the big cake of [[Internet Archive|The Archive]]! '''IRC Channel {{IRC|internetarchive.bak}}'''.<br />
* [[ISP Hosting]]: Finding ISP web hosting services before the Grim Reaper finds them. '''IRC Channel {{IRC|webroasting}}'''.<br />
* [[NewsGrabber]]: Saving all news articles. <!-- Help with server power or by finding more news sites.-->Currently paused. '''IRC Channel {{IRC|newsgrabber}}'''.<br />
* [[Project Newsletter]]: Archiving e-newsletters, currently in development. '''IRC Channel {{IRC|projectnewsletter}}'''.<br />
* [[Quizlet]]: Flashcards and other learning tools. '''IRC Channel {{IRC|quizletusin}}'''.<br />
* [[Tumblr]]: [[Yahoo!]] considered killing it, now Yahoo has been acquired and Verizon declared war on NSFW blogs. '''IRC Channel {{IRC|tumbledown}}'''.<br />
* [[yuku]]: Lately yuku is very unstable and hosting thousands of forums. Project currently paused. '''IRC Channel {{IRC|archiveteam}}'''.<br />
<br />
<small>ArchiveTeam primarily uses the EFnet IRC network – irc://irc.efnet.org – webchat: http://chat.efnet.org:9090 – [[Archiveteam:IRC|More info]]</small><br><br />
<small>ArchiveTeam also uses the hackint IRC network – ircs://irc.hackint.org:6697 (TLS required) – webchat: https://webirc.hackint.org/ – [[Archiveteam:IRC|More info]]</small></div>
Joaquinito01
https://wiki.archiveteam.org/index.php?title=GitHub&diff=44852
GitHub
2020-06-24T09:46:08Z
<p>Joaquinito01: Changing to upcoming...</p>
<hr />
<div>{{Infobox project<br />
| title = GitHub<br />
| logo = GitHub_logo.png<br />
| image = GitHub 1303511667338.png<br />
| description = A screen shot of the GitHub home page taken on 2015-11-08<br />
| URL = {{url|1=https://github.com/|2=GitHub}}<br />
| project_status = {{online}}<br />
| archiving_status = {{upcoming}}<br />
| irc = getgit<br />
}}<br />
<br />
:''See also [[GitHub Downloads]]''<br />
<br />
'''GitHub''' is a software repository host powered by Git. It does not seem to have any site issues and is usually up around the clock (see [http://status.github.com/ site status]). Things look pretty sunny at the moment, but if disaster strikes, archiving the private repositories would be a problem.<br />
<br />
== Size ==<br />
As of 12 August 2012: 1,963,652 people hosting over 3,460,582 repositories. [https://github.com/search?type=Repositories&q=fork%3Atrue 1,117,147 public repositories] are forks, which greatly reduces the amount of data required to archive the site.<br />
<br />
As of 22 November 2015: There are 32,000,000 repositories, with a similar fork ratio. Back-of-the-envelope calculations suggest 120TB of data in git repositories.<br />
<br />
As of June 2018, there are 79.6 million public repositories in 137 million repository IDs, indicating that around 42 % of all repositories ever created are private or have been deleted.<br />
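<br />
The 42 % figure follows directly from the two counts quoted above. A minimal sketch of the arithmetic, with variable names chosen here purely for illustration:<br />

```python
# Figures quoted above for June 2018.
public_repos = 79.6e6   # public repositories found
repo_ids = 137e6        # repository IDs handed out so far

# Share of IDs that do not resolve to a public repository,
# i.e. repositories that are private or have been deleted.
missing_share = 1 - public_repos / repo_ids
print(f"{missing_share:.1%} private or deleted")
```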
<br />
== Acquisition by Microsoft ==<br />
It was [https://www.bloomberg.com/news/articles/2018-06-03/microsoft-is-said-to-have-agreed-to-acquire-coding-site-github reported by Bloomberg] and [https://news.microsoft.com/2018/06/04/microsoft-to-acquire-github-for-7-5-billion/ confirmed on June 4, 2018] that Microsoft had agreed to buy GitHub for 7.5 billion dollars. On 26 October 2018, the new GitHub CEO, Nat Friedman, [https://blog.github.com/2018-10-26-github-and-microsoft/ announced] that the acquisition was complete.<br />
<br />
A discussion of the feasibility of archiving GitHub has commenced in {{IRC|getgit}}.<br />
* Users in the FOSS community fear a repeat of Microsoft's "embrace, extend, extinguish" schemes of the 1990s and 2000s, and many called for a move to rival [[GitLab]] in the wake of the news.<br />
* [[LinkedIn]] shows how user content can be gradually taken away (by means of paywalls and login walls).<br />
<br />
== Backup tools ==<br />
=== git itself ===<br />
<tt>git clone</tt> is the simplest one (and also works outside of GitHub, obviously). However, it does not get project data that is not stored in git, including issue reports, comments, and pull requests.<br />
<br />
When cloning a repository for archival, it is best to use the <tt>--mirror</tt> option. This mirror will include all branches and even the code associated with pull requests. (Note however that the PR code will get purged eventually by Git's GC when you create a clone from this mirror as the PR commits aren't referenced by any branches, though this can be solved by adding a line like <tt>fetch = +refs/pull/*/head:refs/remotes/origin/pr/*</tt> to the repository config file.)<br />
<br />
To pack a clone/mirror into a single, easily handleable file, use <tt>git bundle create FILE --all</tt> inside the clone/mirror.<br />
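<br />
The two steps above (mirror, then bundle) can be scripted. The following is a minimal sketch, assuming <tt>git</tt> is on the PATH; the function names and the file naming scheme are hypothetical and not part of any existing Archive Team tool.<br />

```python
import subprocess
from pathlib import Path


def mirror_commands(clone_url: str, workdir: Path) -> list:
    """Return the git invocations for one mirror-and-bundle run."""
    workdir = workdir.resolve()  # absolute paths, since we use `git -C`
    # Hypothetical naming scheme: last URL segment without ".git".
    name = clone_url.rstrip("/").rsplit("/", 1)[-1]
    if name.endswith(".git"):
        name = name[:-4]
    mirror = workdir / f"{name}.git"
    bundle = workdir / f"{name}.bundle"
    return [
        # --mirror copies every ref, including refs/pull/* on GitHub.
        ["git", "clone", "--mirror", clone_url, str(mirror)],
        # --all packs every ref of the mirror into a single file.
        ["git", "-C", str(mirror), "bundle", "create", str(bundle), "--all"],
    ]


def mirror_and_bundle(clone_url: str, workdir: Path) -> None:
    """Run the commands; raises CalledProcessError if git fails."""
    workdir.mkdir(parents=True, exist_ok=True)
    for cmd in mirror_commands(clone_url, workdir):
        subprocess.run(cmd, check=True)
```

Calling <tt>mirror_and_bundle("https://github.com/owner/repository.git", Path("backups"))</tt> would leave a bare mirror and a <tt>.bundle</tt> file next to each other in <tt>backups/</tt>. Building the command list separately makes it possible to inspect or log the commands before anything touches the network.<br />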
<br />
=== Other tools ===<br />
<br />
[https://github-backup.branchable.com/ github-backup] runs in a git repository and chases down the information that git itself misses (issues, comments, pull requests), committing it to a "github" branch. It also chases down the forks and efficiently downloads them as well.<br />
<br />
[http://www.githubarchive.org/ githubarchive.org] and [http://ghtorrent.org/ GHTorrent] are both creating archives of the GitHub "timeline", that is, all events like git pushes, forks, created issues, pull requests, etc.<br />
<br />
[http://codearchive.org codearchive.org] is an effort to back up all versions of all repos on GitHub and other sources. [https://speakerdeck.com/filosottile/the-code-archive-hope-xi Slides from a talk about it].<br />
<br />
[https://github.com/josegonzalez/python-github-backup python-github-backup] can back up entire users or organisations and retrieves issues, PRs, labels, milestones, hooks, wikis, gists, and LFS data. It can also grab starred repositories and forks.<br />
<br />
See also [[Software Heritage]].<br />
<br />
== GitHub replacement engines ==<br />
<br />
If we ever have to archive the data out of GitHub, the data will need to be exportable to a GitHub-style engine.<br />
<br />
Currently<sup>[when?]</sup>, the best GitHub-style engine that has a Wiki, issues, Git Repo hosting, and is free and open source to use is [http://gitlab.com GitLab]. The engine is used by and paid for by many major organizations, so it is likely to live on in a stable way. Other popular FOSS alternatives to GitHub include [https://gitea.io/en-US/ Gitea] and [https://gogs.io/ Gogs].<br />
<br />
We will need a complete migration system to move a git repository and all related GitHub service information of a repository to GitLab.<br />
<br />
== Things to scrape ==<br />
<br />
In case of emergency, these are the items we need to grab.<br />
<br />
* Git Repository - Accomplished by github-backup<br />
** Forked Repositories - Accomplished by github-backup<br />
** '''Notes on Commits/Lines of Code''' - Not supported by github-backup yet. GitHub API support exists since ca. 2011.<br />
* '''GitHub Gollum Wiki''' - No tool yet, but just clone the whole thing, and then push it to GitLab.<br />
** The wiki is a full-blown git repository, though only a few features are exposed in the user interface (e.g. no branches). The clone URL is shown on wiki pages and is <tt>https://github.com/owner/repository.wiki.git</tt>.<br />
* '''Releases''' - Tags on GitHub can have binaries attached. These are of high priority to archive.<br />
* Issues + Comments - Accomplished by github-backup<br />
** '''Milestones''' - ''github-backup does not archive this yet.''<br />
** '''Labels''' - ''github-backup does not archive this yet.''<br />
* '''Hooks''' - Needs some kind of tool to archive GitHub Hooks<br />
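<br />
For a single repository, the items above map onto a handful of clone URLs and GitHub REST API (v3) paths. The sketch below only builds those URLs. The function name and dictionary keys are made up for illustration, and an actual scraper would still need pagination, authentication and rate-limit handling (hooks in particular are only visible to accounts with admin access to the repository).<br />

```python
API_ROOT = "https://api.github.com"


def scrape_targets(owner: str, repo: str) -> dict:
    """Map each item on the list above to the URL it can be fetched from."""
    base = f"{API_ROOT}/repos/{owner}/{repo}"
    return {
        # Plain git clones (use `git clone --mirror` on these).
        "git": f"https://github.com/{owner}/{repo}.git",
        "wiki": f"https://github.com/{owner}/{repo}.wiki.git",
        # JSON listings from the REST API.
        "commit_comments": f"{base}/comments",
        "releases": f"{base}/releases",  # each release lists its assets
        "issues": f"{base}/issues?state=all",
        "issue_comments": f"{base}/issues/comments",
        "milestones": f"{base}/milestones?state=all",
        "labels": f"{base}/labels",
        "hooks": f"{base}/hooks",  # requires admin rights on the repo
    }


for kind, url in scrape_targets("owner", "repository").items():
    print(f"{kind:16} {url}")
```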
<br />
== Lists of repositories ==<br />
<br />
A list of repositories built from GitHub API data is maintained by an Archive Team member at [https://za3k.com/github/ za3k.com]. It scrapes continuously. Public downloads are updated once a day. This list does not include gists.<br />
<br />
The Internet Archive item {{IA item|github_repository_index_201806}} contains another crawl of the API from June 2018.<br />
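<br />
One way such a list could be collected is GitHub's "list public repositories" endpoint, which returns repositories in ID order and is paged with a <tt>since</tt> parameter (the last repository ID already seen). This is a minimal sketch of that loop, not the script behind either list above; the function names and the User-Agent string are placeholders, and unauthenticated requests are heavily rate-limited.<br />

```python
import json
import urllib.request
from typing import Callable, Iterator

API_URL = "https://api.github.com/repositories"


def fetch_page(since: int) -> list:
    """Fetch one page of public repositories with ID greater than `since`."""
    req = urllib.request.Request(
        f"{API_URL}?since={since}",
        headers={
            "Accept": "application/vnd.github.v3+json",
            "User-Agent": "repo-list-sketch",  # placeholder value
        },
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def iter_public_repos(
    start: int = 0, fetch: Callable[[int], list] = fetch_page
) -> Iterator[dict]:
    """Yield repository records in ID order until an empty page is returned."""
    since = start
    while True:
        page = fetch(since)
        if not page:
            return
        yield from page
        # The next page starts after the highest ID seen so far.
        since = page[-1]["id"]
```

Passing the page fetcher in as a parameter keeps the paging logic testable offline and leaves room for adding authentication or back-off later. Because the loop is resumable from any <tt>start</tt> ID, an interrupted crawl can pick up where it left off.<br />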
<br />
== GithubArchive ==<br />
<br />
The metadata generated by the GitHub API are archived to Google BigQuery every hour by [https://www.githubarchive.org/ GithubArchive]. <br />
<br />
It obviously doesn't grab events '''dating before 2011''', so a targeted repository scrape may still be ideal.<br />
<br />
Still, it should be possible to grab all info about a single repository using Google BigQuery's free version, since such a query would use few resources. An export script for this would need to be written when the time comes.<br />
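<br />
As a starting point for such an export script, the sketch below only builds the SQL text for one repository and one month. It assumes the public dataset exposes monthly tables named <tt>githubarchive.month.YYYYMM</tt> with <tt>type</tt>, <tt>actor.login</tt>, <tt>repo.name</tt>, <tt>created_at</tt> and <tt>payload</tt> columns; these names should be checked against the current GithubArchive documentation before use. The function name is made up for illustration.<br />

```python
import re


def repo_events_query(repo_name: str, month: str) -> str:
    """Build a BigQuery standard-SQL query for one repository's events.

    repo_name: "owner/repository"; month: "YYYYMM".
    Table and column names are assumptions, see the note above.
    """
    # Validate inputs instead of quoting them, since they are
    # interpolated directly into the SQL text.
    if not re.fullmatch(r"[A-Za-z0-9_.-]+/[A-Za-z0-9_.-]+", repo_name):
        raise ValueError(f"unexpected repository name: {repo_name!r}")
    if not re.fullmatch(r"\d{6}", month):
        raise ValueError(f"month must be YYYYMM, got {month!r}")
    return (
        "SELECT type, actor.login AS actor, created_at, payload\n"
        f"FROM `githubarchive.month.{month}`\n"
        f"WHERE repo.name = '{repo_name}'\n"
        "ORDER BY created_at"
    )


print(repo_events_query("owner/repository", "201806"))
```

The resulting text could be pasted into the BigQuery console or handed to a client library. Selecting fewer columns (for example, dropping <tt>payload</tt>) should reduce the amount of data the query has to scan.<br />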
<br />
== ArchiveTeam archival efforts ==<br />
In June 2018, a discovery warrior project was started based on the current list of repositories. The goal was to obtain the number of watchers, stars, forks, and the origin repository (for forks) for each repository – all information which is not returned by the [https://developer.github.com/v3/repos/#list-all-public-repositories repositories API endpoint] which was used to collect the list – so that a prioritisation of content according to those numbers would be possible. The origin repository is needed for storing forks efficiently: since the original repository and all its forks are usually mostly identical, this can be stored in a single repository instead of one clone per fork, thus storing the shared revisions only once.<br />
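<br />
The shared-storage idea can be sketched with plain git: fetch the origin repository and each fork into one bare repository, keeping every owner's refs under a separate prefix so that identical commits are stored only once. The <tt>refs/forks/OWNER/</tt> layout and the function name below are assumptions made for illustration, not the layout used by the warrior project.<br />

```python
def fork_network_commands(store: str, origin: str, forks: list) -> list:
    """Return git commands that collect a fork network in one bare repo.

    store: path of the bare repository to create.
    origin, forks: "owner/repository" names on GitHub.
    """
    cmds = [["git", "init", "--bare", store]]
    for full_name in [origin, *forks]:
        owner = full_name.split("/", 1)[0]
        url = f"https://github.com/{full_name}.git"
        # Map every remote ref into a per-owner namespace, e.g.
        # refs/heads/main -> refs/forks/<owner>/heads/main.
        refspec = f"+refs/*:refs/forks/{owner}/*"
        cmds.append(["git", "-C", store, "fetch", "--no-tags", url, refspec])
    return cmds


for cmd in fork_network_commands("tool-network.git", "alice/tool", ["bob/tool"]):
    print(" ".join(cmd))
```

Because all fetches land in the same object database, commits shared between the origin and its forks take up space only once, while the per-owner ref prefixes keep each fork's branches and tags distinguishable.<br />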
<br />
In December 2018, around 2,000 GitHub repos linked from [[Wikidata lists|Wikidata]] were saved using [[ArchiveBot]].<br />
<br />
== The GitHub Archive Program ==<br />
On February 2, 2020, GitHub "captured a snapshot of every active public repository, to be preserved in the GitHub Arctic Code Vault". Read more at https://archiveprogram.github.com.<br />
<br />
== External links ==<br />
* {{url|1=https://github.com/|2=GitHub}}<br />
* {{url|1=https://archive.softwareheritage.org/|2=Software Heritage Archive}}<br />
* {{url|1=https://archiveprogram.github.com/|2=The GitHub Archive Program}}<br />
<br />
{{Navigation box}}</div>Joaquinito01https://wiki.archiveteam.org/index.php?title=Mixer&diff=44851Mixer2020-06-24T09:37:32Z<p>Joaquinito01: </p>
<hr />
<div>{{Infobox project<br />
| title = Mixer<br />
| description = Microsoft-owned video game streaming service.<br />
| URL = https://mixer.com/<br />
| project_status = {{Closing}} July 22, 2020<br />
| archiving_status = {{upcoming}}<br />
| irc = mixdown<br />
| irc_network = hackint<br />
}}<br />
Mixer is a Microsoft-owned service for video game streaming with direct streaming capabilities from Xbox. It originated from the acquisition of Beam.<br />
<br />
Notably, much of the video data on the site is temporary in nature. For example, past streams (VoDs) are kept for 14 days for regular and pro users, 90 days for partners, 180 days for verified accounts, and longer for some official esports and events channels.<ref>https://watchbeam.zendesk.com/hc/en-us/articles/209662033-Past-Streams-VoDs-</ref> Clips expire after 14 days for regular, non-partner users and after 90 days for partners and verified channels.<ref>https://watchbeam.zendesk.com/hc/en-us/articles/360005089311-Clips-FAQ-</ref><br />
<br />
== Shutdown ==<br />
On June 22, 2020, Microsoft announced that Mixer would be shutting down in one month, on July 22, 2020. After that date, the site will redirect to Facebook Gaming.<br />
<br />
Past VoDs will remain accessible until July 22, 2020. After that date, users will need to submit a written request to retrieve their data.<br />
<br />
[https://blog.mixer.com/2020/06/22/the-next-step-for-mixer/ Announcement blog post]<br />
[https://watchbeam.zendesk.com/hc/en-us/articles/360044847472 FAQ page]<br />
[https://www.facebook.com/fbgaminghome/blog/welcome-mixer-facebook-gaming Facebook announcement]<br />
<br />
== API ==<br />
Mixer has a [https://dev.mixer.com/guides/core/introduction well-documented public API].<br />
API points of interest include <code><nowiki>https://mixer.com/api/v1/channels?limit=100</nowiki></code> (online channels) and <code><nowiki>https://mixer.com/api/v1/recordings?limit=100</nowiki></code> (channels with saved broadcasts; note that broadcasts are saved for only a limited time for most channels, but this could still be a good discovery mechanism).</div>Joaquinito01