Difference between revisions of "Alive... OR ARE THEY"
		
		
		
		
		
		Jump to navigation
		Jump to search
		
				
		
		
	
|  (add alternate sites for carrd) | JustAGrook (talk | contribs)  | ||
| (37 intermediate revisions by 23 users not shown) | |||
| Line 1: | Line 1: | ||
| Like many sites before them, these places indicate a sunny outlook, a clean bill of health and a total sense of "all systems go". But as we've found out from those many sites before them, fortunes can change overnight. | Like many sites before them, these places indicate a sunny outlook, a clean bill of health and a total sense of "all systems go". But as we've found out from those many sites before them, fortunes can change overnight.   | ||
| Archive Team considers these sites specifically of interest because they solicit so much content, contain so many works and projects by a wide group of people, or have the internet particularly dependent on them. Consider this a fire drill.. know what you can do to get your data off these sites and back them off for later. | Archive Team considers these sites specifically of interest because they solicit so much content, contain so many works and projects by a wide group of people, or have the internet particularly dependent on them. Consider this a fire drill.. know what you can do to get your data off these sites and back them off for later. | ||
| Line 18: | Line 18: | ||
| === Watchlist === | === Watchlist === | ||
| * '''[[9GAG]]''' ({{url|1=http://9gag.com/}}) has been seeing a dwindling popularity, and its owner seems to love spending a lot of money on investing in NFTs. | |||
| * '''[[Academic Earth]]''' ({{url|1=http://academicearth.org/}}) has been worryingly unloved for a while, and holds a mountain of free education that's invaluable to the world. | * '''[[Academic Earth]]''' ({{url|1=http://academicearth.org/}}) has been worryingly unloved for a while, and holds a mountain of free education that's invaluable to the world. | ||
| * '''[[A-Infos]]''' ({{url|1=http://ainfos.ca/}}) a multi-lingual news service by, for, and about anarchists. Have stuff archived since the 90's that isn't available anywhere else. | * '''[[A-Infos]]''' ({{url|1=http://ainfos.ca/}}) a multi-lingual news service by, for, and about anarchists. Have stuff archived since the 90's that isn't available anywhere else. | ||
| Line 25: | Line 25: | ||
| * '''[[AnimeMusicVideos.org]]''' ({{url|1=http://www.animemusicvideos.org/}}) is fine right now, but they rely on donations and host vast amounts of user-edited music videos on their server (presumably without mirrors). Hard to download as you have to be a member to get all the download links, and after downloading a handful you have to vode before you can d/l again (or you can donate which presumably gives you 1 year of free d/l access). Also, this site might be a grey area, copyright-wise, as the videos are all cut together from copyrighted material. | * '''[[AnimeMusicVideos.org]]''' ({{url|1=http://www.animemusicvideos.org/}}) is fine right now, but they rely on donations and host vast amounts of user-edited music videos on their server (presumably without mirrors). Hard to download as you have to be a member to get all the download links, and after downloading a handful you have to vode before you can d/l again (or you can donate which presumably gives you 1 year of free d/l access). Also, this site might be a grey area, copyright-wise, as the videos are all cut together from copyrighted material. | ||
| * '''[[Archive of Our Own]]''' ({{url|1=https://archiveofourown.org/}}) is stable but contains a large catalog of fanfiction that would likely be lost if the site were to shut down. | * '''[[Archive of Our Own]]''' ({{url|1=https://archiveofourown.org/}}) is stable but contains a large catalog of fanfiction that would likely be lost if the site were to shut down. | ||
| * '''[[Baseportal]]''' is a web database which, although currently existing, isn't in the best state, seeing as the site hasn't updated to HTTPS and the forums are overran with spam. It would be great for archiving seeing as thousands use it. (including the Philippine government) | |||
| * '''[[The Believer Magazine]]''' ({{url|1=https://www.thebeliever.net/}}) <s>was just purchased [https://twitter.com/ST_Collective_/status/1523756595317927936 by what appears to be a sex toy company with an interest in keeping the archives up] but they do not appear to be reliable or trustworthy custodians.</s> The site has been sold back to its previous/original publisher, McSweeney's, since 2022. The website was originally {{url|1=https://believermag.com/}} (which now redirects to culture.org), and was moved on May 16 (coinciding with the change in ownership); it seems that all previous publications up to March 2003 are archived on the current site. | |||
| * '''[[BetaArchive]]''' ({{url|http://www.betaarchive.com/}}) has Kafkaesque requirements to be able to access it, and apparently refuses to be backed up, presumably so that they get more visitors. Valuable cultural library of historic software with no backups? Aargh. | * '''[[BetaArchive]]''' ({{url|http://www.betaarchive.com/}}) has Kafkaesque requirements to be able to access it, and apparently refuses to be backed up, presumably so that they get more visitors. Valuable cultural library of historic software with no backups? Aargh. | ||
| * '''[[BioMedia Project]]''' ({{url|1=http://biomediaproject.com/bmp/}}) is a large archive of various BIONICLE media that has not had a notable update since 2015. | * '''[[BioMedia Project]]''' ({{url|1=http://biomediaproject.com/bmp/}}) is a large archive of various BIONICLE media that has not had a notable update since 2015. | ||
| * '''[[Carrd]]''' ({{url|1=https://carrd.co}}, {{url|1=https://crd.co}}, {{url|1=https://ju.mp}}, and {{url|1=https://uwu.ai}}) is a free web host, mainly used by minorities (the LGBTQ+ community and people with mental illnesses, for example) and Twitter/Tumblr communities. Also used for activism (one of the most popular Carrd websites is about Black Lives Matter, for example). Pretty stable as of now, but could become a good historical resource in the future. | * '''[[Carrd]]''' ({{url|1=https://carrd.co}}, {{url|1=https://crd.co}}, {{url|1=https://ju.mp}}, and {{url|1=https://uwu.ai}}) is a free web host, mainly used by minorities (the LGBTQ+ community and people with mental illnesses, for example) and Twitter/Tumblr communities. Also used for activism (one of the most popular Carrd websites is about Black Lives Matter, for example). Pretty stable as of now, but could become a good historical resource in the future. | ||
| * '''[[Catbox]]''' ({{url|1=https://catbox.moe/}}) is a file host whose main source of funding ([[Patreon]]) {{url|1=https://blog.catbox.moe/post/785233399498555392/important-catbox-needs-your-help|2=fucked them over}}. | |||
| * '''[[Codecademy]]''' ({{url|1=http://www.codecademy.com/}}) has a large amount of valuable coding lessons. | * '''[[Codecademy]]''' ({{url|1=http://www.codecademy.com/}}) has a large amount of valuable coding lessons. | ||
| * '''[[DatasheetArchive]]''' ({{url|1=http://www.datasheetarchive.com/}}) hosts over 350 million PDF datasheets for integrated circuits, some of which are very old and hard to track down otherwise. The site is slow from time to time and uses a convoluted IFRAME-based online viewer, presumably to make scraping the site harder. Nevertheless, multiple other similar sites exist, with large parts of their PDFs non-overlapping, so that at some point, all should be saved. Similar sites include http://doc.chipfind.<!-- -->ru/ (1.6m datasheets), http://www.alldatasheet.com/ (20m datasheets), http://www.datasheets.com/ (250m datasheets), http://www.datasheetcatalog.com/, http://freedatasheets.com/ and several others | * '''[[DatasheetArchive]]''' ({{url|1=http://www.datasheetarchive.com/}}) hosts over 350 million PDF datasheets for integrated circuits, some of which are very old and hard to track down otherwise. The site is slow from time to time and uses a convoluted IFRAME-based online viewer, presumably to make scraping the site harder. Nevertheless, multiple other similar sites exist, with large parts of their PDFs non-overlapping, so that at some point, all should be saved. Similar sites include http://doc.chipfind.<!-- -->ru/ (1.6m datasheets), http://www.alldatasheet.com/ (20m datasheets), http://www.datasheets.com/ (250m datasheets), http://www.datasheetcatalog.com/, http://freedatasheets.com/ and several others | ||
| * '''[[Delicious]]''' ({{url|1=http://www.delicious.com/}}) loves to change their API, which has a side effect of making it difficult to back up. | * '''[[Delicious]]''' ({{url|1=http://www.delicious.com/}}) loves to change their API, which has a side effect of making it difficult to back up. | ||
| * '''[[Encyclopedia Dramatica]]''' ({{url|1=edramatica.com}}) is frequently up, down, and changing domains due to general Internet drama. | |||
| * '''[[Facebook]]''' ({{url|1=http://www.facebook.com/}}) seems stable at the moment. | * '''[[Facebook]]''' ({{url|1=http://www.facebook.com/}}) seems stable at the moment. | ||
| * '''[[Fandom]]''' ({{url|1=http://www.fandom.com/}}), the for-pay arm of Wikipedia (just kidding, it's a different company, but shares a lot of people) is a repository of directed, unsubject-to-wikipolitics wikis, many of them intense and completist. It'd be bad for them to go away. | |||
| * '''[[FanFiction]]''' ({{url|1=http://www.fanfiction.net/}}) represents many thousands of user-generated stories, essays and huge amounts of work. | * '''[[FanFiction]]''' ({{url|1=http://www.fanfiction.net/}}) represents many thousands of user-generated stories, essays and huge amounts of work. | ||
| * '''[[Forrst]]''' ({{url|1=http://zurb.com/forrst}}) was shut down on April 14th, 2014, but all posts were archived by Forrst. | * '''[[Forrst]]''' ({{url|1=http://zurb.com/forrst}}) was shut down on April 14th, 2014, but all posts were archived by Forrst. | ||
| Line 39: | Line 43: | ||
| * '''[[Google]]''' ({{url|1=http://www.google.com/}}) wants you to think they will be here forever. | * '''[[Google]]''' ({{url|1=http://www.google.com/}}) wants you to think they will be here forever. | ||
| * '''[[h2g2]]''' ({{url|1=https://www.h2g2.com}}) was (among?) the first [https://h2g2.com/edited_entry/A550955 online, collaborative encyclopaedia(s)]. | * '''[[h2g2]]''' ({{url|1=https://www.h2g2.com}}) was (among?) the first [https://h2g2.com/edited_entry/A550955 online, collaborative encyclopaedia(s)]. | ||
| * '''[[GBAtemp]]''' ({{url|1=https://GBAtemp.net}}) A popular forums site in the console homebrewing community. Most of GBAtemp is accessible without an account apart from user profiles containing profile posts. It doesn't appear to be in any danger, having been around for 20 years. Also has a Wiki ({{url|1=https://wiki.GBAtemp.net}}) which mentions compatibility about different backup loaders, site history, modchips, and so on. | |||
| * '''[http://identifyyourbreyer.com/ Identify Your Breyer]''' - As of August 31st, 2022, "three curators" have come in to save the website. They're currently working on moving site hosts, but as of September 11th, 2022, they needed to push it back "a few weeks". While the fate of the site is still somewhat uncertain, it's no longer in danger of expiring within the next few months. | |||
| * '''[[IFTTT]]''' ({{url|1=http://ifttt.com/}}) is still growing. | * '''[[IFTTT]]''' ({{url|1=http://ifttt.com/}}) is still growing. | ||
| * '''[[Internet Archive]]''' ({{url|1=http://www.archive.org/}}) seems stable at the moment, but its [https://archive.org/~tracey/mrtg/du.html 45 petabytes] of data (as of October 2018) aren't mirrored anywhere else. The code for their system isn't open source, and generally they're a single point of failure for a large amount of the web's history. Why should there be only 1 internet archive? | * '''[[Internet Archive]]''' ({{url|1=http://www.archive.org/}}) seems stable at the moment, but its [https://archive.org/~tracey/mrtg/du.html 45 petabytes] of data (as of October 2018) aren't mirrored anywhere else. The code for their system isn't open source, and generally they're a single point of failure for a large amount of the web's history. Why should there be only 1 internet archive? | ||
| Line 46: | Line 52: | ||
| * '''[[JanusVR]]''' is a company that aims to re-imagine the web as "rooms" interconnected with portals. These "rooms" can be hosted anywhere. Though the company and its technology are still fairly new, there are already cases of rooms with missing assets or even entire links broken, and [[ArchiveBot]]'s ability to grab these rooms is somewhat limited. | * '''[[JanusVR]]''' is a company that aims to re-imagine the web as "rooms" interconnected with portals. These "rooms" can be hosted anywhere. Though the company and its technology are still fairly new, there are already cases of rooms with missing assets or even entire links broken, and [[ArchiveBot]]'s ability to grab these rooms is somewhat limited. | ||
| *'''[[JSFiddle]]''' ({{url|1=http://jsfiddle.net/}}) is referenced in many StackOverflow answers, as well as other forums, etc. It shows no signs of going away, but should we archive it just in case? | *'''[[JSFiddle]]''' ({{url|1=http://jsfiddle.net/}}) is referenced in many StackOverflow answers, as well as other forums, etc. It shows no signs of going away, but should we archive it just in case? | ||
| *'''[[Know Your Meme]]''' ({{url|1=http://knowyourmeme.com/}}) is at this point the de facto central repository for information on internet memes and culture. It is as popular as ever at the moment, but even with this popularity, former owners Rocketboom had trouble financing it. In the spring of 2011 was sold to Cheezburger Networks, a site which has been known to "reorganize" its properties, sometimes with a detrimental effect on content. Though it was quite a different story, I might remind people what happened to [[Encyclopedia Dramatica]]. | *'''[[Know Your Meme]]''' ({{url|1=http://knowyourmeme.com/}}) is at this point the de facto central repository for information on internet memes and culture. It is as popular as ever at the moment, but even with this popularity, former owners Rocketboom had trouble financing it. In the spring of 2011 was sold to Cheezburger Networks, a site which has been known to "reorganize" its properties, sometimes with a detrimental effect on content. Though it was quite a different story, I might remind people what happened to [[Encyclopedia Dramatica]]. | ||
| * '''[[Last.fm]]''' ({{url|1=http://www.last.fm/}}) is being cloned by free software developers in the form of [http://libre.fm Libre.fm] -- they have a tool, [http://svn.savannah.gnu.org/viewvc/*checkout*/trunk/lastscrape/lastscrape.py?root=librefm Lastscrape] which can get all your listening data out into a tab delimited text file. | * '''[[Last.fm]]''' ({{url|1=http://www.last.fm/}}) is being cloned by free software developers in the form of [http://libre.fm Libre.fm] -- they have a tool, [http://svn.savannah.gnu.org/viewvc/*checkout*/trunk/lastscrape/lastscrape.py?root=librefm Lastscrape] which can get all your listening data out into a tab delimited text file. | ||
| Line 61: | Line 66: | ||
| * '''[[Pixiv]]''' ({{url|1=http://www.pixiv.net/}}) and '''[[deviantArt]]''' ({{url|1=http://www.deviantart.com/}}) are the largest Japanese and American (respectively) fanart (and valuable art in general) collections on the internet. Most of works are possibly included in [https://saucenao.com/status.html many boorus]. Someone also mentioned [http://www.minitokyo.net/ Minitokyo] around here somewhere. [[Internet Archive|IA]] has a 2008 [https://archive.org/details/mt-walls-2008-01-01 wallpaper dump] from there. | * '''[[Pixiv]]''' ({{url|1=http://www.pixiv.net/}}) and '''[[deviantArt]]''' ({{url|1=http://www.deviantart.com/}}) are the largest Japanese and American (respectively) fanart (and valuable art in general) collections on the internet. Most of works are possibly included in [https://saucenao.com/status.html many boorus]. Someone also mentioned [http://www.minitokyo.net/ Minitokyo] around here somewhere. [[Internet Archive|IA]] has a 2008 [https://archive.org/details/mt-walls-2008-01-01 wallpaper dump] from there. | ||
| * '''[[Pouet]]''' ({{url|1=http://www.pouet.net/}}) is an important site of the demoscene. It indexes and ranks demoscene productions ('prods') and also includes a free-for-all BBS-style forum. [https://demozoo.org/ Demozoo] is a site in the same vein with a slightly different focus. They both use the same ISP iirc so if that goes down a lot of user created content is lost. | * '''[[Pouet]]''' ({{url|1=http://www.pouet.net/}}) is an important site of the demoscene. It indexes and ranks demoscene productions ('prods') and also includes a free-for-all BBS-style forum. [https://demozoo.org/ Demozoo] is a site in the same vein with a slightly different focus. They both use the same ISP iirc so if that goes down a lot of user created content is lost. | ||
| * '''[[Reddit]]''' ({{url|1=http://www.reddit.com/}}) is a content aggregator where [https://en.wikipedia.org/wiki/Digg#Issues_relating_to_former_Digg_website many Digg users migrated in 2010]. Attracted controversy in July 2015, with accusations of [http://knowyourmeme.com/memes/events/amageddon censorship] and [https://en.wikipedia.org/wiki/Stealth_banning shadowbanning]. Stable for now, but team is small. | * '''[[Reddit]]''' ({{url|1=http://www.reddit.com/}}) is a content aggregator where [https://en.wikipedia.org/wiki/Digg#Issues_relating_to_former_Digg_website many Digg users migrated in 2010]. Attracted controversy in July 2015, with accusations of [http://knowyourmeme.com/memes/events/amageddon censorship] and [https://en.wikipedia.org/wiki/Stealth_banning shadowbanning]. Many controversial subreddits (with up to hundreds of thousands subscribers each) are "quarantined"; many of these have subsequently been deleted. Stable for now, but team is small. | ||
| * '''[[SourceForge]]''' ({{url|1=http://www.sourceforge.net/}}) is a critical repository of open source code, information, and webpages. It is mirrored and maintained, but there are sure to be parts that are neither. | * '''[[SourceForge]]''' ({{url|1=http://www.sourceforge.net/}}) is a critical repository of open source code, information, and webpages. It is mirrored and maintained, but there are sure to be parts that are neither. | ||
| * '''[[The Pirate Bay]]''' ({{url|1=http://www.thepiratebay.org/}}) is one of the largest and most popular torrent search engines. It's still having persistent legal problems. The tracker went down in November 2012, but the site still serves torrents and magnet links. If a torrent is lost, it becomes impossible to connect to other computers distributing the shared files. Considering that there are links to TPB ''all over this wiki'', this site is pretty dang important. After they were raided in December 2014, a project known as [http://openbay.isohunt.to/ The Open Bay] was launched, which lets anybody host a mirror of TPB with automatic database updates, so even if TPB goes down again, temporarily or not, its database is still available. | * '''[[The Pirate Bay]]''' ({{url|1=http://www.thepiratebay.org/}}) is one of the largest and most popular torrent search engines. It's still having persistent legal problems. The tracker went down in November 2012, but the site still serves torrents and magnet links. If a torrent is lost, it becomes impossible to connect to other computers distributing the shared files. Considering that there are links to TPB ''all over this wiki'', this site is pretty dang important. After they were raided in December 2014, a project known as [http://openbay.isohunt.to/ The Open Bay] was launched, which lets anybody host a mirror of TPB with automatic database updates, so even if TPB goes down again, temporarily or not, its database is still available. | ||
| *'''Supercard''' {{url|1=http://supercard.sc/}}) is the website of a manufacturer of Nintendo DS flashcarts (notably the DSTWO). The website is fine but it has not been updated in years and some section of it are broken. Archiving the [http://down.supercard.sc/download/ download section] should be a priority as some flashcarts rely on software hosted there. | |||
| * '''[[Tribe]]''' ({{url|1=http://tribe.net}}) hosts large amount of user-generated data, and has been having consistent uptime issues. | * '''[[Tribe]]''' ({{url|1=http://tribe.net}}) hosts large amount of user-generated data, and has been having consistent uptime issues. | ||
| * '''[[Tumblr]]''' ({{url|1=http://tumblr.com}}) is a highly popular blogging platform which was bought by [[Yahoo]]! in May, 2013.   | * '''[[Tumblr]]''' ({{url|1=http://tumblr.com}}) is a highly popular blogging platform which was bought by [[Yahoo]]! in May, 2013.   | ||
| * '''[[TVTropes]]''' ({{url|1=http://www.tvtropes.org/pmwiki/pmwiki.php/Main/HomePage}}) is a popular wiki dedicated to finding recurring patterns in fiction, and discussing fiction in general. No word on whether there are backups. The administrators have a tendency to delete things indiscriminately, usually to save on disk space: article edit histories are frequently purged, and old forum threads have been known to get deleted mercilessly. A [http://archive.org/details/archiveteam-tvtropes-2012-09 backup] and [https://allthetropes.org/wiki/Main_Page an alternate website] with imported content from July 2012 are available.   | * '''[[TVTropes]]''' ({{url|1=http://www.tvtropes.org/pmwiki/pmwiki.php/Main/HomePage}}) is a popular wiki dedicated to finding recurring patterns in fiction, and discussing fiction in general. No word on whether there are backups. The administrators have a tendency to delete things indiscriminately, usually to save on disk space: article edit histories are frequently purged, and old forum threads have been known to get deleted mercilessly. A [http://archive.org/details/archiveteam-tvtropes-2012-09 backup] and [https://allthetropes.org/wiki/Main_Page an alternate website] with imported content from July 2012 are available.   | ||
| * '''[[Urban Dictionary]]''' ({{URL|https://www.urbandictionary.com/}}) is "a crowdsourced online dictionary for slang words and phrases"<ref>{{URL|https://en.wikipedia.org/w/index.php?title=Urban_Dictionary&oldid=926230888|Wikipedia}}</ref>. It's financed by ads and a web store, and there are no signs of serious trouble as of late 2019. | * '''[[Urban Dictionary]]''' ({{URL|https://www.urbandictionary.com/}}) is "a crowdsourced online dictionary for slang words and phrases"<ref>{{URL|https://en.wikipedia.org/w/index.php?title=Urban_Dictionary&oldid=926230888|Wikipedia}}</ref>. It's financed by ads and a web store, and there are no signs of serious trouble as of late 2019. | ||
| * '''[[WebCite]]''' ({{url|1=http://www.webcitation.org/}}) itself seems to be having trouble with funding, and is facing "possible discontinuation." As this site serves as a stable reference for fleeting Web references, it would be pretty disastrous if it went away. | * '''[[WebCite]]''' ({{url|1=http://www.webcitation.org/}}) itself seems to be having trouble with funding, and is facing "possible discontinuation." As this site serves as a stable reference for fleeting Web references, it would be pretty disastrous if it went away. | ||
| * '''[[whitehouse.gov]]''' ({{url|1=http://www.whitehouse.gov/}}) is overhauled every time a new US president assumes office and changes continuously during the term. Old versions are preserved thanks to the [http://kitenet.net/~joey/blog/entry/ephemera_vs_the_law/ Presidential Records Act] (e.g. [http://georgewbush-whitehouse.archives.gov/ George W. Bush's]), but we also want to watch out for site changes / disappeared pages that were embarrassing or whatnot. | * '''[[whitehouse.gov]]''' ({{url|1=http://www.whitehouse.gov/}}) is overhauled every time a new US president assumes office and changes continuously during the term. Old versions are preserved thanks to the [http://kitenet.net/~joey/blog/entry/ephemera_vs_the_law/ Presidential Records Act] (e.g. [http://georgewbush-whitehouse.archives.gov/ George W. Bush's]), but we also want to watch out for site changes / disappeared pages that were embarrassing or whatnot. | ||
| * '''[[WikiLeaks]]''' ({{url|1=http://wikileaks.org/}}) contains several thousand leaked documents from sources such as the Iraq War and the cables famously known under the label 'Cablegate'. Due to the content on the website, and that PayPal and Amazon (very) quickly dropped their hosting for them during Cablegate's opening days, it should be considered a potential target for any number of government committees for a quick shutdown. They have an uncertain financial situation, and the site was inaccessible for some time in 2010. | * '''[[WikiLeaks]]''' ({{url|1=http://wikileaks.org/}}) contains several thousand leaked documents from sources such as the Iraq War and the cables famously known under the label 'Cablegate'. Due to the content on the website, and that PayPal and Amazon (very) quickly dropped their hosting for them during Cablegate's opening days, it should be considered a potential target for any number of government committees for a quick shutdown. They have an uncertain financial situation, and the site was inaccessible for some time in 2010. | ||
| * '''[[Wikidot]]''' ({{url|1=https://www.wikidot.com/}}) has not had development work for years, there is no official presence on their [http://community.wikidot.com/forum:recent-posts support forum], just many complaints about the difficulty contacted staff, even from those with paid accounts. Backup and wiki conversion tools are poor, services like search and anti-spam bots have problems for some time, and the site has been [https://bitcointalk.org/index.php?topic=5228862.0 up for sale] since February 2020. | |||
| ** [[SCP Foundation]] is hosted on Wikidot. They plan on moving off of the platform and there are multiple archives, but there are still over 10 years' worth of creative writing that could be lost forever if Wikidot were to shut down. | |||
| * '''[[Wikipedia]]''' ({{url|1=http://www.wikipedia.org/}}) will surely be here forever and ever! Fortunately, we don't have to take their word for it as they offer dumps of the data minus the photos. However, no-one has verified that Wikipedia can actually be restored from these dumps. If disaster strikes then we could discover a serious problem. | * '''[[Wikipedia]]''' ({{url|1=http://www.wikipedia.org/}}) will surely be here forever and ever! Fortunately, we don't have to take their word for it as they offer dumps of the data minus the photos. However, no-one has verified that Wikipedia can actually be restored from these dumps. If disaster strikes then we could discover a serious problem. | ||
| * '''[[Writing]]''' ({{url|1=http://Writing.com/}}) A website for writing that was big in the 2000's. More and more restricted to guests and free users, as they are in need of money to keep the site running. Less and less popular for that reason | * '''[[Writing]]''' ({{url|1=http://Writing.com/}}) A website for writing that was big in the 2000's. More and more restricted to guests and free users, as they are in need of money to keep the site running. Less and less popular for that reason | ||
| * [[YTMND]] - Resurrected on March 31st, 2020 with an archive being made in June 2018. Still on the Watchlist because of its past, but mostly safe for the time being. | |||
| == Endangered == | == Endangered == | ||
| Line 80: | Line 87: | ||
| Did someone leave the oven on? | Did someone leave the oven on? | ||
| * '''[ | * '''[kirbysrainbowresort.net Kirby's Rainbow Resort]''' is a Kirby fan site that has a significant archive of old fan works and official media that's difficult to find elsewhere. Around the beginning of 2023, the website's forums, Oekaki imageboard, news updates, and other parts of the site have unceremoniously disappeared. The rest of the site hasn't been updated since 2020. | ||
| * '''[[Kiwi Farms]]''' ({{url|http://kiwifarms.net/}}, {{url|http://kiwifar.ms/|former alternate .ms domain}}) is a notorious forum where users mock [http://www.urbandictionary.com/define.php?term=lolcow "lolcows"], or people who have attracted ridicule because of their behavior or their beliefs. On January 20, 2017, the forum was shut down without a warning by its owner, but was restored three weeks later. As of September 2022, the website has been excluded from the Wayback Machine and the owner is struggling to find a new host due to a [https://www.msn.com/en-gb/news/us/citing-imminent-danger-cloudflare-drops-hate-site-kiwi-farms/ar-AA11t2U9 dispute with Cloudflare]. | |||
| * '''[http://www.ning.com/ Ning]''' in 2010 has laid off 40% of staff and seems to be running out of money [http://techcrunch.com/2010/04/15/nings-bubble-bursts-no-more-free-networks-cuts-40-of-staff/]. There is certainly some networks worth archiving among the 2 million networks[http://blog.ning.com/2010/01/2-million-ning-networks.html] they host. Grouply[http://blog.grouply.com/grouply-welcomes-ning-networks/] and Posterous[http://blog.posterous.com/posterous-commits-to-building-a-ning-blog-imp] say they are going to offer migration tools. | * '''[http://www.ning.com/ Ning]''' in 2010 has laid off 40% of staff and seems to be running out of money [http://techcrunch.com/2010/04/15/nings-bubble-bursts-no-more-free-networks-cuts-40-of-staff/]. There is certainly some networks worth archiving among the 2 million networks[http://blog.ning.com/2010/01/2-million-ning-networks.html] they host. Grouply[http://blog.grouply.com/grouply-welcomes-ning-networks/] and Posterous[http://blog.posterous.com/posterous-commits-to-building-a-ning-blog-imp] say they are going to offer migration tools. | ||
| * As of 2014, ScraperWiki Classic is now read-only.  But don’t worry! You can transfer this scraper to Morph.io if you want to continue editing it. | |||
| * {{As of|2014}}, ScraperWiki Classic is now read-only.  But don’t worry! You can transfer this scraper to Morph.io if you want to continue editing it. | |||
| * '''[http://debates.oireachtas.ie/ debates.oireachtas.ie]''' on September 18th, 2012 the Houses of Oireachtas website [http://www.kildarestreet.com/statement2012/ announced] that it would no longer be updating its XML data for Irish parliamentary debates (1919-2012). Access to pre-existing data is still available, but is likely to disappear, if the current trend continues. It would be useful to at least capture the XML data that is there, while it is still available. Here's a [https://archive.org/details/debatesoireachtasie-XML WARC archive] of the XML only. | * '''[http://debates.oireachtas.ie/ debates.oireachtas.ie]''' on September 18th, 2012 the Houses of Oireachtas website [http://www.kildarestreet.com/statement2012/ announced] that it would no longer be updating its XML data for Irish parliamentary debates (1919-2012). Access to pre-existing data is still available, but is likely to disappear, if the current trend continues. It would be useful to at least capture the XML data that is there, while it is still available. Here's a [https://archive.org/details/debatesoireachtasie-XML WARC archive] of the XML only. | ||
| * '''[[ | * '''[http://www.groklaw.net/article.php?story=20130818120421175 Groklaw]''' will no longer be posting new articles, "due to government monitoring of the internet, particularly e-mail." Whether or not its archives will remain online is unclear, although it does seem rather unlikely it will 100% disappear. OTOH, better safe than sorry. Still up as of November 28th, 2023. | ||
| * '''[http://www.worldofspectrum.org/ World of Spectrum]''''s current administrator announced that [http://www.worldofspectrum.org/forums/discussion/52892/had-enough he's ceasing to support the website and forum within 8 weeks] (as of July 3). The future of the website is uncertain. Still up as of October 11th, 2022. | |||
| * The '''Centralstation Community''' [http://community.thisiscentralstation.com/_Central-Station-v2-Q38As/blog/5449967/126249.html has closed]. The site is a UK-based social network for artists and creatives that provides hosting for content and portfolio. Users are being advised to back up their work as the new version of their platform will rely on existing media hosting sites like Flickr, Vimeo, and Soundcloud. Still up as of October 11th, 2022. | |||
| * Most of the paid staff at '''The Escapist''' [http://www.escapistmagazine.com/news/view/171005-Open-Letter-to-The-Escapist-Community has been "relieved of their duties"] as of October 20, 2017, and the future and longevity of the site is uncertain; it's currently run mostly through volunteer efforts. | |||
| * [[Yelp, Inc.]] lost 30% of its advertisers and people don't seem too happy about it. | |||
| * '''[[Cheezburger]]''' {{url|https://www.cheezburger.com/}}, once a “network” of meme blogs but now one single centralized site, has barely been maintained by its parent company (Cracked.com) for years, with the account creation system being broken since at least 2019. A lot of old meme images (of the Impact-font/rage-face/advice-animal variety) can be found here, dating back to 2007, and it would be a loss to the unique culture of the Internet if all these were to disappear. A lot of its contemporaries (such as [[Lolcats.com]]) have been lost to domain expiries and the ensuing cyber-squatters. There are still some dedicated users posting brand-new meme material here, but some people held on to GeoCities till the bitter end as well. | |||
| * '''[http://www. | * '''[[Twitter]]''' ({{url|1=http://www.twitter.com/}}) is tweaking away, with a dire financial situation and [http://www.breitbart.com/tech/2016/02/06/twitter-in-meltdown-as-entire-userbase-revolts/ controversial decisions]. [https://www.theverge.com/2022/11/10/23452196/elon-musk-twitter-employee-meeting-q-and-a Now faces potential bankruptcy after being purchased by Elon Musk]. | ||
| * '''[ | * '''[https://ja.osdn.net/ OSDN]''' is a hosting site for Japan, containing repositories of open-source code and web pages. However, since its acquisition by OSCHINA in 2022, the site has become very unstable, with frequent 504 errors and other issues. | ||
| *  | * '''[https://5ch.net 5ch.net]''' 5ch (formerly 2ch), an online forum in operation since May 30, 1999, has been experiencing ongoing instability due to issues such as script-based trolling . Archived posts dating back to 1999 are currently inaccessible, with the administrators citing a physical server failure as the cause. There is no estimated timeframe for their restoration.[https://5ch.net/kakolog.html List of archive servers] | ||
| * '''[ | * '''[https://blog.seesaa.jp blog.seesaa.jp]''' Seesaa Inc., the company operating Seesaa Blog was established in 2003 as a blog service provider. It became a subsidiary of Fan Communications in 2017 and was merged into the parent company in 2024. The company, which was in a poor financial state and operating at a loss just before the merger, also ran services like Seesaa Wiki and SS Blog (a business inherited from another company). SS Blog is scheduled to terminate service on March 31, 2025[https://blog-wn.blog.ss-blog.jp/2024-11-15 Notice of Termination of SS Blog Service] | ||
| *  | * '''[https://www.zakzak.co.jp zakzak]''' ZakZak, an online news site operated by Sankei Newspaper, announced that it will cease to be updated on 2024/01/31, with no mention of  continued website existence.[https://www.zakzak.co.jp/article/20241001-JR6E6JBXP5CZDHW2QM3IXXTRTA/ Notice of Suspension of Newspaper] | ||
| *  | * '''[https://aminoapps.com/ Amino]''' is a fandom based community that hosts millions of pieces of art, literature, and wikis. Several fired site mods have sounded alarms on how the site is on its last legs. | ||
| *  | * '''Old websites made by Nintendo Korea''' old websites made by the company, such as https://www.nintendocaution.co.kr/, are in risk of getting deleted if the company decides to delete them. | ||
| == Alarm == | == Alarm == | ||
| Line 110: | Line 130: | ||
| --> | --> | ||
| I smell smoke. | I smell smoke. | ||
| * [[Calorie Restriction Society]] [http://www.crsociety.org]. an InvisionPowerBoard forum. See https://www.crsociety.org/topic/18710-crsocietyorg-finally-got-back-online-after-4-months/?do=getNewComment for details. Site went down for 4 months then went back up in October 2024. We still don't know if it may go down again, given that the administrators have gone out of contact and the original person paying for crsociety.org died. | |||
| * [[Mozilla Hubs]] [https://hubs.mozilla.com/labs/mozilla-hubs-early-access-release/ Blog mozilla hubs early access release]. Mozilla Hubs is changing their payment structure from open source to a subscription service. They will be deleting open source content. | |||
| * [[The Correspondent]], From 30 Sep 19 to 1 Jan 21 [https://thecorrespondent.com The Correspondent] published member-funded journalism about the forces that shape our world. The organization has shut down[https://thecorrespondent.com/834] and has put up a read-only archive of all publications that were previously only accessible to paying members. As keeping such an archive running will probably cost someone money it might not stay online for many more years. | |||
| * YoyoGames, developer of the GameMaker application, is planning to retire the old "'''[[GameMaker Sandbox]]'''" game hosting website in favor of the "GameMaker: Player" service, by late October.<ref>http://gamemakerblog.com/2014/10/04/its-official-digital-store-will-replace-gamemaker-sandbox/</ref> [http://help.yoyogames.com/entries/101815476-GameMaker-Player-FAQs "Sandbox content will remain available for a period of time until the GameMaker: Player is fully live."] | * YoyoGames, developer of the GameMaker application, is planning to retire the old "'''[[GameMaker Sandbox]]'''" game hosting website in favor of the "GameMaker: Player" service, by late October.<ref>http://gamemakerblog.com/2014/10/04/its-official-digital-store-will-replace-gamemaker-sandbox/</ref> [http://help.yoyogames.com/entries/101815476-GameMaker-Player-FAQs "Sandbox content will remain available for a period of time until the GameMaker: Player is fully live."] | ||
| Line 117: | Line 143: | ||
| * [[Yahoo!]] [http://www.yqlblog.net/blog/2013/11/11/y-ahoo-it-url-shortener-end-of-life-announcement/ retired] the y.ahoo.it [[URLTeam|URL shortener]] November 20th 2013 but the shortener is still active. | * [[Yahoo!]] [http://www.yqlblog.net/blog/2013/11/11/y-ahoo-it-url-shortener-end-of-life-announcement/ retired] the y.ahoo.it [[URLTeam|URL shortener]] November 20th 2013 but the shortener is still active. | ||
| * CtoSims Has been infected with a Javascript that redirects to different pages, and the owner seems to have not been active for a long time. The entire site now redirects to a placeholder page that says "Coming Soon!". As of December  | * CtoSims Has been infected with a Javascript that redirects to different pages, and the owner seems to have not been active for a long time. The entire site now redirects to a placeholder page that says "Coming Soon!". {{As of|2016|December}}, All Downloads have been taken down as well. | ||
| * [[Giphy]]: Bought by Facebook, to be "integrated" (assimilated) into Instagram https://news.knowyourmeme.com/news/facebook-to-buy-giphy | |||
| * [[ | * '''[[Dayviews]]''' ({{url|1=http://dayviews.com/}}), the Swedish photo diary community contains quite a bit of 00s Swedish "youth culture". It's had "technical problems" for months now and has said that "many old photos were lost". | ||
| * [[ | * [[Surrender at 20]], a popular [[League of Legends]] news and teardown site, has suddenly ceased activity on November 14th, 2022. Site owner moobeat posted a tweet [https://twitter.com/moobeat/status/1592230197729837059 saying "life sucks major shit right now"] and alluded to some personal woes he's been going through. moobeat's wife, Aznbeat, posted a comment on November 25th on the site's latest post [https://www.surrenderat20.net/2022/11/winter-skin-splash-preview.html#comment-6050163346 stating that while she helped with running the site to pay their bills after moobeat was hospitalized for a while, she otherwise "had no interest in League, nor will I"] and that "unless by some miracle he ever decides to take some responsibility for it, then don't expect any more coverage from us". | ||
| * | * {{url|https://www.tinaja.com/}} Owner is deceased. | ||
| == See Also == | == See Also == | ||
Latest revision as of 14:50, 24 July 2025
Like many sites before them, these places indicate a sunny outlook, a clean bill of health and a total sense of "all systems go". But as we've found out from those many sites before them, fortunes can change overnight.
Archive Team considers these sites specifically of interest because they solicit so much content, contain so many works and projects by a wide group of people, or have the internet particularly dependent on them. Consider this a fire drill.. know what you can do to get your data off these sites and back them off for later.
Still Alive
Owned by Yahoo! Imminent Demise!
- Yahoo and AOL properties are being sold by Verizon to hedge fund Apollo in 2021. The news comes immediately after the closure of Yahoo! Answers and will presumably be swiftly followed by radical disruption for the sake of short-term cash milking.[1]
- Flickr contains billions of files, hundreds millions of which are under a Creative Commons license or stored there by many museums and other cultural institutions. The site was tumblr-ised in 2013 and has been poorly functional ever since; pro users were removed, so it doesn't yet have a business model. Additionally, it's owned by Yahoo!, need to say more?!- Flickr was sold to SmugMug in 2018, purged the biggest non-paying and non-freely licensed accounts and switched to a more predictable subscription-based model.
 
Watchlist
- 9GAG (http://9gag.com/[IA•Wcite•.today•MemWeb]) has been seeing a dwindling popularity, and its owner seems to love spending a lot of money on investing in NFTs.
- Academic Earth (http://academicearth.org/[IA•Wcite•.today•MemWeb]) has been worryingly unloved for a while, and holds a mountain of free education that's invaluable to the world.
- A-Infos (http://ainfos.ca/[IA•Wcite•.today•MemWeb]) a multi-lingual news service by, for, and about anarchists. Have stuff archived since the 90's that isn't available anywhere else.
- Encyclopedia Astronautica (http://www.astronautix.com/[IA•Wcite•.today•MemWeb]) is the most comprehensive collection of the history of space travel. Period. Seriously, the official NASA history folks will refer you this website if they can't answer your questions. However, Mark Wade (the sole creator/maintainer) abandoned his blog at the end of 2007, and the Encyclopedia has not been updated since May 2008, despite much happening in the space exploration world since then. A backup was made of the site as of 28/01/2017.
- Angelfire has been in constant decline for many years now.
- AnimeMusicVideos.org (http://www.animemusicvideos.org/[IA•Wcite•.today•MemWeb]) is fine right now, but they rely on donations and host vast amounts of user-edited music videos on their server (presumably without mirrors). Hard to download as you have to be a member to get all the download links, and after downloading a handful you have to vode before you can d/l again (or you can donate which presumably gives you 1 year of free d/l access). Also, this site might be a grey area, copyright-wise, as the videos are all cut together from copyrighted material.
- Archive of Our Own (https://archiveofourown.org/[IA•Wcite•.today•MemWeb]) is stable but contains a large catalog of fanfiction that would likely be lost if the site were to shut down.
- Baseportal is a web database which, although currently existing, isn't in the best state, seeing as the site hasn't updated to HTTPS and the forums are overran with spam. It would be great for archiving seeing as thousands use it. (including the Philippine government)
- The Believer Magazine (https://www.thebeliever.net/[IA•Wcite•.today•MemWeb]) was just purchased by what appears to be a sex toy company with an interest in keeping the archives up but they do not appear to be reliable or trustworthy custodians.The site has been sold back to its previous/original publisher, McSweeney's, since 2022. The website was originally https://believermag.com/[IA•Wcite•.today•MemWeb] (which now redirects to culture.org), and was moved on May 16 (coinciding with the change in ownership); it seems that all previous publications up to March 2003 are archived on the current site.
- BetaArchive (http://www.betaarchive.com/[IA•Wcite•.today•MemWeb]) has Kafkaesque requirements to be able to access it, and apparently refuses to be backed up, presumably so that they get more visitors. Valuable cultural library of historic software with no backups? Aargh.
- BioMedia Project (http://biomediaproject.com/bmp/[IA•Wcite•.today•MemWeb]) is a large archive of various BIONICLE media that has not had a notable update since 2015.
- Carrd (https://carrd.co[IA•Wcite•.today•MemWeb], https://crd.co[IA•Wcite•.today•MemWeb], https://ju.mp[IA•Wcite•.today•MemWeb], and https://uwu.ai[IA•Wcite•.today•MemWeb]) is a free web host, mainly used by minorities (the LGBTQ+ community and people with mental illnesses, for example) and Twitter/Tumblr communities. Also used for activism (one of the most popular Carrd websites is about Black Lives Matter, for example). Pretty stable as of now, but could become a good historical resource in the future.
- Catbox (https://catbox.moe/[IA•Wcite•.today•MemWeb]) is a file host whose main source of funding (Patreon) fucked them over[IA•Wcite•.today•MemWeb].
- Codecademy (http://www.codecademy.com/[IA•Wcite•.today•MemWeb]) has a large amount of valuable coding lessons.
- DatasheetArchive (http://www.datasheetarchive.com/[IA•Wcite•.today•MemWeb]) hosts over 350 million PDF datasheets for integrated circuits, some of which are very old and hard to track down otherwise. The site is slow from time to time and uses a convoluted IFRAME-based online viewer, presumably to make scraping the site harder. Nevertheless, multiple other similar sites exist, with large parts of their PDFs non-overlapping, so that at some point, all should be saved. Similar sites include http://doc.chipfind.ru/ (1.6m datasheets), http://www.alldatasheet.com/ (20m datasheets), http://www.datasheets.com/ (250m datasheets), http://www.datasheetcatalog.com/, http://freedatasheets.com/ and several others
- Delicious (http://www.delicious.com/[IA•Wcite•.today•MemWeb]) loves to change their API, which has a side effect of making it difficult to back up.
- Encyclopedia Dramatica ([edramatica.com edramatica.com][IA•Wcite•.today•MemWeb]) is frequently up, down, and changing domains due to general Internet drama.
- Facebook (http://www.facebook.com/[IA•Wcite•.today•MemWeb]) seems stable at the moment.
- Fandom (http://www.fandom.com/[IA•Wcite•.today•MemWeb]), the for-pay arm of Wikipedia (just kidding, it's a different company, but shares a lot of people) is a repository of directed, unsubject-to-wikipolitics wikis, many of them intense and completist. It'd be bad for them to go away.
- FanFiction (http://www.fanfiction.net/[IA•Wcite•.today•MemWeb]) represents many thousands of user-generated stories, essays and huge amounts of work.
- Forrst (http://zurb.com/forrst[IA•Wcite•.today•MemWeb]) was shut down on April 14th, 2014, but all posts were archived by Forrst.
- FreewareFiles (http://freewarefiles.net/[IA•Wcite•.today•MemWeb]) is a treasure trove of free and open source completed softwares. It's been around for 20 years, but hasn't had its look updated in a long while.
- FurAffinity (http://www.furaffinity.net/[IA•Wcite•.today•MemWeb])
- Google (http://www.google.com/[IA•Wcite•.today•MemWeb]) wants you to think they will be here forever.
- h2g2 (https://www.h2g2.com[IA•Wcite•.today•MemWeb]) was (among?) the first online, collaborative encyclopaedia(s).
- GBAtemp (https://GBAtemp.net[IA•Wcite•.today•MemWeb]) A popular forums site in the console homebrewing community. Most of GBAtemp is accessible without an account apart from user profiles containing profile posts. It doesn't appear to be in any danger, having been around for 20 years. Also has a Wiki (https://wiki.GBAtemp.net[IA•Wcite•.today•MemWeb]) which mentions compatibility about different backup loaders, site history, modchips, and so on.
- Identify Your Breyer - As of August 31st, 2022, "three curators" have come in to save the website. They're currently working on moving site hosts, but as of September 11th, 2022, they needed to push it back "a few weeks". While the fate of the site is still somewhat uncertain, it's no longer in danger of expiring within the next few months.
- IFTTT (http://ifttt.com/[IA•Wcite•.today•MemWeb]) is still growing.
- Internet Archive (http://www.archive.org/[IA•Wcite•.today•MemWeb]) seems stable at the moment, but its 45 petabytes of data (as of October 2018) aren't mirrored anywhere else. The code for their system isn't open source, and generally they're a single point of failure for a large amount of the web's history. Why should there be only 1 internet archive?
- There is a second instance of the Wayback Machine at Bibliotheca Alexandrina, but it appears to have been broken since several years as of October 2018 ("The Resource you have requested is temporarily unavailable" when trying to access a snapshot as of 2018-10-03). It also hasn't been updated since 2007, i.e. all crawl data since 2008 is only available at the Internet Archive.
- More discussion at INTERNETARCHIVE.BAK
 
- Invisionfree (http://invisionfree.com/[IA•Wcite•.today•MemWeb]) is as far as I can tell not used by as many people nowadays as before, probably because other free forum hosts use better forum software like phpBB that has better layout and looks better. The copyright notice on zIFBoards.com, which Invisionfree.com redirects to, has not been updated since 2014.
- JanusVR is a company that aims to re-imagine the web as "rooms" interconnected with portals. These "rooms" can be hosted anywhere. Though the company and its technology are still fairly new, there are already cases of rooms with missing assets or even entire links broken, and ArchiveBot's ability to grab these rooms is somewhat limited.
- JSFiddle (http://jsfiddle.net/[IA•Wcite•.today•MemWeb]) is referenced in many StackOverflow answers, as well as other forums, etc. It shows no signs of going away, but should we archive it just in case?
- Know Your Meme (http://knowyourmeme.com/[IA•Wcite•.today•MemWeb]) is at this point the de facto central repository for information on internet memes and culture. It is as popular as ever at the moment, but even with this popularity, former owners Rocketboom had trouble financing it. In the spring of 2011 was sold to Cheezburger Networks, a site which has been known to "reorganize" its properties, sometimes with a detrimental effect on content. Though it was quite a different story, I might remind people what happened to Encyclopedia Dramatica.
- Last.fm (http://www.last.fm/[IA•Wcite•.today•MemWeb]) is being cloned by free software developers in the form of Libre.fm -- they have a tool, Lastscrape which can get all your listening data out into a tab delimited text file.
- Literotica.com (http://literotica.com/[IA•Wcite•.today•MemWeb]) Contains over 290,000 user-written stories and poems. First pass at a backup: part1.rar, part2.rar, part3.rar, part4.rar -- contains the text of all stories as of the backup date in XML format. (One page of one story is missing because it doesn't exist on the site; embedded images and audio are not included this time; non-English stories aren't labelled with their language).
- LiveJournal (http://www.livejournal.com/[IA•Wcite•.today•MemWeb]) fired a bunch of US-based developers, but is still serving from its new (presumably cheaper) data center in Montana.
- Mapillary (http://www.mapillary.com/[IA•Wcite•.today•MemWeb]) project similar to Google's Street View, but with CC BY-SA photos submitted by users. Particularly worrisome since Mapillary requires purchasing licenses to download a large number of photos, making it essentially a big silo. See more at http://blog.improve-osm.org/en/2016/11/a-glimpse-into-the-future-of-mapmaking-with-osm-2/
- Megalodon.jp (http://megalodon.jp/[IA•Wcite•.today•MemWeb]) is a Japanese archiving website. Outside of Japanese users, reddit users use it sometimes.
- The Mod Archive (http://modarchive.org/[IA•Wcite•.today•MemWeb]) One of the largest collection of music modules.
- Mod DB (http://moddb.com/[IA•Wcite•.today•MemWeb]) is the largest website dedicated to user generated game content, including mods (12,000) and addons (14,000) with a combined size of 4,5 TB of user-generated downloadable content.
- Moegirlpedia (https://zh.moegirl.org[IA•Wcite•.today•MemWeb]) is the biggest Chinese-language encyclopedia for popular culture.
- MUGEN Archive (http://www.mugenarchive.com/[IA•Wcite•.today•MemWeb]) Holds 1000's of user created addon content for the custom fighting game MUGEN. Much of the stuff uploaded here was thought lost forever. However, thanks to all the copyright infringing content and uploading a lot of content without the creators permission, the future of the site is questionable.
- nyaa (https://nyaa.si/[IA•Wcite•.today•MemWeb]) - Biggest torrent site of Japanese popular culture.
- Pastebin (http://www.pastebin.com/[IA•Wcite•.today•MemWeb]) is still getting filled with text.
- Pixiv (http://www.pixiv.net/[IA•Wcite•.today•MemWeb]) and deviantArt (http://www.deviantart.com/[IA•Wcite•.today•MemWeb]) are the largest Japanese and American (respectively) fanart (and valuable art in general) collections on the internet. Most of works are possibly included in many boorus. Someone also mentioned Minitokyo around here somewhere. IA has a 2008 wallpaper dump from there.
- Pouet (http://www.pouet.net/[IA•Wcite•.today•MemWeb]) is an important site of the demoscene. It indexes and ranks demoscene productions ('prods') and also includes a free-for-all BBS-style forum. Demozoo is a site in the same vein with a slightly different focus. They both use the same ISP iirc so if that goes down a lot of user created content is lost.
- Reddit (http://www.reddit.com/[IA•Wcite•.today•MemWeb]) is a content aggregator where many Digg users migrated in 2010. Attracted controversy in July 2015, with accusations of censorship and shadowbanning. Many controversial subreddits (with up to hundreds of thousands subscribers each) are "quarantined"; many of these have subsequently been deleted. Stable for now, but team is small.
- SourceForge (http://www.sourceforge.net/[IA•Wcite•.today•MemWeb]) is a critical repository of open source code, information, and webpages. It is mirrored and maintained, but there are sure to be parts that are neither.
- The Pirate Bay (http://www.thepiratebay.org/[IA•Wcite•.today•MemWeb]) is one of the largest and most popular torrent search engines. It's still having persistent legal problems. The tracker went down in November 2012, but the site still serves torrents and magnet links. If a torrent is lost, it becomes impossible to connect to other computers distributing the shared files. Considering that there are links to TPB all over this wiki, this site is pretty dang important. After they were raided in December 2014, a project known as The Open Bay was launched, which lets anybody host a mirror of TPB with automatic database updates, so even if TPB goes down again, temporarily or not, its database is still available.
- Supercard http://supercard.sc/[IA•Wcite•.today•MemWeb]) is the website of a manufacturer of Nintendo DS flashcarts (notably the DSTWO). The website is fine but it has not been updated in years and some section of it are broken. Archiving the download section should be a priority as some flashcarts rely on software hosted there.
- Tribe (http://tribe.net[IA•Wcite•.today•MemWeb]) hosts large amount of user-generated data, and has been having consistent uptime issues.
- Tumblr (http://tumblr.com[IA•Wcite•.today•MemWeb]) is a highly popular blogging platform which was bought by Yahoo! in May, 2013.
- TVTropes (http://www.tvtropes.org/pmwiki/pmwiki.php/Main/HomePage[IA•Wcite•.today•MemWeb]) is a popular wiki dedicated to finding recurring patterns in fiction, and discussing fiction in general. No word on whether there are backups. The administrators have a tendency to delete things indiscriminately, usually to save on disk space: article edit histories are frequently purged, and old forum threads have been known to get deleted mercilessly. A backup and an alternate website with imported content from July 2012 are available.
- Urban Dictionary (https://www.urbandictionary.com/[IA•Wcite•.today•MemWeb]) is "a crowdsourced online dictionary for slang words and phrases"[2]. It's financed by ads and a web store, and there are no signs of serious trouble as of late 2019.
- WebCite (http://www.webcitation.org/[IA•Wcite•.today•MemWeb]) itself seems to be having trouble with funding, and is facing "possible discontinuation." As this site serves as a stable reference for fleeting Web references, it would be pretty disastrous if it went away.
- whitehouse.gov (http://www.whitehouse.gov/[IA•Wcite•.today•MemWeb]) is overhauled every time a new US president assumes office and changes continuously during the term. Old versions are preserved thanks to the Presidential Records Act (e.g. George W. Bush's), but we also want to watch out for site changes / disappeared pages that were embarrassing or whatnot.
- WikiLeaks (http://wikileaks.org/[IA•Wcite•.today•MemWeb]) contains several thousand leaked documents from sources such as the Iraq War and the cables famously known under the label 'Cablegate'. Due to the content on the website, and that PayPal and Amazon (very) quickly dropped their hosting for them during Cablegate's opening days, it should be considered a potential target for any number of government committees for a quick shutdown. They have an uncertain financial situation, and the site was inaccessible for some time in 2010.
- Wikidot (https://www.wikidot.com/[IA•Wcite•.today•MemWeb]) has not had development work for years, there is no official presence on their support forum, just many complaints about the difficulty contacted staff, even from those with paid accounts. Backup and wiki conversion tools are poor, services like search and anti-spam bots have problems for some time, and the site has been up for sale since February 2020.
- SCP Foundation is hosted on Wikidot. They plan on moving off of the platform and there are multiple archives, but there are still over 10 years' worth of creative writing that could be lost forever if Wikidot were to shut down.
 
- Wikipedia (http://www.wikipedia.org/[IA•Wcite•.today•MemWeb]) will surely be here forever and ever! Fortunately, we don't have to take their word for it as they offer dumps of the data minus the photos. However, no-one has verified that Wikipedia can actually be restored from these dumps. If disaster strikes then we could discover a serious problem.
- Writing (http://Writing.com/[IA•Wcite•.today•MemWeb]) A website for writing that was big in the 2000's. More and more restricted to guests and free users, as they are in need of money to keep the site running. Less and less popular for that reason
- YTMND - Resurrected on March 31st, 2020 with an archive being made in June 2018. Still on the Watchlist because of its past, but mostly safe for the time being.
Endangered
Did someone leave the oven on?
- [kirbysrainbowresort.net Kirby's Rainbow Resort] is a Kirby fan site that has a significant archive of old fan works and official media that's difficult to find elsewhere. Around the beginning of 2023, the website's forums, Oekaki imageboard, news updates, and other parts of the site have unceremoniously disappeared. The rest of the site hasn't been updated since 2020.
- Kiwi Farms (http://kiwifarms.net/[IA•Wcite•.today•MemWeb], former alternate .ms domain[IA•Wcite•.today•MemWeb]) is a notorious forum where users mock "lolcows", or people who have attracted ridicule because of their behavior or their beliefs. On January 20, 2017, the forum was shut down without a warning by its owner, but was restored three weeks later. As of September 2022, the website has been excluded from the Wayback Machine and the owner is struggling to find a new host due to a dispute with Cloudflare.
- Ning in 2010 has laid off 40% of staff and seems to be running out of money [1]. There is certainly some networks worth archiving among the 2 million networks[2] they host. Grouply[3] and Posterous[4] say they are going to offer migration tools.
- As of 2014[update], ScraperWiki Classic is now read-only. But don’t worry! You can transfer this scraper to Morph.io if you want to continue editing it.
- debates.oireachtas.ie on September 18th, 2012 the Houses of Oireachtas website announced that it would no longer be updating its XML data for Irish parliamentary debates (1919-2012). Access to pre-existing data is still available, but is likely to disappear, if the current trend continues. It would be useful to at least capture the XML data that is there, while it is still available. Here's a WARC archive of the XML only.
- Groklaw will no longer be posting new articles, "due to government monitoring of the internet, particularly e-mail." Whether or not its archives will remain online is unclear, although it does seem rather unlikely it will 100% disappear. OTOH, better safe than sorry. Still up as of November 28th, 2023.
- World of Spectrum's current administrator announced that he's ceasing to support the website and forum within 8 weeks (as of July 3). The future of the website is uncertain. Still up as of October 11th, 2022.
- The Centralstation Community has closed. The site is a UK-based social network for artists and creatives that provides hosting for content and portfolio. Users are being advised to back up their work as the new version of their platform will rely on existing media hosting sites like Flickr, Vimeo, and Soundcloud. Still up as of October 11th, 2022.
- Most of the paid staff at The Escapist has been "relieved of their duties" as of October 20, 2017, and the future and longevity of the site is uncertain; it's currently run mostly through volunteer efforts.
- Yelp, Inc. lost 30% of its advertisers and people don't seem too happy about it.
- Cheezburger https://www.cheezburger.com/[IA•Wcite•.today•MemWeb], once a “network” of meme blogs but now one single centralized site, has barely been maintained by its parent company (Cracked.com) for years, with the account creation system being broken since at least 2019. A lot of old meme images (of the Impact-font/rage-face/advice-animal variety) can be found here, dating back to 2007, and it would be a loss to the unique culture of the Internet if all these were to disappear. A lot of its contemporaries (such as Lolcats.com) have been lost to domain expiries and the ensuing cyber-squatters. There are still some dedicated users posting brand-new meme material here, but some people held on to GeoCities till the bitter end as well.
- Twitter (http://www.twitter.com/[IA•Wcite•.today•MemWeb]) is tweaking away, with a dire financial situation and controversial decisions. Now faces potential bankruptcy after being purchased by Elon Musk.
- OSDN is a hosting site for Japan, containing repositories of open-source code and web pages. However, since its acquisition by OSCHINA in 2022, the site has become very unstable, with frequent 504 errors and other issues.
- 5ch.net 5ch (formerly 2ch), an online forum in operation since May 30, 1999, has been experiencing ongoing instability due to issues such as script-based trolling . Archived posts dating back to 1999 are currently inaccessible, with the administrators citing a physical server failure as the cause. There is no estimated timeframe for their restoration.List of archive servers
- blog.seesaa.jp Seesaa Inc., the company operating Seesaa Blog was established in 2003 as a blog service provider. It became a subsidiary of Fan Communications in 2017 and was merged into the parent company in 2024. The company, which was in a poor financial state and operating at a loss just before the merger, also ran services like Seesaa Wiki and SS Blog (a business inherited from another company). SS Blog is scheduled to terminate service on March 31, 2025Notice of Termination of SS Blog Service
- zakzak ZakZak, an online news site operated by Sankei Newspaper, announced that it will cease to be updated on 2024/01/31, with no mention of continued website existence.Notice of Suspension of Newspaper
- Amino is a fandom based community that hosts millions of pieces of art, literature, and wikis. Several fired site mods have sounded alarms on how the site is on its last legs.
- Old websites made by Nintendo Korea old websites made by the company, such as https://www.nintendocaution.co.kr/, are in risk of getting deleted if the company decides to delete them.
Alarm
I smell smoke.
- Calorie Restriction Society [5]. an InvisionPowerBoard forum. See https://www.crsociety.org/topic/18710-crsocietyorg-finally-got-back-online-after-4-months/?do=getNewComment for details. Site went down for 4 months then went back up in October 2024. We still don't know if it may go down again, given that the administrators have gone out of contact and the original person paying for crsociety.org died.
- Mozilla Hubs Blog mozilla hubs early access release. Mozilla Hubs is changing their payment structure from open source to a subscription service. They will be deleting open source content.
- The Correspondent, From 30 Sep 19 to 1 Jan 21 The Correspondent published member-funded journalism about the forces that shape our world. The organization has shut down[6] and has put up a read-only archive of all publications that were previously only accessible to paying members. As keeping such an archive running will probably cost someone money it might not stay online for many more years.
- YoyoGames, developer of the GameMaker application, is planning to retire the old "GameMaker Sandbox" game hosting website in favor of the "GameMaker: Player" service, by late October.[3] "Sandbox content will remain available for a period of time until the GameMaker: Player is fully live."
- Yahoo! retired the y.ahoo.it URL shortener November 20th 2013 but the shortener is still active.
- CtoSims Has been infected with a Javascript that redirects to different pages, and the owner seems to have not been active for a long time. The entire site now redirects to a placeholder page that says "Coming Soon!". As of December 2016[update], All Downloads have been taken down as well.
- Giphy: Bought by Facebook, to be "integrated" (assimilated) into Instagram https://news.knowyourmeme.com/news/facebook-to-buy-giphy
- Dayviews (http://dayviews.com/[IA•Wcite•.today•MemWeb]), the Swedish photo diary community contains quite a bit of 00s Swedish "youth culture". It's had "technical problems" for months now and has said that "many old photos were lost".
- Surrender at 20, a popular League of Legends news and teardown site, has suddenly ceased activity on November 14th, 2022. Site owner moobeat posted a tweet saying "life sucks major shit right now" and alluded to some personal woes he's been going through. moobeat's wife, Aznbeat, posted a comment on November 25th on the site's latest post stating that while she helped with running the site to pay their bills after moobeat was hospitalized for a while, she otherwise "had no interest in League, nor will I" and that "unless by some miracle he ever decides to take some responsibility for it, then don't expect any more coverage from us".
- https://www.tinaja.com/[IA•Wcite•.today•MemWeb] Owner is deceased.
See Also
References
← Deathwatch • Alive... OR ARE THEY • Projects →