Difference between revisions of "Alive... OR ARE THEY"

From Archiveteam
Jump to navigation Jump to search
m (→‎All the others: Forgot Have. Also note on last edit, Wikileaks not Wikipedia.)
(Added Mod DB)
Line 23: Line 23:
* '''[[BetaArchive]]''' ({{url|http://www.betaarchive.com/}}) has Kafkaesque requirements to be able to access it, and apparently refuses to be backed up, presumably so that they get more visitors. Valuable cultural library of historic software with no backups? Aargh.
* '''[[BetaArchive]]''' ({{url|http://www.betaarchive.com/}}) has Kafkaesque requirements to be able to access it, and apparently refuses to be backed up, presumably so that they get more visitors. Valuable cultural library of historic software with no backups? Aargh.
* '''[[Codecademy]]''' ({{url|1=http://www.codecademy.com/}}) has a large amount of valuable coding lessons.
* '''[[Codecademy]]''' ({{url|1=http://www.codecademy.com/}}) has a large amount of valuable coding lessons.
*'''[[The Mod Archive]]''' ({{url|1=http://modarchive.org/}}) One of the largest collection of music modules.
* '''[[cyberpunkreview.com]]''': 80s science fiction fansite and community {{url|1=http://cyberpunkreview.com/}} hasn't seen much staff activity in a long time, although the forums are going strong. UPDATE: Looking active again. [[User:Aggroskater|Aggroskater]] 08:26, 19 March 2012 (EDT)
* '''[[cyberpunkreview.com]]''': 80s science fiction fansite and community {{url|1=http://cyberpunkreview.com/}} hasn't seen much staff activity in a long time, although the forums are going strong. UPDATE: Looking active again. [[User:Aggroskater|Aggroskater]] 08:26, 19 March 2012 (EDT)
* '''[[Delicious]]''' ({{url|1=http://www.delicious.com/}}) loves to change their API, which has a side effect of making it difficult to back up.
* '''[[Delicious]]''' ({{url|1=http://www.delicious.com/}}) loves to change their API, which has a side effect of making it difficult to back up.
Line 39: Line 38:
* '''[[Literotica.com]]''' ({{url|1=http://literotica.com/}}) Contains over 290,000 user-written stories and poems. First pass at a backup: [http://mir.cr/12CMQUTL part1.rar], [http://mir.cr/HO79CCUO part2.rar], [http://mir.cr/TOVJWQ4E part3.rar], [http://mir.cr/1SIAB4AM part4.rar] -- contains the text of all stories as of the backup date in XML format. (One page of one story is missing because it doesn't exist on the site; embedded images and audio are not included this time; non-English stories aren't labelled with their language).
* '''[[Literotica.com]]''' ({{url|1=http://literotica.com/}}) Contains over 290,000 user-written stories and poems. First pass at a backup: [http://mir.cr/12CMQUTL part1.rar], [http://mir.cr/HO79CCUO part2.rar], [http://mir.cr/TOVJWQ4E part3.rar], [http://mir.cr/1SIAB4AM part4.rar] -- contains the text of all stories as of the backup date in XML format. (One page of one story is missing because it doesn't exist on the site; embedded images and audio are not included this time; non-English stories aren't labelled with their language).
* '''[[LiveJournal]]''' ({{url|1=http://www.livejournal.com/}}) fired a bunch of US-based developers, but is still serving from its new (presumably cheaper) data center in Montana.
* '''[[LiveJournal]]''' ({{url|1=http://www.livejournal.com/}}) fired a bunch of US-based developers, but is still serving from its new (presumably cheaper) data center in Montana.
*'''[[The Mod Archive]]''' ({{url|1=http://modarchive.org/}}) One of the largest collection of music modules.
*'''[[Mod DB]]''' ({{url|1=http://moddb.com/}}) is the largest website dedicated to user generated game content, including mods (12,000) and addons (14,000) with a combined size of 4,5 TB of user-generated downloadable content.
* '''[[Pastebin]]''' ({{url|1=http://www.pastebin.com/}}) is still getting filled with text.
* '''[[Pastebin]]''' ({{url|1=http://www.pastebin.com/}}) is still getting filled with text.
* '''[[Pixiv]]''' ({{url|1=http://www.pixiv.net/}}) and '''[[deviantArt]]''' ({{url|1=http://www.deviantart.com/}}) are the largest Japanese and American (respectively) fanart (and valuable art in general) collections on the internet.
* '''[[Pixiv]]''' ({{url|1=http://www.pixiv.net/}}) and '''[[deviantArt]]''' ({{url|1=http://www.deviantart.com/}}) are the largest Japanese and American (respectively) fanart (and valuable art in general) collections on the internet.

Revision as of 13:01, 9 June 2015

Like many sites before them, these places indicate a sunny outlook, a clean bill of health and a total sense of "all systems go". But as we've found out from those many sites before them, fortunes can change overnight.

Archive Team considers these sites specifically of interest because they solicit so much content, contain so many works and projects by a wide group of people, or have the internet particularly dependent on them. Consider this a fire drill.. know what you can do to get your data off these sites and back them off for later.

Still Alive

Not so alive, rather living deads (owned by Yahoo!)

  • Flickr contains billions of files, hundreds millions of which are under a Creative Commons license or stored there by many museums and other cultural institutions. The site was tumblr-ised in 2013 and has been poorly functional ever since; pro users were removed, so it doesn't yet have a business model. Additionally, it's owned by Yahoo!, need to say more?!

All the others

So Worried

Did someone leave the oven on?

  • FriendFeed (http://friendfeed.com/[IAWcite.todayMemWeb]) has been purchased by Facebook, leaving FriendFeed users uncertain as to its future and mostly unsupported. The Twitter bridge, for instance, has not worked for years now.
  • Ning in 2010 has laid off 40% of staff and seems to be running out of money [1]. There is certainly some networks worth archiving among the 2 million networks[2] they host. Grouply[3] and Posterous[4] say they are going to offer migration tools.
  • As of 2014, ScraperWiki Classic is now read-only. But don’t worry! You can transfer this scraper to Morph.io if you want to continue editing it.
  • Convozine hasn't been active lately. Their last reply to a support question was in 2012, their last update in the "News" section was December 2011, and their last blog post was in January 2013. (See [5] and [6].)
  • debates.oireachtas.ie on September 18th, 2012 the Houses of Oireachtas website announced that it would no longer be updating its XML data for Irish parliamentary debates (1919-2012). Access to pre-existing data is still available, but is likely to disappear, if the current trend continues. It would be useful to at least capture the XML data that is there, while it is still available. Here's a WARC archive of the XML only.
  • ownlog.com - once one of the most popular and oldest blog platform in Poland seems to be dying slowly - no development and actualizations except most critical maintenance.
  • The Grid (magazine in Toronto) printed its last issue on July 3rd 2014 (see here) not sure how long the site will stay up. Saved by ArchiveBot
  • Nakido (site) claims to be a "time capsule" that will "host your files for decades" - except it's a commercial enterprise selling premium acounts, and uses a proprietary P2P platform for delivery. What could possibly go wrong?
  • Groklaw will no longer be posting new articles, "due to government monitoring of the internet, particularly e-mail." Whether or not its archives will remain online is unclear, although it does seem rather unlikely it will 100% disappear. OTOH, better safe than sorry.
  • Strawpoll.me
  • The Centralstation Community has closed. The site is a UK-based social network for artists and creatives that provides hosting for content and portfolio. Users are being advised to back up their work as the new version of their platform will rely on existing media hosting sites like Flickr, Vimeo, and Soundcloud.

Fire Alarm Sounds Like Whoop Whoop Whoop

I smell smoke.

  • Ovi Store's infrastructure is slowly rotting away.
  • Blip.tv will be removing accounts/videos on September 1st, 2014.

See Also

References