https://wiki.archiveteam.org/api.php?action=feedcontributions&user=Jaakkoh&feedformat=atomArchiveteam - User contributions [en]2024-03-29T06:41:17ZUser contributionsMediaWiki 1.37.1https://wiki.archiveteam.org/index.php?title=Projects&diff=481Projects2009-04-04T05:39:39Z<p>Jaakkoh: /* Ideas for Projects */</p>
<hr />
<div>Here's where Archive Teamsters can list the projects they are currently working on and organize new projects.<br />
<br />
== Active Projects ==<br />
<br />
* '''[[User:Jscott|Jason Scott]]''' is running [http://www.textfiles.com Textfiles.com] and archiving a ton of things.<br />
* '''[[User:Ross|Ross]]''' is interviewing the sites of 2008.<br />
* '''[[User:LesOrchard|l.m.orchard]]''' is starting work on some self-hosted web apps that will migrate and archive from other sites. (ie. [http://github.com/lmorchard/friendfeedarchiver FriendFeed], [http://github.com/lmorchard/memex/ Delicious])<br />
<br />
== Ideas for Projects ==<br />
<br />
* '''Set up''' an FTP hub which AT members can access and up/down finished projects.<br />
* Track the 100+ top twitter feeds, as designated by one of these idiot Twitter grading sites, and back up on a regular basis the top twitter people, for posterity.<br />
* '''[http://www.groklaw.net/ Groklaw]''' has a [http://www.groklaw.net/article.php?story=20090105033126835 project proposal] that we could help with. - [[User:Jscott|Jason]]<br />
* '''Archive''' the shutdown announcement pages on dead sites.<br />
* '''RSS Feed''' with death notices. - [[User:Jscott|Jason]]<br />
* '''Twitter profile''' might be a good way to broadcast new site obituaries. - psicom<br />
* '''[[TinyURL]]''' and similar services, scraping/backup - [[User:scumola|Steve]]<br />
** highlight services that at least allow exporting data ([[Diigo]] that I know of). Next "best" - services that have registeration and enable viewing your URL / saving them by e.g. saving as HTML ([[tr.im]]). Etc. --[[User:Jaakkoh|Jaakkoh]] 05:39, 4 April 2009 (UTC)<br />
* '''[http://symphony21.com/ Symphony]''' could [http://nick-dunn.co.uk/article/symphony-as-a-data-preservation-utility/ potentially be used] for archiving structured XML/RSS feeds to a relational database - [[User:nickdunn|Nick]]<br />
<br />
== Finished Projects ==<br />
<br />
* [[User:Jscott|Jason]] founded the Archive Team.</div>Jaakkohhttps://wiki.archiveteam.org/index.php?title=Talk:Alive..._OR_ARE_THEY&diff=480Talk:Alive... OR ARE THEY2009-04-04T04:39:56Z<p>Jaakkoh: /* Add a section to the article that suggest workarounds */ new section</p>
<hr />
<div>Has anyone attempted to restore a running copy of wikipedia from the dumps? If so do the dumps provide enough data that, in the event of a catastrophic data failure, wikipedia could be brought back up using just the backups?--[[User:Adewale|Adewale]] 08:42, 27 February 2009 (UTC)<br />
* That's a very good question. The problem is that the dumps are now insanely huge. I don't know how many people would even have the capacity to unpack them. --[[User:Jscott|Jscott]] 15:45, 27 February 2009 (UTC)<br />
* This: http://aws.amazon.com/publicdatasets/#1 suggests that DBpedia and FreeBase are attempting to maintain their own structured versions of the wikipedia dataset. Theoretically, if Amazon keeps their public dataset up to date, then it is possible to restore wikipedia from that.<br />
<br />
== Add a section to the article that suggest workarounds ==<br />
<br />
Easiest (to save content) is to submit to multiple websites. My personal favorites for typical files I upload are [[Ovi Share]] (by Nokia, unlimited diskspace for quite a wide variety of files but no easy mass download), [[Scribd]], [[docstoc]], [[Slideshare]] and [[Box.net]] (from which I can download files as a zip file). --[[User:Jaakkoh|Jaakkoh]] 04:39, 4 April 2009 (UTC)</div>Jaakkohhttps://wiki.archiveteam.org/index.php?title=Google&diff=479Google2009-04-04T04:06:18Z<p>Jaakkoh: /* Gmail */</p>
<hr />
<div>Google probably isn't Evil per se, but they do want you to put all of your data on their servers. Trusting any one company that much is probably a bad idea. If your entire life is on Google, what happens to Google happens to you.<br />
<br />
== Backup Tools ==<br />
<br />
=== Blogger ===<br />
<br />
* [http://google-opensource.blogspot.com/2009/01/google-blog-converters-10-released.html Google Blog Converters 1.0] uses Python to convert between Blogger, LiveJournal, MovableType, and WordPress. <br />
<br />
* Blogger can now export the entire contents of a blog, over at [http://draft.blogger.com/ Blogger in Draft].<br />
<br />
=== Gmail ===<br />
<br />
* [http://www.gmail-backup.com/ Gmail Backup]<br />
* Gmail provides IMAP access, so you can use [http://software.complete.org/software/projects/show/offlineimap OfflineIMAP] to backup and sync your complete archive in standard UNIX maildir format, usable by Mutt, Thunderbird and most sane e-mail clients. See [http://soren.overgaard.org/2007/12/15/backing-up-gmail-using-offlineimap/ this blog post] for more details.<br />
* POP access is a very simple way to continuously download all your emails in Gmail to your favorite email client. This method doesn't preserve the label/folder structure, though - but does include your emails that are sent from Gmail.<br />
* You may also want to consider setting up forwarding of all your emails in Gmail to a Yahoo account or some other email provider (that has enough quota to work as your archive).<br />
<br />
=== Google Docs ===<br />
<br />
* [http://1st-soft.net/gdd/ GM Script by Peter Schafer] download Google Docs en masse.<br />
<br />
* [http://code.google.com/p/gdatacopier/ gdatacopier] "Bi-directional copy utility & API for Google docs"<br />
<br />
=== Google Calendar ===<br />
<br />
* [http://www.google.com/support/calendar/bin/answer.py?hl=en&answer=37111 Export your Google Calendar]<br />
<br />
=== Google Reader ===<br />
<br />
* [http://ze-ze.cn/2008/01/how-to-backup-articles-from-google-reader.html How to Back Up Articles from Google Reader]<br />
<br />
=== Google Notebook ===<br />
<br />
* Has been announced to be discontinued. GNotebook (luckily) has an export-to-XML function (a link at the bottom of the screen) that at least [http://diigo.com Diigo] and [http://evernote.com Evernote] are able to import (without coding skills).<br />
<br />
=== Google Gears ===<br />
<br />
* Is not a backup tool per se but at least for Google Docs and Gmail GGears downloads all documents/attachments to your computer as readable documents (which can be found in your user profile/Google folder(s)).<br />
<br />
== Miscellaneous ==<br />
<br />
Does a tool suite exist that backs up all of the Google Apps cloud?<br />
<br />
== Vital Signs ==<br />
<br />
Pump up the NASDAQ.</div>Jaakkohhttps://wiki.archiveteam.org/index.php?title=Google&diff=478Google2009-04-04T04:05:02Z<p>Jaakkoh: /* Gmail */</p>
<hr />
<div>Google probably isn't Evil per se, but they do want you to put all of your data on their servers. Trusting any one company that much is probably a bad idea. If your entire life is on Google, what happens to Google happens to you.<br />
<br />
== Backup Tools ==<br />
<br />
=== Blogger ===<br />
<br />
* [http://google-opensource.blogspot.com/2009/01/google-blog-converters-10-released.html Google Blog Converters 1.0] uses Python to convert between Blogger, LiveJournal, MovableType, and WordPress. <br />
<br />
* Blogger can now export the entire contents of a blog, over at [http://draft.blogger.com/ Blogger in Draft].<br />
<br />
=== Gmail ===<br />
<br />
* [http://www.gmail-backup.com/ Gmail Backup]<br />
* Gmail provides IMAP access, so you can use [http://software.complete.org/software/projects/show/offlineimap OfflineIMAP] to backup and sync your complete archive in standard UNIX maildir format, usable by Mutt, Thunderbird and most sane e-mail clients. See [http://soren.overgaard.org/2007/12/15/backing-up-gmail-using-offlineimap/ this blog post] for more details.<br />
* POP access is a very simple way to continuously download all your emails in Gmail to your favorite email client. This method doesn't preserve the label/folder structure, though - but does include your emails that are sent from Gmail.<br />
<br />
=== Google Docs ===<br />
<br />
* [http://1st-soft.net/gdd/ GM Script by Peter Schafer] download Google Docs en masse.<br />
<br />
* [http://code.google.com/p/gdatacopier/ gdatacopier] "Bi-directional copy utility & API for Google docs"<br />
<br />
=== Google Calendar ===<br />
<br />
* [http://www.google.com/support/calendar/bin/answer.py?hl=en&answer=37111 Export your Google Calendar]<br />
<br />
=== Google Reader ===<br />
<br />
* [http://ze-ze.cn/2008/01/how-to-backup-articles-from-google-reader.html How to Back Up Articles from Google Reader]<br />
<br />
=== Google Notebook ===<br />
<br />
* Has been announced to be discontinued. GNotebook (luckily) has an export-to-XML function (a link at the bottom of the screen) that at least [http://diigo.com Diigo] and [http://evernote.com Evernote] are able to import (without coding skills).<br />
<br />
=== Google Gears ===<br />
<br />
* Is not a backup tool per se but at least for Google Docs and Gmail GGears downloads all documents/attachments to your computer as readable documents (which can be found in your user profile/Google folder(s)).<br />
<br />
== Miscellaneous ==<br />
<br />
Does a tool suite exist that backs up all of the Google Apps cloud?<br />
<br />
== Vital Signs ==<br />
<br />
Pump up the NASDAQ.</div>Jaakkohhttps://wiki.archiveteam.org/index.php?title=Google&diff=477Google2009-04-04T04:02:36Z<p>Jaakkoh: /* Backup Tools */</p>
<hr />
<div>Google probably isn't Evil per se, but they do want you to put all of your data on their servers. Trusting any one company that much is probably a bad idea. If your entire life is on Google, what happens to Google happens to you.<br />
<br />
== Backup Tools ==<br />
<br />
=== Blogger ===<br />
<br />
* [http://google-opensource.blogspot.com/2009/01/google-blog-converters-10-released.html Google Blog Converters 1.0] uses Python to convert between Blogger, LiveJournal, MovableType, and WordPress. <br />
<br />
* Blogger can now export the entire contents of a blog, over at [http://draft.blogger.com/ Blogger in Draft].<br />
<br />
=== Gmail ===<br />
<br />
* [http://www.gmail-backup.com/ Gmail Backup]<br />
* Gmail provides IMAP access, so you can use [http://software.complete.org/software/projects/show/offlineimap OfflineIMAP] to backup and sync your complete archive in standard UNIX maildir format, usable by Mutt, Thunderbird and most sane e-mail clients. See [http://soren.overgaard.org/2007/12/15/backing-up-gmail-using-offlineimap/ this blog post] for more details.<br />
<br />
=== Google Docs ===<br />
<br />
* [http://1st-soft.net/gdd/ GM Script by Peter Schafer] download Google Docs en masse.<br />
<br />
* [http://code.google.com/p/gdatacopier/ gdatacopier] "Bi-directional copy utility & API for Google docs"<br />
<br />
=== Google Calendar ===<br />
<br />
* [http://www.google.com/support/calendar/bin/answer.py?hl=en&answer=37111 Export your Google Calendar]<br />
<br />
=== Google Reader ===<br />
<br />
* [http://ze-ze.cn/2008/01/how-to-backup-articles-from-google-reader.html How to Back Up Articles from Google Reader]<br />
<br />
=== Google Notebook ===<br />
<br />
* Has been announced to be discontinued. GNotebook (luckily) has an export-to-XML function (a link at the bottom of the screen) that at least [http://diigo.com Diigo] and [http://evernote.com Evernote] are able to import (without coding skills).<br />
<br />
=== Google Gears ===<br />
<br />
* Is not a backup tool per se but at least for Google Docs and Gmail GGears downloads all documents/attachments to your computer as readable documents (which can be found in your user profile/Google folder(s)).<br />
<br />
== Miscellaneous ==<br />
<br />
Does a tool suite exist that backs up all of the Google Apps cloud?<br />
<br />
== Vital Signs ==<br />
<br />
Pump up the NASDAQ.</div>Jaakkoh