Difference between revisions of "Google"

From Archiveteam
Jump to navigation Jump to search
m (Google on Wheels! moved to Google over redirect: Unwheeling)
 
(→‎Vital Signs: YouTube, Google Docs and Google Drive being purged in 2024?)
Tag: merged edit of another user
(36 intermediate revisions by 17 users not shown)
Line 1: Line 1:
Google probably isn't Evil per se, but they do want you to put all of your data on their servers. Trusting any one company that much is probably a bad idea. If your entire life is on Google, what happens to Google happens to you.
[[File:Google Logo.png|thumbnail|right|Google logo until September 2015.]]
'''Google''' is currently one of the largest Internet-based companies in existence (if not THE biggest), hosting dozens of different services.
 
'''Google''' probably isn't evil per se, but they do want you to put all of your data on their servers. Trusting any one company that much is probably a bad idea. If your entire life is on Google, what happens to Google happens to you. For a remote idea of what can happen, look at [[Yahoo!]].
 
Archive Team has decided to take a census of Google services, to see what has and hasn't been saved. See the [[Froogle|Froogle project]].


== Backup Tools ==
== Backup Tools ==
[http://www.dataliberation.org/ DataLiberation] is an engineering team at Google whose singular goal is to make it easier for users to move their data in and out of Google products. Here you can find instructions to backup from every Google service.


=== Blogger ===
=== Blogger ===
Line 11: Line 17:
=== Gmail ===
=== Gmail ===


* [http://www.gmail-backup.com/ Gmail Backup]
* [https://github.com/eblume/gmail_safe gmail_safe] incremental gmail backup nodejs package. It saves thread information (Google Mail 'conversations') and Google Mail labels. It is decently fast (about 20 emails per second) without using much CPU or RAM.
* [http://www.gmail-backup.com/ Gmail Backup] allows you to backup your emails in EML format and optionally upload them again into a separate Gmail account.
* Gmail provides IMAP access, so you can use [http://software.complete.org/software/projects/show/offlineimap OfflineIMAP] to backup and sync your complete archive in standard UNIX maildir format, usable by Mutt, Thunderbird and most sane e-mail clients. See [http://soren.overgaard.org/2007/12/15/backing-up-gmail-using-offlineimap/ this blog post] for more details.
* Gmail provides IMAP access, so you can use [http://software.complete.org/software/projects/show/offlineimap OfflineIMAP] to backup and sync your complete archive in standard UNIX maildir format, usable by Mutt, Thunderbird and most sane e-mail clients. See [http://soren.overgaard.org/2007/12/15/backing-up-gmail-using-offlineimap/ this blog post] for more details.
* POP access is a very simple way to continuously download all your emails in Gmail to your favorite email client. This method doesn't preserve the label/folder structure, though - but does include your emails that are sent from Gmail.
* POP access is a very simple way to continuously download all your emails in Gmail to your favorite email client. This method doesn't preserve the label/folder structure, though - but does include your emails that are sent from Gmail.
* You may also want to consider setting up forwarding of all your emails in Gmail to a Yahoo account or some other email provider (that has enough quota to work as your archive).
* You may also want to consider setting up forwarding of all your emails in Gmail to an Outlook account or some other email provider (that has enough quota to work as your archive).


=== Google Docs ===
=== Google Calendar ===


* [http://1st-soft.net/gdd/ GM Script by Peter Schafer] download Google Docs en masse.
* [http://www.google.com/support/calendar/bin/answer.py?hl=en&answer=37111 Export your Google Calendar]


* [http://code.google.com/p/gdatacopier/ gdatacopier] "Bi-directional copy utility & API for Google docs"
=== Google Docs Editors ===


=== Google Calendar ===
==== Tools ====
* [http://1st-soft.net/gdd/ GM Script by Peter Schafer] - Download Google Docs en masse.
* [http://code.google.com/p/gdatacopier/ gdatacopier] - "Bi-directional copy utility & API for Google docs"
==== URL patterns ====
Note: Due to https://github.com/ArchiveTeam/wpull/issues/425 and Google's use of HTTP 307 Redirects, the Export Menu based URLs currently do not work in [[ArchiveBot]]. They also do not work in Chromebot. They do work in IA SPN. The view/edit/mobilebasic URLs do appear to work in AB and/or Chromebot.
 
===== Documents =====
Using Document docid 17-kk6FR8PGku4XhCy2TFBSbEeIWUlm082qgaqXAP81Y (The Declaration of Independence - modern translation 2012) as an example:
 
* Document -> View/Edit/MobileBasic/HTML Export
# https://docs.google.com/document/d/17-kk6FR8PGku4XhCy2TFBSbEeIWUlm082qgaqXAP81Y/view
# https://docs.google.com/document/d/17-kk6FR8PGku4XhCy2TFBSbEeIWUlm082qgaqXAP81Y/edit
# https://docs.google.com/document/d/17-kk6FR8PGku4XhCy2TFBSbEeIWUlm082qgaqXAP81Y/mobilebasic
# https://docs.google.com/document/d/17-kk6FR8PGku4XhCy2TFBSbEeIWUlm082qgaqXAP81Y/export
* Document Export Menu Options -> odt, docx, pdf, zip (of HTML), epub, rtf, txt
# https://docs.google.com/document/export?format=odt&id=17-kk6FR8PGku4XhCy2TFBSbEeIWUlm082qgaqXAP81Y
# https://docs.google.com/document/export?format=docx&id=17-kk6FR8PGku4XhCy2TFBSbEeIWUlm082qgaqXAP81Y
# https://docs.google.com/document/export?format=pdf&id=17-kk6FR8PGku4XhCy2TFBSbEeIWUlm082qgaqXAP81Y
# https://docs.google.com/document/export?format=zip&id=17-kk6FR8PGku4XhCy2TFBSbEeIWUlm082qgaqXAP81Y
# https://docs.google.com/document/export?format=epub&id=17-kk6FR8PGku4XhCy2TFBSbEeIWUlm082qgaqXAP81Y
# https://docs.google.com/document/export?format=rtf&id=17-kk6FR8PGku4XhCy2TFBSbEeIWUlm082qgaqXAP81Y
# https://docs.google.com/document/export?format=txt&id=17-kk6FR8PGku4XhCy2TFBSbEeIWUlm082qgaqXAP81Y
 
=====Spreadsheets=====
Using Spreadsheet docid 17SP23Ce2IuJQJx5qbBRGNiRuv0My_ddvcIZBTnBdA7s (Recipe Cost Calculator) as an example:
 
* Spreadsheets -> View/Edit/XLSX Export
# https://docs.google.com/spreadsheets/d/17SP23Ce2IuJQJx5qbBRGNiRuv0My_ddvcIZBTnBdA7s/view
# https://docs.google.com/spreadsheets/d/17SP23Ce2IuJQJx5qbBRGNiRuv0My_ddvcIZBTnBdA7s/edit
# https://docs.google.com/spreadsheets/d/17SP23Ce2IuJQJx5qbBRGNiRuv0My_ddvcIZBTnBdA7s/export
 
* Spreadsheets Export Menu Options -> ods, xlsx, pdf, zip (of HTML), csv (current sheet), tsv (current sheet)
# https://docs.google.com/spreadsheets/d/17SP23Ce2IuJQJx5qbBRGNiRuv0My_ddvcIZBTnBdA7s/export?format=ods
# https://docs.google.com/spreadsheets/d/17SP23Ce2IuJQJx5qbBRGNiRuv0My_ddvcIZBTnBdA7s/export?format=xlsx
# https://docs.google.com/spreadsheets/d/17SP23Ce2IuJQJx5qbBRGNiRuv0My_ddvcIZBTnBdA7s/export?format=pdf
# https://docs.google.com/spreadsheets/d/17SP23Ce2IuJQJx5qbBRGNiRuv0My_ddvcIZBTnBdA7s/export?format=zip
# https://docs.google.com/spreadsheets/d/17SP23Ce2IuJQJx5qbBRGNiRuv0My_ddvcIZBTnBdA7s/export?format=csv
# https://docs.google.com/spreadsheets/d/17SP23Ce2IuJQJx5qbBRGNiRuv0My_ddvcIZBTnBdA7s/export?format=tsv
 
=====Slides=====
Using Slide docid 1YHAddiVocHyU38NKT-SIFBMk0gHRg-DBDQOINRjV73Y (Timing Individual Google Slides) as an example:
 
* Slides -> View/Edit
# https://docs.google.com/presentation/d/1YHAddiVocHyU38NKT-SIFBMk0gHRg-DBDQOINRjV73Y/view
# https://docs.google.com/presentation/d/1YHAddiVocHyU38NKT-SIFBMk0gHRg-DBDQOINRjV73Y/edit
 
* Slides Export Menu Options -> odp, pptx, pdf, png, jpg, svg, txt
# https://docs.google.com/presentation/d/1YHAddiVocHyU38NKT-SIFBMk0gHRg-DBDQOINRjV73Y/export/odp
# https://docs.google.com/presentation/d/1YHAddiVocHyU38NKT-SIFBMk0gHRg-DBDQOINRjV73Y/export/pptx
# https://docs.google.com/presentation/d/1YHAddiVocHyU38NKT-SIFBMk0gHRg-DBDQOINRjV73Y/export/pdf
# https://docs.google.com/presentation/d/1YHAddiVocHyU38NKT-SIFBMk0gHRg-DBDQOINRjV73Y/export/png
# https://docs.google.com/presentation/d/1YHAddiVocHyU38NKT-SIFBMk0gHRg-DBDQOINRjV73Y/export/jpg
# https://docs.google.com/presentation/d/1YHAddiVocHyU38NKT-SIFBMk0gHRg-DBDQOINRjV73Y/export/svg
# https://docs.google.com/presentation/d/1YHAddiVocHyU38NKT-SIFBMk0gHRg-DBDQOINRjV73Y/export/txt
 
=== Google Drive ===
 
* For content that should go into the IA WBM, [[ArchiveBot]] can be used to grab direct links to individual files. This example uses file id 19Vk6mrf6FY1iCKudKFjeQDXTfpGiElMd (for NTSB report 'WOA 8535 CVR Transcript.pdf'):
** chromebot: a https://drive.google.com/file/d/19Vk6mrf6FY1iCKudKFjeQDXTfpGiElMd/view
** !ao https://drive.google.com/uc?id=19Vk6mrf6FY1iCKudKFjeQDXTfpGiElMd&export=download
 
* wget can also be used with the 'export=download' URL. See https://clay-atlas.com/us/blog/2020/08/27/linux-en-wget-download-google-drive-files/
** wget "https://drive.google.com/u/1/uc?id=19Vk6mrf6FY1iCKudKFjeQDXTfpGiElMd&export=download"
 
Additional tools that may be useful:
 
* https://github.com/odeke-em/drive
 
* https://github.com/segnolin/google-drive-folder-downloader
 
* https://github.com/astrada/google-drive-ocamlfuse
 
* https://github.com/ericyd/gdrive-copy
 
* https://github.com/iwestlin/gd-utils (Chinese)
 
* https://github.com/iwestlin/gd-utils/blob/master/compare.md
 
* https://github.com/roshanconnor123/gd-utils (gd-utils, English fork)
 
* https://github.com/roshanconnor123/Gdutils_Tgbot
 
* https://github.com/prasmussen/gdrive
 
* https://itsfoss.com/use-google-drive-linux/
 
* https://linuxconfig.org/how-to-use-google-drive-on-linux


* [http://www.google.com/support/calendar/bin/answer.py?hl=en&answer=37111 Export your Google Calendar]
* https://www.driveexport.com/


=== Google Reader ===
=== Google Gears ===


* [http://ze-ze.cn/2008/01/how-to-backup-articles-from-google-reader.html How to Back Up Articles from Google Reader]
* Is not a backup tool per se but at least for Google Docs and Gmail GGears downloads all documents/attachments to your computer as readable documents (which can be found in your user profile/Google folder(s)). Google Gears is no longer supported by Google.


=== Google Notebook ===
=== Google Notebook ===
Line 34: Line 128:
* Has been announced to be discontinued. GNotebook (luckily) has an export-to-XML function (a link at the bottom of the screen) that at least [http://diigo.com Diigo] and [http://evernote.com Evernote] are able to import (without coding skills).
* Has been announced to be discontinued. GNotebook (luckily) has an export-to-XML function (a link at the bottom of the screen) that at least [http://diigo.com Diigo] and [http://evernote.com Evernote] are able to import (without coding skills).


=== Google Gears ===
=== Google Reader ===


* Is not a backup tool per se but at least for Google Docs and Gmail GGears downloads all documents/attachments to your computer as readable documents (which can be found in your user profile/Google folder(s)).
An RSS/feed reader webapp with discoverability features for finding new feeds. On the 13th of March, [http://googlereader.blogspot.com/2013/03/powering-down-google-reader.html Google announced that they would shut down Google Reader at 1st of July].
* [http://googlereader.blogspot.com/2013/03/powering-down-google-reader.html Powering Down Google Reader]
* [http://googleblog.blogspot.com/2013/03/a-second-spring-of-cleaning.html A second spring of cleaning]
* [http://ze-ze.cn/2008/01/how-to-backup-articles-from-google-reader.html How to Back Up Articles from Google Reader]
* [http://support.google.com/reader/answer/3028851 How can I download my Reader data?]


== Miscellaneous ==
== Miscellaneous ==


Does a tool suite exist that backs up all of the Google Apps cloud?
Does a tool suite exist that backs up all of the Google Apps cloud?
Generally, data can be exported with the "Google Takeout" interface which sends a series of compressed archives with data from the various services. It's not always reliable.<ref>[https://www.theguardian.com/technology/2020/feb/04/google-software-glitch-sent-some-users-videos-to-strangers Google software glitch sent some users' videos to strangers: Bug affected users of Google Takeout exporting from Google Photos in late November], 2020-02-04.</ref>


== Vital Signs ==
== Vital Signs ==


Pump up the NASDAQ.
Pump up the NASDAQ.
=== Google Photos ===
;November 2018: [https://www.reddit.com/r/DataHoarder/comments/a5grrw/google_photos_will_no_longer_provide_unlimited/ Unsupported videos will no longer have unlimited space]
=== Google Plus ===
;April 2, 2019: [[Google+]] was shut down. [https://www.theverge.com/2018/12/10/18134541/google-plus-privacy-api-data-leak-developers]
=== Inactive accounts ===
Starting in 2024, the content of accounts inactive for over 2 years will be eligible for deletion across all Google products: «if a Google Account has not been used or signed into for at least 2 years, we may delete the account and its contents – including content within Google Workspace (Gmail, Docs, Drive, Meet, Calendar), YouTube and Google Photos.»
See the [https://blog.google/technology/safety-security/updating-our-inactive-account-policies/ 2023-05-16 announcement].
=== Other ===
Over 150 products, including web services, discontinued by Google: https://killedbygoogle.com/
== See also ==
* [[Google Books Ngram]]
{{Navigation box}}
[[Category:Corporations]]
[[Category:Google]]
{{DISPLAYTITLE:<span style="font-family:product-sans"><span style=color:#4885ed>G</span><span style=color:#db3236>o</span><span style=color:#f4c20d>o</span><span style=color:#4885ed>g</span><span style=color:#3cba54>l</span><span style=color:#db3236>e</span>
</span>}}

Revision as of 05:56, 17 May 2023

Google logo until September 2015.

Google is currently one of the largest Internet-based companies in existence (if not THE biggest), hosting dozens of different services.

Google probably isn't evil per se, but they do want you to put all of your data on their servers. Trusting any one company that much is probably a bad idea. If your entire life is on Google, what happens to Google happens to you. For a remote idea of what can happen, look at Yahoo!.

Archive Team has decided to take a census of Google services, to see what has and hasn't been saved. See the Froogle project.

Backup Tools

DataLiberation is an engineering team at Google whose singular goal is to make it easier for users to move their data in and out of Google products. Here you can find instructions to backup from every Google service.

Blogger

Gmail

  • gmail_safe incremental gmail backup nodejs package. It saves thread information (Google Mail 'conversations') and Google Mail labels. It is decently fast (about 20 emails per second) without using much CPU or RAM.
  • Gmail Backup allows you to backup your emails in EML format and optionally upload them again into a separate Gmail account.
  • Gmail provides IMAP access, so you can use OfflineIMAP to backup and sync your complete archive in standard UNIX maildir format, usable by Mutt, Thunderbird and most sane e-mail clients. See this blog post for more details.
  • POP access is a very simple way to continuously download all your emails in Gmail to your favorite email client. This method doesn't preserve the label/folder structure, though - but does include your emails that are sent from Gmail.
  • You may also want to consider setting up forwarding of all your emails in Gmail to an Outlook account or some other email provider (that has enough quota to work as your archive).

Google Calendar

Google Docs Editors

Tools

URL patterns

Note: Due to https://github.com/ArchiveTeam/wpull/issues/425 and Google's use of HTTP 307 Redirects, the Export Menu based URLs currently do not work in ArchiveBot. They also do not work in Chromebot. They do work in IA SPN. The view/edit/mobilebasic URLs do appear to work in AB and/or Chromebot.

Documents

Using Document docid 17-kk6FR8PGku4XhCy2TFBSbEeIWUlm082qgaqXAP81Y (The Declaration of Independence - modern translation 2012) as an example:

  • Document -> View/Edit/MobileBasic/HTML Export
  1. https://docs.google.com/document/d/17-kk6FR8PGku4XhCy2TFBSbEeIWUlm082qgaqXAP81Y/view
  2. https://docs.google.com/document/d/17-kk6FR8PGku4XhCy2TFBSbEeIWUlm082qgaqXAP81Y/edit
  3. https://docs.google.com/document/d/17-kk6FR8PGku4XhCy2TFBSbEeIWUlm082qgaqXAP81Y/mobilebasic
  4. https://docs.google.com/document/d/17-kk6FR8PGku4XhCy2TFBSbEeIWUlm082qgaqXAP81Y/export
  • Document Export Menu Options -> odt, docx, pdf, zip (of HTML), epub, rtf, txt
  1. https://docs.google.com/document/export?format=odt&id=17-kk6FR8PGku4XhCy2TFBSbEeIWUlm082qgaqXAP81Y
  2. https://docs.google.com/document/export?format=docx&id=17-kk6FR8PGku4XhCy2TFBSbEeIWUlm082qgaqXAP81Y
  3. https://docs.google.com/document/export?format=pdf&id=17-kk6FR8PGku4XhCy2TFBSbEeIWUlm082qgaqXAP81Y
  4. https://docs.google.com/document/export?format=zip&id=17-kk6FR8PGku4XhCy2TFBSbEeIWUlm082qgaqXAP81Y
  5. https://docs.google.com/document/export?format=epub&id=17-kk6FR8PGku4XhCy2TFBSbEeIWUlm082qgaqXAP81Y
  6. https://docs.google.com/document/export?format=rtf&id=17-kk6FR8PGku4XhCy2TFBSbEeIWUlm082qgaqXAP81Y
  7. https://docs.google.com/document/export?format=txt&id=17-kk6FR8PGku4XhCy2TFBSbEeIWUlm082qgaqXAP81Y
Spreadsheets

Using Spreadsheet docid 17SP23Ce2IuJQJx5qbBRGNiRuv0My_ddvcIZBTnBdA7s (Recipe Cost Calculator) as an example:

  • Spreadsheets -> View/Edit/XLSX Export
  1. https://docs.google.com/spreadsheets/d/17SP23Ce2IuJQJx5qbBRGNiRuv0My_ddvcIZBTnBdA7s/view
  2. https://docs.google.com/spreadsheets/d/17SP23Ce2IuJQJx5qbBRGNiRuv0My_ddvcIZBTnBdA7s/edit
  3. https://docs.google.com/spreadsheets/d/17SP23Ce2IuJQJx5qbBRGNiRuv0My_ddvcIZBTnBdA7s/export
  • Spreadsheets Export Menu Options -> ods, xlsx, pdf, zip (of HTML), csv (current sheet), tsv (current sheet)
  1. https://docs.google.com/spreadsheets/d/17SP23Ce2IuJQJx5qbBRGNiRuv0My_ddvcIZBTnBdA7s/export?format=ods
  2. https://docs.google.com/spreadsheets/d/17SP23Ce2IuJQJx5qbBRGNiRuv0My_ddvcIZBTnBdA7s/export?format=xlsx
  3. https://docs.google.com/spreadsheets/d/17SP23Ce2IuJQJx5qbBRGNiRuv0My_ddvcIZBTnBdA7s/export?format=pdf
  4. https://docs.google.com/spreadsheets/d/17SP23Ce2IuJQJx5qbBRGNiRuv0My_ddvcIZBTnBdA7s/export?format=zip
  5. https://docs.google.com/spreadsheets/d/17SP23Ce2IuJQJx5qbBRGNiRuv0My_ddvcIZBTnBdA7s/export?format=csv
  6. https://docs.google.com/spreadsheets/d/17SP23Ce2IuJQJx5qbBRGNiRuv0My_ddvcIZBTnBdA7s/export?format=tsv
Slides

Using Slide docid 1YHAddiVocHyU38NKT-SIFBMk0gHRg-DBDQOINRjV73Y (Timing Individual Google Slides) as an example:

  • Slides -> View/Edit
  1. https://docs.google.com/presentation/d/1YHAddiVocHyU38NKT-SIFBMk0gHRg-DBDQOINRjV73Y/view
  2. https://docs.google.com/presentation/d/1YHAddiVocHyU38NKT-SIFBMk0gHRg-DBDQOINRjV73Y/edit
  • Slides Export Menu Options -> odp, pptx, pdf, png, jpg, svg, txt
  1. https://docs.google.com/presentation/d/1YHAddiVocHyU38NKT-SIFBMk0gHRg-DBDQOINRjV73Y/export/odp
  2. https://docs.google.com/presentation/d/1YHAddiVocHyU38NKT-SIFBMk0gHRg-DBDQOINRjV73Y/export/pptx
  3. https://docs.google.com/presentation/d/1YHAddiVocHyU38NKT-SIFBMk0gHRg-DBDQOINRjV73Y/export/pdf
  4. https://docs.google.com/presentation/d/1YHAddiVocHyU38NKT-SIFBMk0gHRg-DBDQOINRjV73Y/export/png
  5. https://docs.google.com/presentation/d/1YHAddiVocHyU38NKT-SIFBMk0gHRg-DBDQOINRjV73Y/export/jpg
  6. https://docs.google.com/presentation/d/1YHAddiVocHyU38NKT-SIFBMk0gHRg-DBDQOINRjV73Y/export/svg
  7. https://docs.google.com/presentation/d/1YHAddiVocHyU38NKT-SIFBMk0gHRg-DBDQOINRjV73Y/export/txt

Google Drive

Additional tools that may be useful:

Google Gears

  • Is not a backup tool per se but at least for Google Docs and Gmail GGears downloads all documents/attachments to your computer as readable documents (which can be found in your user profile/Google folder(s)). Google Gears is no longer supported by Google.

Google Notebook

  • Has been announced to be discontinued. GNotebook (luckily) has an export-to-XML function (a link at the bottom of the screen) that at least Diigo and Evernote are able to import (without coding skills).

Google Reader

An RSS/feed reader webapp with discoverability features for finding new feeds. On the 13th of March, Google announced that they would shut down Google Reader at 1st of July.

Miscellaneous

Does a tool suite exist that backs up all of the Google Apps cloud?

Generally, data can be exported with the "Google Takeout" interface which sends a series of compressed archives with data from the various services. It's not always reliable.[1]

Vital Signs

Pump up the NASDAQ.

Google Photos

November 2018
Unsupported videos will no longer have unlimited space

Google Plus

April 2, 2019
Google+ was shut down. [1]

Inactive accounts

Starting in 2024, the content of accounts inactive for over 2 years will be eligible for deletion across all Google products: «if a Google Account has not been used or signed into for at least 2 years, we may delete the account and its contents – including content within Google Workspace (Gmail, Docs, Drive, Meet, Calendar), YouTube and Google Photos.»

See the 2023-05-16 announcement.

Other

Over 150 products, including web services, discontinued by Google: https://killedbygoogle.com/

See also