Difference between revisions of "Google Drive"

From Archiveteam
Jump to navigation Jump to search
(Changed "...an URL..." to "...a URL..."; changed IRC channel and server info.)
(Added in GWorkspace Exemption)
Line 11: Line 11:
'''Google Drive''' is a filehosting service, a la Dropbox, run by Google (not to be confused with [https://cloud.google.com/ Google Cloud Storage] and similar more technical storage solutions). It is popular both for personal storage and for sharing of files.
'''Google Drive''' is a filehosting service, a la Dropbox, run by Google (not to be confused with [https://cloud.google.com/ Google Cloud Storage] and similar more technical storage solutions). It is popular both for personal storage and for sharing of files.


Google Drive IDs are not random (anecdotally, IDs of folders in the same tree often share long parts), which makes them predictable, a problem which Google has been trying to rectify across its products (others of which have similar issues) throughout 2021<ref>https://www.theverge.com/google/2021/6/23/22548007/youtube-google-drive-security-link-sharing</ref>. As such, on September 13, 2021, Google will be requiring that, in order to access files and folders, users either have permissions tied to their signed-in Google Accounts, or access the item through a URL with a random per-item parameter called resourceKey, apparently introduced in 2021.<ref>https://support.google.com/a/answer/10685032</ref> The result of this will be that at least millions of links across the Web will effectively break.
Google Drive IDs are not random (anecdotally, IDs of folders in the same tree often share long parts), which makes them predictable, a problem which Google has been trying to rectify across its products (others of which have similar issues) throughout 2021<ref>https://www.theverge.com/google/2021/6/23/22548007/youtube-google-drive-security-link-sharing</ref>. As such, on September 13, 2021, Google will be requiring that, in order to access files and folders, users either have permissions tied to their signed-in Google Accounts, or access the item through a URL with a random per-item parameter called resourceKey, apparently introduced in 2021.<ref>https://support.google.com/a/answer/10685032</ref> The result of this will be that at least millions of links across the Web will effectively break. It should be noted that Docs, Sheets and Slides will be exempted from this update <ref>https://support.google.com/a/answer/10685032</ref>


== Grab ==
== Grab ==

Revision as of 19:45, 6 September 2021

Google Drive
Google drive logo.png
URL http://drive.google.com/[IAWcite.todayMemWeb]
Status Special case (Technically files are only becoming inaccessible)
Archiving status Upcoming...
Archiving type Unknown
IRC channel #googlecrash (on hackint)

Google Drive is a filehosting service, a la Dropbox, run by Google (not to be confused with Google Cloud Storage and similar more technical storage solutions). It is popular both for personal storage and for sharing of files.

Google Drive IDs are not random (anecdotally, IDs of folders in the same tree often share long parts), which makes them predictable, a problem which Google has been trying to rectify across its products (others of which have similar issues) throughout 2021[1]. As such, on September 13, 2021, Google will be requiring that, in order to access files and folders, users either have permissions tied to their signed-in Google Accounts, or access the item through a URL with a random per-item parameter called resourceKey, apparently introduced in 2021.[2] The result of this will be that at least millions of links across the Web will effectively break. It should be noted that Docs, Sheets and Slides will be exempted from this update [3]

Grab

The upcoming grab script has 3 item types, folder:, file:, and user:. It is intended that all folder: items be run first, to get a pool (through backfeed) of file: items, that can be randomly sampled to determine a size threshold that the Internet Archive will accept; then files will be run. Users contain some user metadata but not links to files or folders.

Playback is theoretically possible with a flexible, POST-capable Wayback Machine, but this does not yet exist. In the meantime, it may be possible to get files with vanilla wget or similar from the WBM, and hopefully a tool to list folders can be created as well.