DocumentCloud
| DocumentCloud | |
All-in-one platform used by newsrooms around the world to manage primary source documents. | |
| URL | https://www.documentcloud.org/home/ |
| Status | Online! |
| Archiving status | Unknown (most of the site) Partially saved (some of it) |
| Archiving type | ArchiveBot Unknown |
| IRC channel | #archiveteam-bs (on hackint) |
DocumentCloud is a document-sharing website that allows users to upload, analyze, annotate, collaborate on, and publish primary-source documents, such as court filings. It is associated with the FOIA-supporting nonprofit MuckRock (https://www.muckrock.com/), with which it merged in 2018.
Each document page has a "Download File" link in the right-hand column that points to the corresponding PDF on s3.documentcloud.org. Alternatively, the PDF URL can be derived from the document page URL.
Given URLs like:
- https://www.documentcloud.org/documents/26468878-mehtavought/
- https://www.documentcloud.org/documents/26103424-bendler-sentencing-memo/
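You can use sed 's,www.,s3.,;s,-,/,;s,/$,,;s,$,.pdf,' to convert them to the s3 URLs for archival via ArchiveBot. The expression swaps the www host for s3, replaces the first hyphen (between the numeric ID and the slug) with a slash, drops the trailing slash, and appends .pdf.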
That gives the URLs:
- https://s3.documentcloud.org/documents/26468878/mehtavought.pdf
- https://s3.documentcloud.org/documents/26103424/bendler-sentencing-memo.pdf
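For converting many document pages at once, the same sed expression can be wrapped in a small shell function that reads one URL per line. This is only a sketch: the function name to_s3_pdf_urls is hypothetical, and it assumes every document follows the ID-slug pattern of the two examples above, which has not been verified for the whole site.

```shell
#!/bin/sh
# Hypothetical helper (name is an assumption, not part of any tool):
# reads DocumentCloud document-page URLs on stdin, one per line, and
# prints the matching s3 PDF URLs using the sed expression from this page.
to_s3_pdf_urls() {
    # 1. www. -> s3.            (switch to the s3 host)
    # 2. first "-" -> "/"       (split numeric ID from the slug)
    # 3. strip a trailing "/"   (no-op if the URL has none)
    # 4. append ".pdf"
    sed 's,www.,s3.,;s,-,/,;s,/$,,;s,$,.pdf,'
}

# Usage example with the two document pages listed above.
printf '%s\n' \
    'https://www.documentcloud.org/documents/26468878-mehtavought/' \
    'https://www.documentcloud.org/documents/26103424-bendler-sentencing-memo/' \
    | to_s3_pdf_urls
```

Run as-is, this prints the two s3 URLs listed above. To process a saved list, redirect a file into the function, for example to_s3_pdf_urls < pages.txt > pdfs.txt (both file names are placeholders), and submit the resulting list to ArchiveBot.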