Difference between revisions of "URLs"

From Archiveteam
Jump to navigation Jump to search
(+specific sources)
((I didn't write this, JAA did, but this seems important to put on the wiki))
Line 11: Line 11:


The '''URLs project''' is a continuous generic project to archive random URLs from various sources, e.g. external links discovered in other projects or in older archives. Current projects as of early 2021 that send outlinks to URLs include the [[Reddit]] and [[Yahoo! Answers]] projects.
The '''URLs project''' is a continuous generic project to archive random URLs from various sources, e.g. external links discovered in other projects or in older archives. Current projects as of early 2021 that send outlinks to URLs include the [[Reddit]] and [[Yahoo! Answers]] projects.
Important note: If you run this project, you'll likely see your IP get banned from Facebook, Instagram, YouTube, etc., and using those sites may become difficult (e.g. constant captchas, forced login). Also, if you run at significant speed, you'll likely see abuse notices, IP blacklists, and so on.

Revision as of 02:14, 26 July 2021

URLs
URL https://url.spec.whatwg.org/
Status Special case
Archiving status In progress...
Archiving type Unknown
Project source urls-grab
Project tracker urls
IRC channel #// (on hackint)

The URLs project is a continuous generic project to archive random URLs from various sources, e.g. external links discovered in other projects or in older archives. Current projects as of early 2021 that send outlinks to URLs include the Reddit and Yahoo! Answers projects.

Important note: If you run this project, you'll likely see your IP get banned from Facebook, Instagram, YouTube, etc., and using those sites may become difficult (e.g. constant captchas, forced login). Also, if you run at significant speed, you'll likely see abuse notices, IP blacklists, and so on.