Difference between revisions of "List of websites excluded from the Wayback Machine"

From Archiveteam
Jump to navigation Jump to search
(michaelv.org – it was a web-based Windows 3.1 replica (no emulator) created by Michael Vincent that went defunct circa 2016 or 2017. They already disallowed ia_archiver via robots.txt prior to exclusion.)
m (Add judyrecords.com)
Line 3: Line 3:
This page only collects entire websites (domains). For cases where only some parts of a domain are excluded, see the [[/Partial exclusions]] subpage.
This page only collects entire websites (domains). For cases where only some parts of a domain are excluded, see the [[/Partial exclusions]] subpage.


<!-- atwikibot:urlCount -->This list currently contains 1687 URLs.<!-- /atwikibot:urlCount -->
<!-- atwikibot:urlCount -->This list currently contains 1688 URLs.<!-- /atwikibot:urlCount -->


* http://0x000000.com/
* http://0x000000.com/
Line 882: Line 882:
* http://josimar.com/
* http://josimar.com/
* https://jury.online/
* https://jury.online/
* https://www.judyrecords.com/
* http://www.justin.tv/ <!--redirect to https://www.twitch.tv/ (Mar 3 2019)-->
* http://www.justin.tv/ <!--redirect to https://www.twitch.tv/ (Mar 3 2019)-->
* https://www.justinobeirne.com/
* https://www.justinobeirne.com/

Revision as of 05:54, 23 February 2022

This page collects sites that are manually excluded from the Wayback Machine. When a site is manually excluded, attempting to access it returns the error "This URL has been excluded from the Wayback Machine". This page does not track websites that disallow IA crawlers in their robots.txt file or block them. This list is not provided by the Internet Archive.

This page only collects entire websites (domains). For cases where only some parts of a domain are excluded, see the /Partial exclusions subpage.

This list currently contains 1688 URLs.