Difference between revisions of "List of websites excluded from the Wayback Machine"

From Archiveteam
(Merge edit by Flashfire42)
Tag: merged edit of another user
Line 1: Line 1:
This page collects sites that are manually excluded from the Wayback Machine. When a site is manually excluded, attempting to access it returns the error "This URL has been excluded from the Wayback Machine". This page does not track websites that disallow IA crawlers in their robots.txt file or block them. This list is not provided by the Internet Archive.


<!-- atwikibot:urlCount -->This list currently contains 1606 URLs.<!-- /atwikibot:urlCount -->
<!-- atwikibot:urlCount -->This list currently contains 1607 URLs.<!-- /atwikibot:urlCount -->


* http://heritage.stsci.edu/
* http://0x000000.com/
* http://0xad.net/
Line 758: Line 757:
* https://www.help.org/
* https://hentaihaven.org/
* http://heritage.stsci.edu/
* http://heronews.top/ <!--dead (Jul 31 2019)-->
* http://www.herpy.net/ <!--redirect to https://e621.net/ (Jun 22 2019)-->

Revision as of 01:00, 16 July 2020

This page collects sites that are manually excluded from the Wayback Machine. When a site is manually excluded, attempting to access it returns the error "This URL has been excluded from the Wayback Machine". This page does not track websites that disallow IA crawlers in their robots.txt file or block them. This list is not provided by the Internet Archive.

This list currently contains 1607 URLs.