Difference between revisions of "List of websites excluded from the Wayback Machine"

From Archiveteam
Jump to navigation Jump to search
(Added oberhumer.com – that's the parent site of the LZO data compression format!)
(booru.allthefallen.moe was not excluded like 1 or 3 days before now; now = Fri Apr 29 07:03:48 2022 UTC. shadbase.com = popular rule 34/hentai/porn artist website.)
Line 3: Line 3:
This page only collects entire websites (domains). For cases where only some parts of a domain are excluded, see the [[/Partial exclusions]] subpage.
This page only collects entire websites (domains). For cases where only some parts of a domain are excluded, see the [[/Partial exclusions]] subpage.


<!-- atwikibot:urlCount -->This list currently contains 1689 URLs.<!-- /atwikibot:urlCount -->
<!-- atwikibot:urlCount -->This list currently contains 1692 URLs.<!-- /atwikibot:urlCount -->


* http://0x000000.com/
* http://0x000000.com/
Line 161: Line 161:
* https://www.allirishcasino.com/
* https://www.allirishcasino.com/
* http://www.allsang.net/
* http://www.allsang.net/
* https://allthefallen.moe/
* http://allthingsbillbelichick.com/
* http://allthingsbillbelichick.com/
* http://alnie.net/
* http://alnie.net/
Line 947: Line 948:
* http://lightcash.io/ <!--dead (Mar 21 2019)-->
* http://lightcash.io/ <!--dead (Mar 21 2019)-->
* http://www.limitlessled.com/ <!--dead (Jun 22 2019)-->
* http://www.limitlessled.com/ <!--dead (Jun 22 2019)-->
* https://www.linguee.com/
* http://linguee.de/
* http://linguee.de/
* https://linktr.ee
* https://linktr.ee
Line 1,387: Line 1,389:
* http://www.sex-porn-lolitas-teens.com/ <!--dead (Mar 8 2019)-->
* http://www.sex-porn-lolitas-teens.com/ <!--dead (Mar 8 2019)-->
* http://seznamclanku.info/ <!--dead (May 16 2019)-->
* http://seznamclanku.info/ <!--dead (May 16 2019)-->
* http://www.shadbase.com/
* http://shahrazad.net/ <!--dead (Mar 20 2019)-->
* http://shahrazad.net/ <!--dead (Mar 20 2019)-->
* http://shii.org/ <!--dead (Mar 10 2019)-->
* http://shii.org/ <!--dead (Mar 10 2019)-->

Revision as of 09:07, 30 April 2022

This page collects sites that are manually excluded from the Wayback Machine. When a site is manually excluded, attempting to access it returns the error "This URL has been excluded from the Wayback Machine". This page does not track websites that disallow IA crawlers in their robots.txt file or block them. This list is not provided by the Internet Archive.

This page only collects entire websites (domains). For cases where only some parts of a domain are excluded, see the /Partial exclusions subpage.

This list currently contains 1692 URLs.