Difference between revisions of "List of websites excluded from the Wayback Machine"

From Archiveteam
Jump to navigation Jump to search
(8 intermediate revisions by 5 users not shown)
Line 5: Line 5:
Past exclusions that are no longer active are tracked on the [[/Former exclusions]] subpage.
Past exclusions that are no longer active are tracked on the [[/Former exclusions]] subpage.


<!-- atwikibot:urlCount -->This list currently contains 1820 URLs.<!-- /atwikibot:urlCount -->
<!-- atwikibot:urlCount -->This list currently contains 1825 URLs.<!-- /atwikibot:urlCount -->
<!--
<!--
Editing notes:
Editing notes:
Line 11: Line 11:
* The counter above is also automatically updated by JAABot.
* The counter above is also automatically updated by JAABot.
-->
-->
* http://0x000000.com/
* http://0x000000.com/
* http://0xad.net/
* http://0xad.net/
Line 437: Line 436:
* https://www.chronicle-tribune.com/
* https://www.chronicle-tribune.com/
* http://www.church-calls.com/
* http://www.church-calls.com/
* http://www.cia-on-campus.org/ <!--dead (Mar 9 2019)-->
* http://www.cia-on-campus.org/ <!--dead (Mar 9 2019), Daniel Brandt -->
* http://cia-on-campus.org/ <!-- Daniel Brandt -->
* http://cicorp.com/
* http://cicorp.com/
* https://www.cigarbid.com/
* https://www.cigarbid.com/
Line 549: Line 547:
* http://demorgan.com.au/ <!--broken (Mar 8 2019)-->
* http://demorgan.com.au/ <!--broken (Mar 8 2019)-->
* http://dennismichaellynch.com/
* http://dennismichaellynch.com/
* https://deno.com/
* http://www.derceto.info/
* http://www.derceto.info/
* http://dereksmart.com/
* http://dereksmart.com/
Line 631: Line 630:
* http://empowernation.net/ <!--dead (Jul 27 2019)-->
* http://empowernation.net/ <!--dead (Jul 27 2019)-->
* https://en.luxuretv.com/
* https://en.luxuretv.com/
* https://www.eneba.com/
* http://www.enterprise-logic.com/
* http://www.enterprise-logic.com/
* https://www.eobot.com/
* https://www.eobot.com/
Line 1,044: Line 1,044:
* https://lond.com.br/
* https://lond.com.br/
* https://londontrustmedia.com/
* https://londontrustmedia.com/
* https://www.loom.com/
* http://lop.com/ <!--dead (Jun 2 2019)-->
* http://lop.com/ <!--dead (Jun 2 2019)-->
* http://lovepeace.top/
* http://lovepeace.top/
Line 1,076: Line 1,077:
* http://www.marinetimes.com/ <!--redirect to https://www.marinecorpstimes.com (Apr 6 2019)-->
* http://www.marinetimes.com/ <!--redirect to https://www.marinecorpstimes.com (Apr 6 2019)-->
* http://www.marissamarchant.com/ <!--dead (Mar 27 2019)-->
* http://www.marissamarchant.com/ <!--dead (Mar 27 2019)-->
* https://market-ticker.org/ <!-- not excluded as of 2019: http://archive.today/2019.03.21-203858/https://web.archive.org/web/20081209020430/http://market-ticker.org/ -->
* https://marketexclusive.com/
* https://marketexclusive.com/
* http://marketinginunderwear.com/
* http://marketinginunderwear.com/
Line 1,254: Line 1,256:
* https://omp.com/
* https://omp.com/
* http://www.oneacre.online/
* http://www.oneacre.online/
* https://onerep.com/ <!-- as first documented by Brian Krebs https://krebsonsecurity.com/2024/03/ceo-of-data-privacy-company-onerep-com-founded-dozens-of-people-search-firms/ (Mar 14 2024) -->
* http://www.online-poker-play.eu/
* http://www.online-poker-play.eu/
* https://www.onlinebingofun.nl/
* https://www.onlinebingofun.nl/
Line 1,269: Line 1,272:
* https://osvdb.org/ <!--dead (Mar 3 2019)-->
* https://osvdb.org/ <!--dead (Mar 3 2019)-->
* http://otvet.grimuar.info/
* http://otvet.grimuar.info/
* https://outbyte.com/ <!-- known from Jody Bruchon drama: https://www.youtube.com/watch?v=-uwYzqy3Eq8 -->
* http://www.outlawjournalism.com/
* http://www.outlawjournalism.com/
* https://overgrow.com/
* https://overgrow.com/

Revision as of 18:00, 21 March 2024

This page collects sites that are manually excluded from the Wayback Machine. When a site is manually excluded, attempting to access it returns the error "This URL has been excluded from the Wayback Machine". This applies to all subdomains as well, and as usual in the Wayback Machine, a leading www. is insignificant. This page does not track websites that disallow IA crawlers in their robots.txt file or block them. This list is not provided by the Internet Archive.

This page only collects entire websites (domains). For cases where only some parts of a domain are excluded, see the /Partial exclusions subpage.

Past exclusions that are no longer active are tracked on the /Former exclusions subpage.

This list currently contains 1825 URLs.