Web Roasting
Jump to navigation
Jump to search
Web Roasting - Save all the web hosting sites! | |
Status | Online! |
Archiving status | scraping In progress..., downloading Upcoming... |
Archiving type | Unknown |
IRC channel | #webroasting (on hackint) |
Sites hosted on free web hosts, where you make an account and get <your name>.host.com or users.host.com/<your name>, are inextricably tied to the hoster; there's no way to transfer a subdomain to another host, so when the host goes down, everything it hosted is gone forever. Web Roasting is a project to save these old web hosting sites before they shut down.
How can I help?
There are two ways you can help right now:
- Add more web hosting sites to the ISP Hosting or University Web Hosting pages.
- Scrape the following for hosted web sites:
- Google (site:webhost.com)
- Bing (site:webhost.com)
- DuckDuckGo (site:webhost.com)
- Yandex (site:webhost.com)
- Baidu (site:webhost.com)
- Twitter (litterapi preferred)
- Reddit (http://www.reddit.com/domain/webhost.com/)
- Links from MediaWiki wikis
- The Open Directory Project
- The Common Crawl Index
- The Wayback Machine
- URLTeam crawls
- DNSdumpster.com (only for hosts that use subdomains)
- pentest-tools.com (only for hosts that use subdomains)
- Sitemaps or other types of indexes, if the web host provides any.
Lists of hosts