Difference between revisions of "Comcast Personal Web Pages"

From Archiveteam
Jump to navigation Jump to search
Line 22: Line 22:
* TODO: Scrape Reddit
* TODO: Scrape Reddit
* TODO: Scrape links from MediaWiki wikis
* TODO: Scrape links from MediaWiki wikis
* TODO: Scrape the Open Directory Project
* [http://paste.archivingyoursh.it/raw/jawacafexo Open Directory Project scrape]
* [http://paste.archivingyoursh.it/raw/busosagonu Common Crawl scrape]
* [http://paste.archivingyoursh.it/raw/busosagonu Common Crawl scrape]
* TODO: Scrape the Wayback Machine
* TODO: Scrape the Wayback Machine

Revision as of 05:10, 27 March 2015

Comcast Personal Web Pages
Comcast Personal Web Pages logo
URL home.comcast.net
Status Online!
Archiving status Upcoming...
Archiving type Unknown
IRC channel #comclose (on hackint)


Discovery

Sites follow two patterns:

Items

  • TODO: Scrape Google
  • TODO: Scrape Bing
  • TODO: Scrape DuckDuckGo
  • TODO: Scrape Twitter
  • TODO: Scrape Reddit
  • TODO: Scrape links from MediaWiki wikis
  • Open Directory Project scrape
  • Common Crawl scrape
  • TODO: Scrape the Wayback Machine
  • TODO: Scrape URLTeam dumps