Difference between revisions of "Saunalahti Iso G"
Jump to navigation
Jump to search
m (→Items) |
m (Adding reddit, Bing and DMOZ scrapes) |
||
Line 19: | Line 19: | ||
=== Items === | === Items === | ||
* Google scrape ([http://paste.nerds.io/raw/erapemokuq pp.fi Google Scrape] [http://paste.nerds.io/raw/ituyerawap saunalahti.fi Google Scrape]) | * Google scrape ([http://paste.nerds.io/raw/erapemokuq pp.fi Google Scrape] [http://paste.nerds.io/raw/ituyerawap saunalahti.fi Google Scrape]) | ||
* | * Scrape Bing ([http://paste.nerds.io/raw/jizicigolo Bing Scrape]) | ||
* TODO: Scrape DuckDuckGo | * TODO: Scrape DuckDuckGo | ||
* TODO: Scrape Twitter | * TODO: Scrape Twitter | ||
* | * Scrape Reddit ([http://paste.nerds.io/raw/umopojuvox reddit /domain/ search]) | ||
* TODO: Scrape links from MediaWiki wikis | * TODO: Scrape links from MediaWiki wikis | ||
* | * Scrape the Open Directory Project ([http://paste.nerds.io/raw/mevevoripi DMOZ domain search]) | ||
* TODO: Scrape the Common Crawl Index | * TODO: Scrape the Common Crawl Index | ||
* TODO: Scrape the Wayback Machine | * TODO: Scrape the Wayback Machine |
Revision as of 19:03, 6 April 2015
Saunalahti Iso G | |
URL | pp.fi, saunalahti.fi |
Status | Closing |
Archiving status | Upcoming... |
Archiving type | Unknown |
IRC channel | #isohno (on hackint) |
Shutting down on an unspecified date.
Discovery
Sites follow three patterns:
Items
- Google scrape (pp.fi Google Scrape saunalahti.fi Google Scrape)
- Scrape Bing (Bing Scrape)
- TODO: Scrape DuckDuckGo
- TODO: Scrape Twitter
- Scrape Reddit (reddit /domain/ search)
- TODO: Scrape links from MediaWiki wikis
- Scrape the Open Directory Project (DMOZ domain search)
- TODO: Scrape the Common Crawl Index
- TODO: Scrape the Wayback Machine
- TODO: Scrape URLTeam dumps