Difference between revisions of "MBinternet"
Jump to navigation
Jump to search
m (→Items) |
m (Add google scrape and CDX search data) |
||
Line 19: | Line 19: | ||
=== Items === | === Items === | ||
* | * [http://paste.nerds.io/raw/munafisaqo Google Search Scrape ] [http://paste.nerds.io/raw/pitebucuye Users Only] | ||
* [http://paste.archivingyoursh.it/raw/wecexewufo Bing scrape] | * [http://paste.archivingyoursh.it/raw/wecexewufo Bing scrape] | ||
* [http://paste.archivingyoursh.it/raw/sofowixoge Twitter scrape] | * [http://paste.archivingyoursh.it/raw/sofowixoge Twitter scrape] | ||
Line 26: | Line 26: | ||
* [http://paste.archivingyoursh.it/raw/wocefukaje Common Crawl scrape] | * [http://paste.archivingyoursh.it/raw/wocefukaje Common Crawl scrape] | ||
* [http://paste.archivingyoursh.it/raw/davuvecupu Scraped from http://mbi.mbnet.fi/mbinternet/kotisivulista/] | * [http://paste.archivingyoursh.it/raw/davuvecupu Scraped from http://mbi.mbnet.fi/mbinternet/kotisivulista/] | ||
* | * [http://paste.nerds.io/raw/rukasecesu Wayback CDX Scrape] [http://bigbird.nerds.io/webroasting/mbinternet/mbinternetcdxdomainraw.txt Raw CDX Search Data (121 MB)] | ||
* [http://paste.archivingyoursh.it/raw/pidomuxawi URLTeam scrape] | * [http://paste.archivingyoursh.it/raw/pidomuxawi URLTeam scrape] | ||
{{Navigation box}} | {{Navigation box}} |
Revision as of 05:45, 1 May 2015
MBinternet | |
URL | koti.mbnet.fi |
Status | Endangered |
Archiving status | Upcoming... |
Archiving type | Unknown |
IRC channel | #mobinternet (on hackint) |
Vital Signs
From ISP Hosting: "No longer an ISP, future of hosted sites uncertain."
Discovery
Sites follow the following 2 patterns (both contain the same content):
Items
- Google Search Scrape Users Only
- Bing scrape
- Twitter scrape
- MediaWiki scrape
- Open Directory Project scrape
- Common Crawl scrape
- Scraped from http://mbi.mbnet.fi/mbinternet/kotisivulista/
- Wayback CDX Scrape Raw CDX Search Data (121 MB)
- URLTeam scrape