Difference between revisions of "MBinternet"

From Archiveteam
Jump to navigation Jump to search
m (Add google scrape and CDX search data)
Line 19: Line 19:


=== Items ===
=== Items ===
* TODO: Scrape Google Search
* [http://paste.nerds.io/raw/munafisaqo Google Search Scrape ] [http://paste.nerds.io/raw/pitebucuye Users Only]
* [http://paste.archivingyoursh.it/raw/wecexewufo Bing scrape]
* [http://paste.archivingyoursh.it/raw/wecexewufo Bing scrape]
* [http://paste.archivingyoursh.it/raw/sofowixoge Twitter scrape]
* [http://paste.archivingyoursh.it/raw/sofowixoge Twitter scrape]
Line 26: Line 26:
* [http://paste.archivingyoursh.it/raw/wocefukaje Common Crawl scrape]
* [http://paste.archivingyoursh.it/raw/wocefukaje Common Crawl scrape]
* [http://paste.archivingyoursh.it/raw/davuvecupu Scraped from http://mbi.mbnet.fi/mbinternet/kotisivulista/]
* [http://paste.archivingyoursh.it/raw/davuvecupu Scraped from http://mbi.mbnet.fi/mbinternet/kotisivulista/]
* TODO: Scrape Wayback
* [http://paste.nerds.io/raw/rukasecesu Wayback CDX Scrape] [http://bigbird.nerds.io/webroasting/mbinternet/mbinternetcdxdomainraw.txt Raw CDX Search Data (121 MB)]
* [http://paste.archivingyoursh.it/raw/pidomuxawi URLTeam scrape]
* [http://paste.archivingyoursh.it/raw/pidomuxawi URLTeam scrape]


{{Navigation box}}
{{Navigation box}}

Revision as of 05:45, 1 May 2015

MBinternet
MBinternet logo
URL koti.mbnet.fi
Status Endangered
Archiving status Upcoming...
Archiving type Unknown
IRC channel #mobinternet (on hackint)

Vital Signs

From ISP Hosting: "No longer an ISP, future of hosted sites uncertain."

Discovery

Sites follow the following 2 patterns (both contain the same content):

Items