Difference between revisions of "Saunalahti Iso G"

From Archiveteam
m (Adding reddit, Bing and DMOZ scrapes)
(Mass-edit to update uses of Template:IRC)
 
(21 intermediate revisions by 6 users not shown)
Line 5: Line 5:
| logo = Saunalahti-Logo.png
| logo = Saunalahti-Logo.png
| project_status = {{closing}}
| project_status = {{closing}}
| archiving_status = {{upcoming}}
| archiving_status = {{saved}}
| source = [https://github.com/ArchiveTeam/iso-g-items iso-g-items].
| irc = isohno
| irc = isohno
| irc_network = EFnet
| irc_abandoned = true
}}
}}


Shutting down on an unspecified date.
Finnish ISP hosting shutting down on an unspecified date. Downloaded by ArchiveBot.


== Discovery ==
== Discovery ==
Sites follow three patterns:
Sites follow several patterns:
* http://www.saunalahti.fi/~USERNAME/  
* http://www.saunalahti.fi/*****/ (sequential, no padding)
* http://USERNAME.pp.fi  
* http://www.saunalahti.fi/~*****/ (sequential, no padding)
* http://www.saunalahti.fi/voas****/ (sequential, 4 characters padded with zeros)
* http://www.saunalahti.fi/~voas****/ (sequential, 4 characters padded with zeros)
* http://www.saunalahti.fi/nl*****/ (sequential, 5 characters padded with zeros)
* http://www.saunalahti.fi/~nl*****/ (sequential, 5 characters padded with zeros)
* http://www.saunalahti.fi/USERNAME/
* http://www.saunalahti.fi/~USERNAME/
* http://USERNAME.pp.fi
* http://www.USERNAME.pp.fi
* http://www.USERNAME.pp.fi
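
The sequential patterns listed above can be enumerated mechanically. The following is a minimal Python sketch, not the tooling actually used for this project. The helper name <code>candidate_urls</code>, the reading of the starred characters as decimal digits, and the number ranges in the usage note are all assumptions made for illustration; the real ID ranges are not stated on this page.

```python
BASE = "http://www.saunalahti.fi"  # host taken from the patterns listed above


def candidate_urls(prefix, width, numbers, tilde):
    """Yield candidate site URLs for one sequential pattern.

    prefix  -- literal text before the number, e.g. "voas", "nl", or "" (hypothetical parameter)
    width   -- zero-padding width; 0 means no padding
    numbers -- iterable of integers to try (assumed range, not known from the source)
    tilde   -- whether to emit the "~" form of the path
    """
    lead = "~" if tilde else ""
    for n in numbers:
        # zfill left-pads with zeros up to `width`; zfill(0) leaves the number as-is.
        yield f"{BASE}/{lead}{prefix}{str(n).zfill(width)}/"
```

For example, <code>list(candidate_urls("voas", 4, range(1, 3), False))</code> yields <code>http://www.saunalahti.fi/voas0001/</code> and <code>http://www.saunalahti.fi/voas0002/</code>. Passing <code>tilde=True</code> produces the <code>~</code> variants, and <code>prefix=""</code> with <code>width=0</code> covers the unpadded sequential pattern. USERNAME-based sites cannot be enumerated this way and have to come from scrapes such as those under Items.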


=== Items ===
=== Items ===
* Google scrape ([http://paste.nerds.io/raw/erapemokuq pp.fi Google Scrape] [http://paste.nerds.io/raw/ituyerawap saunalahti.fi Google Scrape])
* Google scrape ([http://paste.nerds.io/raw/erapemokuq pp.fi], [http://paste.nerds.io/raw/ituyerawap saunalahti.fi])
* Scrape Bing ([http://paste.nerds.io/raw/jizicigolo Bing Scrape])
* [http://paste.nerds.io/raw/jizicigolo Bing scrape]
* TODO: Scrape DuckDuckGo
* [http://paste.nerds.io/raw/rajozikewo DuckDuckGo scrape]
* TODO: Scrape Twitter
* [http://paste.archivingyoursh.it/raw/qoroceqali Twitter scrape]
* Scrape Reddit ([http://paste.nerds.io/raw/umopojuvox reddit /domain/ search])
* [http://paste.nerds.io/raw/umopojuvox Reddit scrape]
* TODO: Scrape links from MediaWiki wikis
* [http://paste.archivingyoursh.it/raw/witupuxutu MediaWiki scrape]
* Scrape the Open Directory Project ([http://paste.nerds.io/raw/mevevoripi DMOZ domain search])
* [http://paste.nerds.io/raw/mevevoripi Open Directory Project scrape]
* TODO: Scrape the Common Crawl Index
* [http://paste.archivingyoursh.it/raw/wiqalafima Common Crawl scrape]
* TODO: Scrape the Wayback Machine
* Scrape the Wayback Machine [https://github.com/chpwssn/saunalahti-iso-g/tree/master/discovery/wayback Wayback cdx scrape results]
* TODO: Scrape URLTeam dumps
* [http://paste.archivingyoursh.it/raw/soxixugaja URLTeam scrape]
* [http://paste.archivingyoursh.it/raw/ridubeqeto Start's list of sequential sites]
 
Combined list of results from Chip's scrapes [http://paste.nerds.io/raw/ucoxoronap here].
 
== Archives ==
 
Browse the [http://web.archive.org Wayback Machine].
 
Or, if you're looking for the WARC files, start [http://archive.fart.website/archivebot/viewer/?q=pp.fi here] or [http://archive.fart.website/archivebot/viewer/?q=saunalahti.fi here].


{{Navigation box}}
{{Navigation box}}
[[Category:ISP hosting]]

Latest revision as of 18:55, 31 October 2021

Saunalahti Iso G
Saunalahti Iso G logo
URL pp.fi, saunalahti.fi
Status Closing
Archiving status Saved!
Archiving type Unknown
Project source iso-g-items.
IRC channel #archiveteam-bs (on hackint)
(formerly #isohno (on EFnet))

Finnish ISP hosting shutting down on an unspecified date. Downloaded by ArchiveBot.

Discovery

Sites follow several patterns:

Items

Combined list of results from Chip's scrapes here.

Archives

Browse the Wayback Machine.

Or, if you're looking for the WARC files, start here or here.