Difference between revisions of "SmolNet"

From Archiveteam
Jump to navigation Jump to search
(found a second gopher proxy)
(a gemini site not in the observatory known hosts)
Line 10: Line 10:


<pre>
<pre>
gemini://abyss.cinderblock.moe/
scorpion://zzo38computer.org/specification.txt
scorpion://zzo38computer.org/specification.txt
guppy://guppy.000090000.xyz
guppy://guppy.000090000.xyz

Revision as of 02:15, 7 August 2024

The SmolNet consists of content available through alternative protocols outside the web such as gemini:// gopher:// gophers:// finger:// spartan:// text:// nex:// scorpion:// mercury:// titan:// guppy:// scroll://.

At this time the WARC format does not support these protocols and the WBM does not support them, so the SmolNet is not archivable nor can archives be accessed.

Fortunately there are proxies to HTTP and HTML that can be used. The most prominent one is https://portal.mozz.us/ and it doesn't require JavaScript, but it doesn't support the scorpion:// mercury:// titan:// guppy:// scroll:// protocols. Several proxies that are only for gopher and redirect to the proxied version of their corresponding gopher sites are https://gopher.tildeverse.org/ https://gopher.envs.net/

The portal links to a few seed SmolNet sites and there are some pages on the SmolNet listing more SmolNet sites: Known Gemini Caspules

ArchiveBot job 84rmt67gwgaah8r70zqtjzw84 is crawling most of the SmolNet (and outlinks to HTTP) via the portal.mozz.us proxy. Unfortunately some sites do not allow their content to be downloaded via proxies, and some sites are of course down, so a minority of sites just give errors. Some sites contain git commits, those are ignored in favour of Codearchiver/SWH. Since SmolNet folks are often data hoarders/packrats, there may be large archives of resources already saved via HTTP, those should be ignored when they are encountered. These additional sites are either not available or were possibly not found through the portal.mozz.us proxy AB job:

gemini://abyss.cinderblock.moe/
scorpion://zzo38computer.org/specification.txt
guppy://guppy.000090000.xyz
finger://tilde.pink/$user
finger://db.debian.org/$user
finger://db.debian.org/$user/key
gopher://aussies.space/1/~freet/
gopher://gopher.viste.fr/
gopher://gopher.linuxgalaxy.org/
gopher://occ.deadnet.se/
gopher://si3t.ch/
gopher://thunix.net/

Should the WARC format ever add support for the SmolNet protocols, then the AB job could be useful for seeding a native recrawl of all known SmolNet sites, including those that block proxies to HTTP.

The SmolNet has the Delorean Time Machine for archiving Geminispace and there is a 2007 mirror of Gopherspace.