SmolNet
The SmolNet consists of content available through alternative protocols outside the web, such as gemini://, gopher://, Gopher+, gophers://, finger://, spartan://, text://, SuperText, nex://, scorpion://, mercury://, titan://, guppy://, scroll://, molerat://, terse:// and fsp://. There is a summary of the main SmolNet protocols.
At this time neither the WARC format nor the Wayback Machine (WBM) supports these protocols, so SmolNet content can neither be archived natively nor accessed from archives.
Fortunately, there are proxies to HTTP and HTML that can be used. The most prominent one is https://portal.mozz.us/, which doesn't require JavaScript but only supports the gemini://, gopher://, finger://, spartan://, text:// and nex:// protocols. There are also several gopher-only proxies, usually running Gophernicus and often redirecting to the proxied version of their corresponding gopher sites: https://gopher.tildeverse.org/, https://gopher.envs.net/ and https://gopherproxy.meulie.net/
The portal links to a few seed SmolNet sites, and some pages on the SmolNet list many more, so a lot of the SmolNet can be reached via the portal.mozz.us proxy.
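As a rough illustration of how a SmolNet URL might be turned into a proxied HTTP URL for crawling, here is a minimal Python sketch. The set of supported schemes comes from the text above. The URL layout `https://portal.mozz.us/<scheme>/<host>/<path>` is an assumption about how the proxy is addressed and should be checked against the proxy itself. The helper name `to_proxy_url` is hypothetical.

```python
from urllib.parse import urlsplit

# Schemes that portal.mozz.us is described as supporting.
SUPPORTED_SCHEMES = {"gemini", "gopher", "finger", "spartan", "text", "nex"}

# Assumption: proxied pages live at <base>/<scheme>/<host>/<path>.
PROXY_BASE = "https://portal.mozz.us"


def to_proxy_url(url, base=PROXY_BASE):
    """Return the assumed proxy URL for a SmolNet URL.

    Returns None when the scheme is not in SUPPORTED_SCHEMES or the
    URL has no host, so callers can collect such URLs separately.
    """
    parts = urlsplit(url)
    if parts.scheme not in SUPPORTED_SCHEMES or not parts.netloc:
        return None
    path = parts.path or "/"  # treat a bare host as the site root
    proxied = f"{base}/{parts.scheme}/{parts.netloc}{path}"
    if parts.query:
        proxied += "?" + parts.query
    return proxied
```

URLs for which this returns None (for example scorpion:// or guppy:// sites) would need to be tracked separately, since the proxy cannot reach them.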
Known lists of SmolNet sites include: "SmolNet Portal", "Known Gemini Capsules", "SuperTXT known_hosts" and probably others.
Unfortunately, some sites do not allow their content to be downloaded via proxies, and some sites are down, so a minority of sites just give errors. Some sites contain git commits; those are ignored in favour of Codearchiver/SWH. Since SmolNet folks are often data hoarders/packrats, there may be large archives of resources already saved via HTTP; those should be ignored when they are encountered.
ArchiveBot job 84rmt67gwgaah8r70zqtjzw84 is crawling the SmolNet (and outlinks to HTTP) via the portal.mozz.us proxy.
ArchiveBot job eab5n85lfcb4dj9wrznwihcal crawled all of finger://db.debian.org/ including all users (finger://db.debian.org/$user) and their OpenPGP keys (finger://db.debian.org/$user/key) via the portal.mozz.us proxy. Outlinks were manually extracted and the single URLs archived in job eab5n85lfcb4dj9wrznwihcal.
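For reference, a native finger request is simple: the client opens a TCP connection (port 79 by default), sends one query line terminated by CRLF, and reads until the server closes the connection. The Python sketch below shows this for finger:// URLs like the ones above. It assumes that the URL path, minus the leading slash, is sent verbatim as the query line, so `$user/key` becomes the query `$user/key`. The function names and the `someuser` placeholder are hypothetical.

```python
import socket
from urllib.parse import urlsplit, unquote


def finger_query(url):
    """Return (host, port, request_bytes) for a finger:// URL.

    Assumption: the path after the leading slash is the query line.
    """
    parts = urlsplit(url)
    if parts.scheme != "finger" or not parts.hostname:
        raise ValueError(f"not a finger URL: {url!r}")
    query = unquote(parts.path.lstrip("/"))
    # Finger queries are a single line terminated by CRLF.
    return parts.hostname, parts.port or 79, query.encode("utf-8") + b"\r\n"


def finger_fetch(url, timeout=10.0):
    """Send the query and return the raw response bytes."""
    host, port, request = finger_query(url)
    with socket.create_connection((host, port), timeout=timeout) as sock:
        sock.sendall(request)
        chunks = []
        while True:
            data = sock.recv(4096)
            if not data:  # server closed the connection
                break
            chunks.append(data)
    return b"".join(chunks)
```

Calling `finger_fetch("finger://db.debian.org/someuser/key")` would, under these assumptions, request the same resource that the proxy serves for that URL.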
These additional sites were possibly not found through the portal.mozz.us proxy AB jobs due to various issues:
gemini://abyss.cinderblock.moe/ gemini://gemini.ctrl-c.club/ scorpion://zzo38computer.org/ guppy://guppy.000090000.xyz/ finger://tilde.pink/$user gopher://aussies.space/1/~freet/ gopher://gopher.viste.fr/ gopher://gopher.linuxgalaxy.org/ gopher://occ.deadnet.se/ gopher://si3t.ch/ gopher://thunix.net/ gopher://rak.ac text://textprotocol.org/ ssh://supertxt.net/ terse://reports.frontline/aug2019austin
Should the WARC format ever add support for the SmolNet protocols, then the AB job could be useful for seeding a native recrawl of all known SmolNet sites, including those that block proxies to HTTP.
The SmolNet has the Delorean Time Machine for archiving Geminispace and there is a 2007 mirror of Gopherspace.