User talk:Antonizoon

From Archiveteam
Revision as of 09:54, 27 September 2025 by Bear (talk | contribs) (→‎Question regarding catbox: fixed signature)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

4chan Uploads on IA

Uh, hi, I used to be a part of the ArchiveTeam but now I just preserve things all on my own. You're a part of Bibliotheca Anonoma, right? Yeah, well, I've been helping upload some 4chan (and even 8chan) stuff on to the Internet Archive, like YouTube videos, MediaFire uploads, WARC crawls, etc. You might be interested in the "Everything You Need To Know Ever" upload I recently did here. Also excuse me for this but I also uploaded Bibliotheca Anonoma's Google Drive account here.

I'm saying all of this thinking that you may be interested that I've been indirectly helping you guys. I also re-uploaded the archive.moe /sp/ archive at one point because users reported that your upload was broken. See here. Archive Maniac 18:27, 14 October 2015 (EDT)

Awesome, thanks for inviting me. I can a little bit of regex, but I can't code. I'm an expert at searching online, though. Archive Maniac 22:58, 21 October 2015 (EDT)

Question regarding catbox

I know you're inactive as of writing, but just in case you read this, I have a question:

On Pomf.se/Clones, you wrote this:

Worryingly, Catbox also doesn't allow Wayback Machine to save posts from Catbox Blog due to robots.txt.

Did the robots.txt file specifically target the Wayback Machine similar to this?

User-agent: ia_archiver
Disallow: /

Or did it just blanket-disallow all robots like this?

User-agent: *
Disallow: /

If the former was the case, it would increase the likelihood that they manually requested an exclusion from the Wayback Machine. As of writing, there is neither a User-agent: ia_archiver nor a User-agent: * Disallow: / entry in the robots.txt file (pastebin copy because archive.today refuses to save robots.txt files for unknown reasons). Given that all Wayback Machine exclusions have been manual since the late 2010s, there would have been no need to keep the ia_archiver entry in the robots.txt file anymore. Bear (talk) 09:53, 27 September 2025 (UTC)