Qwarc

From Archiveteam
Revision as of 21:06, 29 November 2024 by OrIdow6

Qwarc is a Python framework written by JAA (JustAnotherArchivist) for quickly crawling sites and saving them to WARC.

Its source is at https://gitea.arpa.li/JustAnotherArchivist/qwarc/src/branch/0.2 (the 0.2 branch); the master branch is outdated.

Its grab scripts are stored in the meta WARCs of its uploads; https://archive.org/download/forum.canucks.com_topic_updates_202309/forum.canucks.com-updates-meta.warc.gz is one example.
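
To find a grab script in such a meta WARC, it can help to list the records the file contains first. The following is a minimal sketch using only the Python standard library, not part of qwarc itself. It assumes the meta WARC has already been downloaded locally and is a gzip-compressed WARC with a Content-Length header on every record. The function names iter_warc_records and list_records are hypothetical, and which record type holds the grab script is not stated on this page, so the sketch lists every record.

```python
import gzip
import io


def iter_warc_records(stream):
    """Yield (headers, payload) for each record in an uncompressed WARC byte stream.

    Hypothetical helper for illustration; header names are lower-cased.
    """
    while True:
        line = stream.readline()
        if not line:
            return  # end of stream
        if line in (b"\r\n", b"\n"):
            continue  # blank separator lines between records
        if not line.startswith(b"WARC/"):
            raise ValueError(f"expected a WARC version line, got {line!r}")
        headers = {}
        while True:
            line = stream.readline()
            if line in (b"\r\n", b"\n", b""):
                break  # blank line ends the header block
            name, _, value = line.decode("utf-8", "replace").partition(":")
            headers[name.strip().lower()] = value.strip()
        # Assumption: every record declares its payload size.
        payload = stream.read(int(headers["content-length"]))
        yield headers, payload


def list_records(path):
    """Return (WARC-Type, WARC-Target-URI, payload size) for each record in a .warc.gz file."""
    # gzip.open reads through concatenated gzip members transparently.
    with gzip.open(path, "rb") as stream:
        return [
            (headers.get("warc-type"), headers.get("warc-target-uri"), len(payload))
            for headers, payload in iter_warc_records(stream)
        ]
```

Calling list_records("forum.canucks.com-updates-meta.warc.gz") on a local copy of the file would return one tuple per record, which should be enough to spot the record holding the script and then print its payload with iter_warc_records.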

Currently JAA is the only one who really knows how to use it.