Difference between revisions of "ArchiveBot"
Jump to navigation
Jump to search
(Created page with "ArchiveBot is an IRC bot designed to automate the archival of smaller websites (e.g. up to a few hundred thousand URLs). You give it a URL to start at, and it grabs all conte...") |
|||
Line 1: | Line 1: | ||
ArchiveBot is an IRC bot designed to automate the archival of smaller websites (e.g. up to a few hundred thousand URLs). You give it a URL to start at, and it grabs all content under that URL, records it in a WARC, and then uploads that WARC to ArchiveTeam servers for eventual injection into the Internet Archive (or other archive sites). | ArchiveBot is an IRC bot designed to automate the archival of smaller websites (e.g. up to a few hundred thousand URLs). You give it a URL to start at, and it grabs all content under that URL, [[Wget_with_WARC_output|records it in a WARC]], and then uploads that WARC to ArchiveTeam servers for eventual injection into the Internet Archive (or other archive sites). | ||
To use ArchiveBot, drop by #archivebot on EFNet. | To use ArchiveBot, drop by #archivebot on EFNet. | ||
ArchiveBot's source code can be found at https://github.com/ArchiveTeam/ArchiveBot. Issues may be filed at https://github.com/ArchiveTeam/ArchiveBot/issues. | ArchiveBot's source code can be found at https://github.com/ArchiveTeam/ArchiveBot. Issues may be filed at https://github.com/ArchiveTeam/ArchiveBot/issues. |
Revision as of 04:41, 21 September 2013
ArchiveBot is an IRC bot designed to automate the archival of smaller websites (e.g. up to a few hundred thousand URLs). You give it a URL to start at, and it grabs all content under that URL, records it in a WARC, and then uploads that WARC to ArchiveTeam servers for eventual injection into the Internet Archive (or other archive sites).
To use ArchiveBot, drop by #archivebot on EFNet.
ArchiveBot's source code can be found at https://github.com/ArchiveTeam/ArchiveBot. Issues may be filed at https://github.com/ArchiveTeam/ArchiveBot/issues.