Difference between revisions of "ArchiveBot"
(linkify IRC channel)
Revision as of 21:06, 1 January 2014
ArchiveBot is an IRC bot designed to automate the archival of smaller websites (e.g. up to a few hundred thousand URLs). You give it a URL to start at, and it grabs all content under that URL, records it in a WARC, and then uploads that WARC to ArchiveTeam servers for eventual injection into the Internet Archive (or other archive sites).
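As a rough illustration of what "all content under that URL" can mean, the sketch below checks whether a discovered link sits at or below a starting URL. It assumes simple host-and-path-prefix scoping; the function name `in_scope` and the rule itself are hypothetical and are not taken from ArchiveBot's source, whose actual scoping logic may differ.

```python
from urllib.parse import urlsplit


def in_scope(start_url: str, candidate: str) -> bool:
    """Return True if `candidate` lies at or below `start_url`.

    Hypothetical prefix-based rule for illustration only; this is not
    ArchiveBot's real scoping logic.
    """
    start = urlsplit(start_url)
    cand = urlsplit(candidate)

    # Only web URLs are considered crawlable in this sketch.
    if cand.scheme not in ("http", "https"):
        return False

    # The candidate must be on the same host as the starting URL.
    if (start.hostname or "").lower() != (cand.hostname or "").lower():
        return False

    # Treat the starting path as a directory: "/blog/index.html" -> "/blog/".
    if start.path.endswith("/"):
        base = start.path
    else:
        base = start.path.rsplit("/", 1)[0] + "/"

    # An empty candidate path means the site root.
    path = cand.path or "/"
    return path.startswith(base)
```

Under these assumptions, starting from `http://example.com/blog/` would include `http://example.com/blog/2014/post.html` but exclude `http://example.com/about` and anything on another host.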
To use ArchiveBot, drop by #archivebot on EFNet.
ArchiveBot's source code can be found at https://github.com/ArchiveTeam/ArchiveBot. Issues may be filed at https://github.com/ArchiveTeam/ArchiveBot/issues.