Difference between revisions of "ArchiveBot"

From Archiveteam
Jump to navigation Jump to search
(Created page with "ArchiveBot is an IRC bot designed to automate the archival of smaller websites (e.g. up to a few hundred thousand URLs). You give it a URL to start at, and it grabs all conte...")
 
Line 1: Line 1:
ArchiveBot is an IRC bot designed to automate the archival of smaller websites (e.g. up to a few hundred thousand URLs).  You give it a URL to start at, and it grabs all content under that URL, records it in a WARC, and then uploads that WARC to ArchiveTeam servers for eventual injection into the Internet Archive (or other archive sites).
ArchiveBot is an IRC bot designed to automate the archival of smaller websites (e.g. up to a few hundred thousand URLs).  You give it a URL to start at, and it grabs all content under that URL, [[Wget_with_WARC_output|records it in a WARC]], and then uploads that WARC to ArchiveTeam servers for eventual injection into the Internet Archive (or other archive sites).


To use ArchiveBot, drop by #archivebot on EFNet.
To use ArchiveBot, drop by #archivebot on EFNet.


ArchiveBot's source code can be found at https://github.com/ArchiveTeam/ArchiveBot.  Issues may be filed at https://github.com/ArchiveTeam/ArchiveBot/issues.
ArchiveBot's source code can be found at https://github.com/ArchiveTeam/ArchiveBot.  Issues may be filed at https://github.com/ArchiveTeam/ArchiveBot/issues.

Revision as of 04:41, 21 September 2013

ArchiveBot is an IRC bot designed to automate the archival of smaller websites (e.g. up to a few hundred thousand URLs). You give it a URL to start at, and it grabs all content under that URL, records it in a WARC, and then uploads that WARC to ArchiveTeam servers for eventual injection into the Internet Archive (or other archive sites).

To use ArchiveBot, drop by #archivebot on EFNet.

ArchiveBot's source code can be found at https://github.com/ArchiveTeam/ArchiveBot. Issues may be filed at https://github.com/ArchiveTeam/ArchiveBot/issues.