Dev/Source Code

From Archiveteam
< Dev
Revision as of 16:44, 4 December 2013 by Chfoo (talk | contribs) (add dockerfile repo)
Jump to navigation Jump to search

Fork me on GitHub! The warrior uses the following repos:

Client code

Client code includes code that the warrior executes.

warrior-preseed

For constructing the virtual appliance image

warrior-code2

Bootstrap code that is pulled from GitHub by the appliance

seesaw-kit

Library that helps build grab scripts and the web interface for the warrior. The name "seesaw" comes from its original behavior: download, upload, and repeat.

warrior-dockerfile

Dockerfile that runs the warrior inside a Docker container.

Projects are in separate repositories typically with the name -grab as a suffix.

Server code

Server code includes code that the Tracker executes.

universal-tracker

The server of which the Seesaw contacts

warrior-hq

The server of which the warrior appliances contact for project metadata

archiveteam-megawarc-factory

The scripts that bundles the WARC files.

URLTeam code

URLTeam code is independent from the tracker and warrior.

tinyback

The client code that scrapes the shortlinks. It includes a pipeline shim to run the code.

tinyarchive

The server code for the tracker.


Developer Documentation