Dev/Source Code
Jump to navigation
Jump to search
Fork me on GitHub! The warrior uses the following repos:
Client code
Client code includes code that the warrior executes.
- For constructing the virtual appliance image
- Bootstrap code that is pulled from GitHub by the appliance
- Library that helps build grab scripts and the web interface for the warrior. The name "seesaw" comes from its original behavior: download, upload, and repeat.
- Dockerfile that runs the warrior inside a Docker container.
Projects are in separate repositories typically with the name -grab
as a suffix.
Server code
Server code includes code that the Tracker executes.
- The server of which the Seesaw contacts
- The server of which the warrior appliances contact for project metadata
- The scripts that bundles the WARC files.
URLTeam code
URLTeam code is independent from the tracker and warrior.
- The client code that scrapes the shortlinks. It includes a pipeline shim to run the code.
- The server code for the tracker.