Discovery Data
Jump to navigation
Jump to search
Sometimes when starting a new project, we gather discovery data from discovery scripts.
Raw Data
Raw data contains unprocessed data from discovery scripts.
- https://archive.org/details/archiveteam-github-repository-index-201212
- https://archive.org/details/2013_10_09_bliptv_urls
Smaug's Secret Stash
Smaug's Secret Stash is data rsync'ed to user:chfoo's host.
- https://archive.org/details/shipwretched-items
- https://archive.org/details/fotodisco-raw-items
- https://archive.org/details/quizilladisco-raw-items
- https://archive.org/details/qwikidisco-raw-items
- https://archive.org/details/twitpicdisco-raw-items
Item Lists
Item lists are prepared lists of items, such as username or hostname, to be fed into the tracker.
- https://archive.org/details/archiveteam-xanga-userlist-20130142
- https://archive.org/details/2013-02-22-posterous-hostname-list
- https://archive.org/details/2013-02-22-posterous-hostname-list-not-posterous
- https://archive.org/details/Posterous.comHostnames
More can be found on ArchiveTeam's GitHub repository. The repositories are usually suffixed with "-items"
Datasets
Datasets are sets of data useful for research.
- https://archive.org/details/friendster-dataset-201107
- https://archive.org/details/friendster-groups-201107