From Archiveteam
Revision as of 02:11, 4 June 2015 by JesseW (talk | contribs) (more detail about mailing lists)

Script to download files:

Below is some discovery of the various parts and software packages a project can have. Random project names appear here because that is where the URL patterns were found; imagine $projectname in their place.

  • these look like browsable repos; we probably don't want to scrape them

  • we don't want feeds, so reject: patches/[0-9]+/feed\.(atom|rss)
  • attachments are in scummvm/patches/_discuss/thread/
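
A minimal sketch of how the feed rejection above could be applied to a list of discovered URLs before they are queued. The regex is the one from the note. The `should_skip` helper and the `example.invalid` URLs are hypothetical, made up purely for illustration.

```python
import re

# Pattern copied from the note above: feed URLs under a patches tracker.
FEED_REJECT = re.compile(r"patches/[0-9]+/feed\.(atom|rss)")


def should_skip(url: str) -> bool:
    """Return True if the URL looks like a patches feed we do not want."""
    return FEED_REJECT.search(url) is not None


# Hypothetical URLs, for illustration only (not real project locations).
candidates = [
    "http://example.invalid/p/scummvm/patches/123/feed.atom",
    "http://example.invalid/p/scummvm/patches/123/feed.rss",
    "http://example.invalid/p/scummvm/patches/123/",
    "http://example.invalid/p/scummvm/patches/_discuss/thread/abc/",
]

# Keep everything that is not a feed; attachment threads stay in the list.
kept = [u for u in candidates if not should_skip(u)]

for u in kept:
    print(u)
```

The pattern is unanchored, so it is matched with `search` rather than `match`; that way it applies wherever the `patches/<id>/feed.*` segment appears in the URL.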

  • simply some pages with a ? in the URL, not a directory

Donation links (which appear to just be redirects to a PayPal URL) seem to be of the form:

wiki might be hosted elsewhere

homepage might be hosted elsewhere

domains from which files are served

It's somewhat lower priority, but a download stats API seems to be documented here: