From Archiveteam
Revision as of 11:50, 14 November 2015 by Its notjack (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

BerliOS seems to still be online (?) its_notjack 06:50, 14 November 2015 (EST)

Script to download files:

And some discovery of the various parts and software packages projects can have, random project names are used here where the url patterns were found, imagine $projectname instead.

Issue trackers can have multiple, arbitrary names:

  • The admin interface lets you pick make arbitrary numbers of issue trackers (called "tickets" in the interface) with arbitrary names. We'll need to find them by parsing the summary page ( )

  • This appears to be a standard instance of phpBB (which we hopefully know how to archive?)

  • this looks like browsable repos; we probably don't want to scrape these

  • we don't want feeds, so reject: patches/[0-9]+/feed\.(atom|rss)
  • attachments are in scummvm/patches/_discuss/thread/

  • simply some pages with ? , not a directory

Donation links (which appear to just be redirects to a PayPal URL, seem to be of the form):

wiki might be hosted elsewhere

homepage might be hosted elsewhere

domains from which files are served

It's somewhat lower priority, but a download stats API seems to be documented here: