From Archiveteam
Jump to navigation Jump to search

BerliOS seems to still be online (?) its_notjack 06:50, 14 November 2015 (EST)

Preparation work, 2015

Script to download files:

And some discovery of the various parts and software packages projects can have, random project names are used here where the url patterns were found, imagine $projectname instead.

Issue trackers can have multiple, arbitrary names:

  • The admin interface lets you pick make arbitrary numbers of issue trackers (called "tickets" in the interface) with arbitrary names. We'll need to find them by parsing the summary page ( )

  • This appears to be a standard instance of phpBB (which we hopefully know how to archive?)

  • this looks like browsable repos; we probably don't want to scrape these

  • we don't want feeds, so reject: patches/[0-9]+/feed\.(atom|rss)
  • attachments are in scummvm/patches/_discuss/thread/

  • simply some pages with ? , not a directory

Donation links (which appear to just be redirects to a PayPal URL, seem to be of the form):

wiki might be hosted elsewhere

homepage might be hosted elsewhere

domains from which files are served

It's somewhat lower priority, but a download stats API seems to be documented here:

Thoughts, 2022

It's still desirable to archive SourceForge. --Random (talk) 11:50, 27 May 2022 (UTC)