Jira
Jira is a bug tracking system developed by Atlassian. They previously offered a self-hosted version (Jira Server), but have ended sales on Feb 15, 2023[1] and support will fully end on Feb 15, 2024,[2] leaving the only options as their cloud version[3] or the comically expensive data center version.[4]
Strategy
An ArchiveBot !a command will not properly discover all issues.
The approach I've used is to find the highest-numbered ticket in each project (use the "all issues" filter and order by "key"), and then generated a text file listing all of the ticket URLs, which then can be ran (by an op) as an !a < list job. However, this will not save issue attachments due to the no-parent rule, so in addition the job's database needs to be manually saved and the relevant links need to be extracted from the database, and can be run as a separate job.
Attachment mime types do get detected as a relative URL, and will need to be ignored, along the lines of https?://{primary_netloc}/browse/(application|text|image|video)/
. Not all attachments are found like this due to a wpull quirk, so this is insufficient for getting a list of attachments.[5] Additionally, attachment sorting is not useful to save, so [?&]attachment(Order|ViewMode|SortBy)=
can be ignored.
Issue comments are loaded via JS and are not properly saved. However, all comments are included when exporting an issue in "printable" form, as well as in the XML and DOC export formats. As such, these export links are worth saving (though they either need to be generated as a list or extracted from the job's database). https://example.com/si/jira.issueviews:issue-html/PROJ-12345/PROJ-12345.html
https://transfer.archivete.am/qMgJa/jira_url_list.py[IA•Wcite•.today•MemWeb] generates a list of URLs, also using the Jira API[IA•Wcite•.today•MemWeb] to determine the maximum issue for each project. https://transfer.archivete.am/6owk9/jira_process_db.py[IA•Wcite•.today•MemWeb] extracts attachment URLs from databases. (Offsite links aren't done yet.)
A few sites have attachments that redirected to a login page, e.g. https://hub.spigotmc.org/jira/secure/attachment/10005/daf45dfd-da9b-498b-96e4-5782bbac11c7.dat from https://hub.spigotmc.org/jira/browse/SPIGOT-31. These are simply broken, and logging in results in a 404 page for those attachments. The attachment zip for those issues also gives a 500.
https://transfer.archivete.am/6rcal/jira_url_list_delux.py[IA•Wcite•.today•MemWeb] was used to generate lists of tickets from the API, as well as the API calls themselves, for two sites, though in both cases the script needed to be modified and using 1000 entries/request does not work well (100 entries/request seems much more stable). This was needed due to large numbers of nonexistent tickets due to either importing from another issue tracker or large amounts of deleted spam.
Status
Site | Status | Archive details | ||||||
---|---|---|---|---|---|---|---|---|
Tool | URL list | Job | Date | Size | Objects | Notes | ||
https://bugs.mojang.com/ | Saved! | ArchiveBot (!a < list) | bugs.mojang.com_issue_seed_urls.txt (AB · WB) | 9k71c | 2023-09-29 | 10 warcs, DB | includes offsite links, does not include XML/HTML/DOC | |
Saved! | ArchiveBot (!ao < list) | bugs.mojang.com_missed_binary_attachments.txt (AB · WB) | elu1d | 2023-10-04 | 188 warcs | |||
Saved! | ArchiveBot (!ao < list) | bugs.mojang.com_missed_text_attachments.txt (AB · WB) | 474e9 | 2023-10-04 | 1 warc | split so that the text attachment WARC could be downloaded separately for later analysis | ||
Saved! | ArchiveBot (!ao < list) | bugs.mojang.com_missed_urls_other_than_attachments.txt (AB · WB) | e7bzh | 2023-10-04 | 3 warcs | XML/HTML/DOC versions (included in the seed list for later jobs) as well as project descriptions (not included) | ||
https://hub.spigotmc.org/jira/ | Saved! | ArchiveBot (!a < list) | hub.spigotmc.org_jira_seed_urls.txt (AB · WB) | 2nxku | 2023-10-04 | 1 warc, DB | includes offsite links, does not include XML/HTML/DOC | |
Saved! | ArchiveBot (!ao < list) | https://transfer.archivete.am/10RVuY/hub.spigotmc.org_attachments.txt (AB · WB) | 7gnzz | 2024-02-14 | 1 warc | attachments | ||
Saved! | ArchiveBot (!a < list) | hub.spigotmc.org_html_word_xml.txt (AB · WB) | 2kvqe | 2024-02-15 | 1 warc | XML/HTML/DOC; offsite links | ||
https://issues.asterisk.org/ | Lost/Saved by itself | ArchiveBot (!ao) | aehw8 | 2023-12-12 | 1 warc | redirect to GitHub, but see below | ||
https://issues-archive.asterisk.org/ | Saved! | ArchiveBot (!a) | 97syu | 2023-12-12 | 1 warc | static conversion, includes attachments | ||
https://bugreports.qt.io/ | Saved! | ArchiveBot (!a < list) | bugreports.qt.io_urls.txt (AB · WB) | 33ea8 | 2024-02-05 | 24 warcs, DB | includes offsite links and XML/HTML/DOC | |
Saved! | ArchiveBot (!ao < list) | bugreports.qt.io_attachments.txt (AB · WB) | 31az9 | 2024-02-14 | 6 warcs | attachments | ||
https://jira.lfnetworking.org/ | Saved! | ArchiveBot (!a) | 52ily | 2024-02-06 | 1 warc | no public issues | ||
https://jira.hyperledger.org/ | Lost | 41xj9 | 2024-02-06 | DNS resolution failed | ||||
https://issues.freepbx.org/ | Lost | a81hn | 2024-02-06 | all URLs gave 403s | ||||
https://jira.anuket.io/ | Saved! | ArchiveBot (!a < list) | jira.anuket.io.txt (AB · WB) | b9azb | 2024-02-06 | 1 warc, DB | includes offsite links and XML/HTML/DOC | |
Saved! | ArchiveBot (!ao < list) | https://transfer.archivete.am/qMHpW/jira.anuket.io_attachments.txt (AB · WB) | cabvh | 2024-02-14 | 1 warc | attachments | ||
https://jira.automotivelinux.org/ | Saved! | ArchiveBot (!a < list) | jira.automotivelinux.org.txt (AB · WB) | abx2j | 2024-02-06 | 5 warcs, DB | includes offsite links and XML/HTML/DOC | |
Saved! | ArchiveBot (!ao < list) | jira.automotivelinux.org_attachments.txt (AB · WB) | 7k9u1 | 2024-02-14 | 1 warc | attachments | ||
https://jira.ecmwf.int/ | Saved! | ArchiveBot (!a < list) | jira.ecmwf.int.txt (AB · WB) | 75yja | 2024-02-06 | 1 warc, DB | includes offsite links and XML/HTML/DOC | |
Saved! | ArchiveBot (!ao < list) | jira.ecmwf.int_attachments.txt (AB · WB) | 9nbhj | 2024-02-14 | 1 warc | attachments | ||
https://jira.fd.io/ | Saved! | ArchiveBot (!a < list) | jira.fd.io.txt (AB · WB) | 97uf1 | 2024-02-06 | 1 warc, DB | includes offsite links and XML/HTML/DOC | |
Saved! | ArchiveBot (!ao < list) | jira.fd.io_attachments.txt (AB · WB) | axp2q | 2024-02-14 | 1 warc | attachments; nearly all are broken redirects to login pages | ||
https://jira.linuxfoundation.org/ | Saved! | ArchiveBot (!a < list) | jira.linuxfoundation.org.txt (AB · WB) | 8e44u | 2024-02-06 | 7 warcs, DB | includes offsite links and XML/HTML/DOC | |
Saved! | ArchiveBot (!ao < list) | jira.linuxfoundation.org_attachments.txt (AB · WB) | a3m3h | 2024-02-14 | 1 warc | attachments | ||
https://ugene.dev/tracker/ | Saved! | ArchiveBot (!a < list) | ugene.dev.txt (AB · WB) | 17ibq | 2024-02-06 | 1 warc, DB | includes offsite links and XML/HTML/DOC | |
Saved! | ArchiveBot (!ao < list) | ugene.dev_attachments.txt (AB · WB) | 8v7s5 | 2024-02-14 | 1 warc | attachments | ||
https://jira.opendaylight.org/ | Saved! | ArchiveBot (!a < list) | jira.opendaylight.org.txt (AB · WB) | 9yr1a | 2024-02-07 | 2 warcs, DB | includes offsite links and XML/HTML/DOC | |
Saved! | ArchiveBot (!ao < list) | jira.opendaylight.org_attachments.txt (AB · WB) | 5cy2j | 2024-02-14 | 1 warc | attachments | ||
https://jira.onap.org/ | Saved! | ArchiveBot (!a < list) | jira.onap.org.txt (AB · WB) | dzx0h | 2024-02-07 | 4 warcs, DB | includes offsite links and XML/HTML/DOC | |
Saved! | ArchiveBot (!ao < list) | jira.onap.org_attachments.txt (AB · WB) | nvr4i | 2024-02-14 | 1 warc | attachments | ||
https://tracker.moodle.org/ | Saved! | ArchiveBot (!a < list) | tracker.moodle.org.txt (AB · WB) | 88pks | 2024-02-07 | 2 warcs, DB | includes XML/HTML/DOC | |
Saved! | ArchiveBot (!ao < list) | tracker.moodle.org_attachments.txt (AB · WB) | 50aaf | 2024-02-14 | 9 warcs | attachments | ||
Not saved yet | offsite links | |||||||
https://jira.mongodb.org/ | Saved! | ArchiveBot (!a < list) | jira.mongodb.org.txt (AB · WB) | 59izp | 2024-02-07 | 2 warcs, DB | includes XML/HTML/DOC | |
Saved! | ArchiveBot (!ao < list) | jira.mongodb.org_attachments.txt (AB · WB) | bekcz | 2024-02-14 | 15 warcs | attachments | ||
Not saved yet | offsite links | |||||||
https://jira.atlassian.com/secure/Dashboard.jspa | Saved! | ArchiveBot (!a < list) | jira.atlassian.com.txt (AB · WB) | 86p16 | 2024-02-07 | 7 warcs, DB | includes XML/HTML/DOC | |
Saved! | ArchiveBot (!ao < list) | jira.atlassian.com_attachments.txt (AB · WB) | e8gqt | 19 warcs | attachments | |||
Not saved yet | offsite links | |||||||
https://issues.redhat.com/ | Saved! | ArchiveBot (!a < list) | issues.redhat.com.txt (AB · WB) | 358sy | 2024-02-07 | 7 warcs, DB | includes XML/HTML/DOC | |
Saved! | ArchiveBot (!ao < list) | https://transfer.archivete.am/15MfDf/issues.redhat.com_attachments.txt (AB · WB) | 25wjr | 2024-02-14 | 24 warcs | attachments | ||
Not saved yet | offsite links | |||||||
https://jira.mariadb.org/ | Saved! | ArchiveBot (!a < list) | jira.mariadb.org.txt (AB · WB) | da3z7 | 2024-02-08 | 8 warcs, DB | includes offsite links and XML/HTML/DOC | |
Saved! | ArchiveBot (!ao < list) | jira.mariadb.org_attachments.txt (AB · WB) | 50n4e | 2024-02-14 | 3 warcs | attachments | ||
https://issues.apache.org/jira/browse/ | Partially saved | ArchiveBot (!a < list) | issues.apache.org.txt (AB · WB) | ebggh | 2024-02-07 | 1 warc, DB | lots of deleted/skipped issues due to spam, and the site banned us due to loading lots of those (I think) | |
Saved! | ArchiveBot (!ao < list) | https://transfer.archivete.am/6KG1A/issues.apache.org_redo_search_urls.txt (AB · WB) | 6zi69 | 2024-02-12 | 1 warc | search API URLs from script | ||
Partially saved | ArchiveBot (!a < list) | https://transfer.archivete.am/4EHyS/issues.apache.org_redo_ticket_urls.txt (AB · WB) | 5ztk8 | 2024-02-12 | 1 warc, DB | ticket and XML/HTML/DOC from script | ||
In progress... | ArchiveBot (!a < list) | https://transfer.archivete.am/gQlNh/issues.apache.org_redo_ticket_urls_continue_browse_urls.txt (AB · WB) | ayckm | 2024-02-15 | ticket URLs not completed by the job above, no offsite links or XML/HTML/DOC | |||
Not saved yet | attachments | |||||||
Not saved yet | XML/HTML/DOC | |||||||
Not saved yet | offsite links | |||||||
https://bugs.openjdk.org/ | Partially saved | ArchiveBot (!ao < list) | bugs.openjdk.org_search_urls.txt (AB · WB) | cgwcr | 2024-02-12 | 1 warc | search API urls from script, some failed due to size | |
Saved! | ArchiveBot (!ao < list) | bugs.openjdk.org_search_urls_cleanup_maxResults_100.txt (AB · WB) | 6rq26 | 2024-02-13 | 1 warc | failing URLs from above converted to use 100 results (done via issue list below) | ||
Partially saved | ArchiveBot (!a < list) | bugs.openjdk.org_issue_urls.txt (AB · WB) | 8aaum | 2024-02-12 | 1 warc, DB | lots of skipped issues presumably due to import from another tracker; script; banned partway through; includes XML/HTML/DOC | ||
In progress... | ArchiveBot (!a < list) | bugs.openjdk.org_issue_urls_continue_browse_urls.txt (AB · WB) | 1olv7 | 2024-02-15 | ticket URLs not completed by the job above, no offsite links or XML/HTML/DOC | |||
Not saved yet | attachments | |||||||
Not saved yet | XML/HTML/DOC | |||||||
Not saved yet | offsite links | |||||||
More... | mostly saved, to be entered Soon™ |
References
- ↑ https://www.atlassian.com/migration/assess/journey-to-cloud[IA•Wcite•.today•MemWeb]
- ↑ https://www.atlassian.com/migration[IA•Wcite•.today•MemWeb]
- ↑ https://www.atlassian.com/software/jira/pricing?tab=cloud
- ↑ https://www.atlassian.com/software/jira/pricing?tab=data-center
- ↑ https://hackint.logs.kiska.pw/archiveteam-bs/20240206#c408954[IA•Wcite•.today•MemWeb]