Difference between revisions of "Glitch"

From Archiveteam

Revision as of 03:03, 30 June 2025

Glitch is a website where users can create web applications (projects) and share them.

It supports static as well as dynamic websites through 11ty or Node.js; the latter are only online for a limited amount of time every month.

Discovery

Users and projects can be enumerated using the API. Request the following two URLs, then repeatedly follow the nextPage field of each JSON response to page through the results. Python scripts for that are available[1] [2].

https://api.glitch.com/v1/users/?limit=1000
https://api.glitch.com/v1/projects/?limit=1000
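The pagination loop described above can be sketched as follows. This is a minimal sketch, not one of the linked scripts: it assumes each page keeps its records in an "items" list and that nextPage holds either a full URL or a path relative to the API host. Neither detail is confirmed here, so check a real response before relying on it. The fetch parameter is a hypothetical hook so the loop can be exercised without network access.

```python
import json
from urllib.parse import urljoin
from urllib.request import urlopen


def fetch_json(url):
    """Fetch a URL and decode its body as JSON."""
    with urlopen(url) as response:
        return json.load(response)


def enumerate_items(start_url, fetch=fetch_json):
    """Yield every record from a paginated listing by following nextPage."""
    url = start_url
    while url:
        page = fetch(url)
        # Assumption: the records of each page live in an "items" list.
        yield from page.get("items", [])
        next_page = page.get("nextPage")
        # Assumption: nextPage is a URL or a path; urljoin resolves both
        # forms against the page that was just fetched.
        url = urljoin(url, next_page) if next_page else None
```

Calling enumerate_items("https://api.glitch.com/v1/projects/?limit=1000") would then iterate over projects one page at a time; the users URL works the same way.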

Archiving

Each project corresponds to a subdomain of glitch.me, so projects will remain accessible via their subdomains until July 8th 2025.

Each project is also a git repository, which anonymous users can usually clone from https://api.glitch.com/git/PROJECT-SLUG, where PROJECT-SLUG is the project's slug as it appears in PROJECT-SLUG.glitch.me or glitch.com/~PROJECT-SLUG. The git repository does not include uploaded "assets", databases or other data (typically stored under .data/), or the .env file, among other things. It may also be inaccessible for projects using outdated versions of Node or belonging to suspended users.
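As a rough illustration of the clone step, the sketch below builds the clone URL from a slug and shells out to git. The function names and the destination layout are hypothetical. GIT_TERMINAL_PROMPT=0 is set so that an inaccessible repository fails quickly instead of waiting for credentials.

```python
import os
import subprocess


def clone_url(slug):
    """Return the anonymous git URL for a project slug."""
    return f"https://api.glitch.com/git/{slug}"


def clone_project(slug, dest_dir):
    """Clone one project into dest_dir/slug; return True if git succeeded."""
    # Disable interactive prompts so inaccessible repositories fail fast.
    env = {**os.environ, "GIT_TERMINAL_PROMPT": "0"}
    result = subprocess.run(
        ["git", "clone", "--quiet", clone_url(slug), os.path.join(dest_dir, slug)],
        env=env,
        stdin=subprocess.DEVNULL,
    )
    return result.returncode == 0
```

A False return is worth logging rather than retrying blindly, since some repositories are expected to be unavailable.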

Projects, including their git histories, usually include a .glitch-assets file, which is a JSONL file enumerating assets uploaded to the Assets tab of that project (or a project's ancestor). The url and thumbnail properties of each object link to the CDN servers, which are usually on the domains cdn.glitch.global, cdn.glitch.me or cdn.hyperdev.com for older projects. Only the latest version of the file needs processing, because file deletions just add a new item for the uuid with deleted set to true, and the old URL is still present and still works.
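A minimal sketch of extracting CDN links from a .glitch-assets file might look like this. It relies only on the url and thumbnail properties named above, and keeps URLs from entries that were later marked deleted, since those links still work. Skipping blank or malformed lines is an assumption about how tolerant the parser needs to be.

```python
import json


def asset_urls(jsonl_text):
    """Return unique url/thumbnail values from .glitch-assets content, in file order."""
    urls = []
    seen = set()
    for line in jsonl_text.splitlines():
        line = line.strip()
        if not line:
            continue
        try:
            entry = json.loads(line)
        except json.JSONDecodeError:
            # Assumption: malformed lines are skipped rather than treated as fatal.
            continue
        if not isinstance(entry, dict):
            continue
        # Deletion markers carry no URL of their own; the original upload
        # entry is still in the file, so its links are collected as usual.
        for key in ("url", "thumbnail"):
            value = entry.get(key)
            if isinstance(value, str) and value and value not in seen:
                seen.add(value)
                urls.append(value)
    return urls
```

The returned list can be handed to whatever downloader is in use; de-duplication matters because forked projects may inherit asset entries from an ancestor.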

After the deadline, accounts will still be able to download their projects "through the end of 2025".

The https://glitch.com/@user and https://glitch.com/~project URLs use JavaScript, but might be archivable by hard-coding the behaviour of the page JS in a DPoS script. The https://project.glitch.me/ subdomains mostly, but not always, use JavaScript, and its behaviour differs from one subdomain to the next, so they are not easily archivable.

Archives