Difference between revisions of "Pages Perso Orange"

From Archiveteam
Jump to navigation Jump to search
m
(it's over)
Line 2: Line 2:
| logo = Logo-orange.png
| logo = Logo-orange.png
| URL = https://pages.perso.orange.fr/
| URL = https://pages.perso.orange.fr/
| project_status = {{closing}}
| project_status = {{offline}}
| archiving_status = {{inprogress}}
| archiving_status = {{saved}}
| archiving_type = DPoS
| archiving_type = DPoS
| tracker = [https://tracker.archiveteam.org/pagespersoorange/ pagespersoorange]
| tracker = [https://tracker.archiveteam.org/pagespersoorange/ pagespersoorange]

Revision as of 14:39, 5 October 2023

Pages Perso Orange
Pages Perso Orange logo
URL https://pages.perso.orange.fr/
Status Offline
Archiving status Saved!
Archiving type DPoS
Project source pagespersoorange-grab
Project tracker pagespersoorange
IRC channel #webroasting (on hackint)

Pages Perso Orange is the ISP Hosting service of French provider Orange. It was originally to shut down on 2023-09-05[1] but got extended to 2023-10-05[2].

Site structure

Sites are identified by a slug of the form [-_.A-Za-z0-9]+.

Orange's hosting has gone through a number of rebrandings and transitions, and as such URL structure is somewhat messy.

  • <slug>.monsite-orange.fr/<path> is live. It is HTTP-only if the slug contains .; otherwise it redirects to HTTPS. The same site may once have been available at:
    • monsite-orange.fr/<slug>/<path> 301-redirects to <slug>.monsite-orange.fr/<path>.
    • <slug>.monsite.orange.fr is a landing page with a link to <slug>.monsite-orange.fr (HTTP only).
    • <slug>.monsite.orange.fr/<path> 302-redirects to a 404 page (HTTP only).
    • monsite.orange.fr/<slug> 301-redirects to <slug>.monsite.orange.fr (HTTP only).
    • monsite.orange.fr/<slug>/<path> 301-redirects to <slug>.monsite.orange.fr (HTTP only).
    • <slug>.monsite.wanadoo.fr is a landing page with a link to <slug>.monsite-orange.fr (HTTP only).
    • <slug>.monsite.wanadoo.fr/<path> 302-redirects to a 404 page (HTTP only).
    • monsite.wanadoo.fr/<slug> 301-redirects to <slug>.monsite.orange.fr (HTTP only).
    • monsite.wanadoo.fr/<slug>/<path> 301-redirects <slug>.monsite.orange.fr (HTTP only).
  • <slug>.pagesperso-orange.fr/<path> is live. It is HTTP-only if the slug contains .; otherwise it redirects to HTTPS. The same site may once have been available at:
    • pagesperso-orange.fr/<slug>/<path> 301-redirects to <slug>.pagesperso-orange.fr/<path>.
    • <slug>.perso.orange.fr is a landing page with a link to <slug>.pagesperso-orange.fr (HTTP only).
    • <slug>.perso.orange.fr/<path> 302-redirects to a 404 page (HTTP only).
    • perso.orange.fr/<slug> 301-redirects to <slug>.perso.orange.fr (HTTP only).
    • perso.orange.fr/<slug>/<path> 301-redirects to <slug>.perso.orange.fr (HTTP only).
    • <slug>.perso.wanadoo.fr is dead.
    • <slug>.perso.wanadoo.fr/<path> is dead.
    • perso.wanadoo.fr/<slug> is dead.
    • perso.wanadoo.fr/<slug>/<path> is dead.
  • <slug>.pagespro-orange.fr/<path> is live. It is HTTP-only if the slug contains .; otherwise it redirects to HTTPS. The same site may once have been available at:
    • <slug>.assoc.pagespro-orange.fr/<path> is identical to <slug>.pagespro-orange.fr/<path>.
    • <slug>.ecole.pagespro-orange.fr/<path> is identical to <slug>.pagespro-orange.fr/<path>.
    • <slug>.mairie.pagespro-orange.fr/<path> is identical to <slug>.pagespro-orange.fr/<path>.
    • pagespro-orange.fr/<slug> 301-redirects to <slug>.pagespro-orange.fr.
    • pagespro-orange.fr/<slug>/<path> 301-redirects to <slug>.pagespro-orange.fr/<path>.
    • assoc.pagespro-orange.fr/<slug> 301-redirects to <slug>.assoc.pagespro-orange.fr.
    • assoc.pagespro-orange.fr/<slug>/<path> 301-redirects to <slug>.assoc.pagespro-orange.fr/<path>.
    • ecole.pagespro-orange.fr/<slug> 301-redirects to <slug>.ecole.pagespro-orange.fr.
    • ecole.pagespro-orange.fr/<slug>/<path> 301-redirects to <slug>.ecole.pagespro-orange.fr/<path>.
    • mairie.pagespro-orange.fr/<slug> 301-redirects to <slug>.mairie.pagespro-orange.fr.
    • mairie.pagespro-orange.fr/<slug>/<path> 301-redirects to <slug>.mairie.pagespro-orange.fr/<path>.
    • <slug>.pros.orange.fr is a landing page with a link to <slug>.pagespro-orange.fr (HTTP only).
    • <slug>.pros.orange.fr/<path> 302-redirects to a 404 page (HTTP only).
    • pros.orange.fr/<slug> 301-redirects to <slug>.pros.orange.fr (HTTP only).
    • pros.orange.fr/<slug>/<path> 301-redirects to <slug>.pros.orange.fr (HTTP only).
    • <slug>.pro.orange.fr is dead.
    • <slug>.pro.orange.fr/<path> is dead.
    • pro.orange.fr/<slug> 302-redirects to a 404 page (HTTPS always).
    • pro.orange.fr/<slug>/<path> 302-redirects to a 403 page (HTTPS always).
    • <slug>.pro.wanadoo.fr is dead.
    • <slug>.pro.wanadoo.fr/<path> is dead.
    • pro.wanadoo.fr/<slug> is dead.
    • pro.wanadoo.fr/<slug>/<path> is dead.

Some assets are at monsite.woopic.com or *.cdn.woopic.com.

Archival

Sites were discovered using IA CDX search and Orange's directory.

Initial ArchiveBot jobs were not able to complete in time and were superseded by a DPoS project to circumvent rate limits. Bans took the form of connection timeouts, were approximately 24 hours long, and appeared to be triggered by high request rates for 4xx pages on non-CDN domains; once received they applied to all main and CDN domains.

References