Pages Perso Orange
Pages Perso Orange | |
URL | https://pages.perso.orange.fr/ |
Status | Offline |
Archiving status | Saved! (in 2023) |
Archiving type | DPoS |
Project source | pagespersoorange-grab |
Project tracker | pagespersoorange |
IRC channel | #webroasting (on hackint) |
Data[how to use] | archiveteam_pagespersoorange |
Pages Perso Orange is the ISP Hosting service of French provider Orange. It was originally to shut down on 2023-09-05[1] but got extended to 2023-10-05[2].
Site structure
Sites are identified by a slug of the form [-_.A-Za-z0-9]+
.
Orange's hosting has gone through a number of rebrandings and transitions, and as such URL structure is somewhat messy.
<slug>.monsite-orange.fr/<path>
is live. It is HTTP-only if the slug contains.
; otherwise it redirects to HTTPS. The same site may once have been available at:monsite-orange.fr/<slug>/<path>
301-redirects to<slug>.monsite-orange.fr/<path>
.<slug>.monsite.orange.fr
is a landing page with a link to<slug>.monsite-orange.fr
(HTTP only).<slug>.monsite.orange.fr/<path>
302-redirects to a 404 page (HTTP only).monsite.orange.fr/<slug>
301-redirects to<slug>.monsite.orange.fr
(HTTP only).monsite.orange.fr/<slug>/<path>
301-redirects to<slug>.monsite.orange.fr
(HTTP only).<slug>.monsite.wanadoo.fr
is a landing page with a link to<slug>.monsite-orange.fr
(HTTP only).<slug>.monsite.wanadoo.fr/<path>
302-redirects to a 404 page (HTTP only).monsite.wanadoo.fr/<slug>
301-redirects to<slug>.monsite.orange.fr
(HTTP only).monsite.wanadoo.fr/<slug>/<path>
301-redirects<slug>.monsite.orange.fr
(HTTP only).
<slug>.pagesperso-orange.fr/<path>
is live. It is HTTP-only if the slug contains.
; otherwise it redirects to HTTPS. The same site may once have been available at:pagesperso-orange.fr/<slug>/<path>
301-redirects to<slug>.pagesperso-orange.fr/<path>
.<slug>.perso.orange.fr
is a landing page with a link to<slug>.pagesperso-orange.fr
(HTTP only).<slug>.perso.orange.fr/<path>
302-redirects to a 404 page (HTTP only).perso.orange.fr/<slug>
301-redirects to<slug>.perso.orange.fr
(HTTP only).perso.orange.fr/<slug>/<path>
301-redirects to<slug>.perso.orange.fr
(HTTP only).<slug>.perso.wanadoo.fr
is dead.<slug>.perso.wanadoo.fr/<path>
is dead.perso.wanadoo.fr/<slug>
is dead.perso.wanadoo.fr/<slug>/<path>
is dead.
<slug>.pagespro-orange.fr/<path>
is live. It is HTTP-only if the slug contains.
; otherwise it redirects to HTTPS. The same site may once have been available at:<slug>.assoc.pagespro-orange.fr/<path>
is identical to<slug>.pagespro-orange.fr/<path>
.<slug>.ecole.pagespro-orange.fr/<path>
is identical to<slug>.pagespro-orange.fr/<path>
.<slug>.mairie.pagespro-orange.fr/<path>
is identical to<slug>.pagespro-orange.fr/<path>
.pagespro-orange.fr/<slug>
301-redirects to<slug>.pagespro-orange.fr
.pagespro-orange.fr/<slug>/<path>
301-redirects to<slug>.pagespro-orange.fr/<path>
.assoc.pagespro-orange.fr/<slug>
301-redirects to<slug>.assoc.pagespro-orange.fr
.assoc.pagespro-orange.fr/<slug>/<path>
301-redirects to<slug>.assoc.pagespro-orange.fr/<path>
.ecole.pagespro-orange.fr/<slug>
301-redirects to<slug>.ecole.pagespro-orange.fr
.ecole.pagespro-orange.fr/<slug>/<path>
301-redirects to<slug>.ecole.pagespro-orange.fr/<path>
.mairie.pagespro-orange.fr/<slug>
301-redirects to<slug>.mairie.pagespro-orange.fr
.mairie.pagespro-orange.fr/<slug>/<path>
301-redirects to<slug>.mairie.pagespro-orange.fr/<path>
.<slug>.pros.orange.fr
is a landing page with a link to<slug>.pagespro-orange.fr
(HTTP only).<slug>.pros.orange.fr/<path>
302-redirects to a 404 page (HTTP only).pros.orange.fr/<slug>
301-redirects to<slug>.pros.orange.fr
(HTTP only).pros.orange.fr/<slug>/<path>
301-redirects to<slug>.pros.orange.fr
(HTTP only).<slug>.pro.orange.fr
is dead.<slug>.pro.orange.fr/<path>
is dead.pro.orange.fr/<slug>
302-redirects to a 404 page (HTTPS always).pro.orange.fr/<slug>/<path>
302-redirects to a 403 page (HTTPS always).<slug>.pro.wanadoo.fr
is dead.<slug>.pro.wanadoo.fr/<path>
is dead.pro.wanadoo.fr/<slug>
is dead.pro.wanadoo.fr/<slug>/<path>
is dead.
Some assets are at monsite.woopic.com
or *.cdn.woopic.com
.
Archival
Sites were discovered using IA CDX search and Orange's directory.
Initial ArchiveBot jobs were not able to complete in time and were superseded by a DPoS project to circumvent rate limits. Bans took the form of connection timeouts, were approximately 24 hours long, and appeared to be triggered by high request rates for 4xx pages on non-CDN domains; once received they applied to all main and CDN domains.