Blog.pl

From Archiveteam

Blog.pl was one of the first and largest blog platforms in Poland, established in 2001 as an independent project and later acquired by the mainstream web company Onet.pl in 2006. The site was set to be closed on 2018-01-31 or 2018-02-28 (see below). Users were able to export the content and images of their blogs until 2018-02-28[1].

Blog.pl is not to be confused with Blox.pl.

blog.pl
URL: blog.pl
Status: Offline
Archiving status: Partially saved
Archiving type: Unknown
IRC channel: #archiveteam-bs (on hackint), formerly #blog.pls (on EFnet)
Project lead: User:JustAnotherArchivist

Shutdown

Originally, the shutdown was planned for 2018-01-31. Until 2018-01-24, the homepage included this notice:

Informujemy, że 31 stycznia 2018 platforma blog.pl zostanie zamknięta.

[We inform you that on 31 January 2018, the blog.pl platform will be closed.]

On 2018-01-24 or 2018-01-25, in response to user requests, Onet announced that the blogs would remain exportable until the end of February. The announcement replaced the above notice with the following text and was also added to an FAQ entry[1].

W związku z Waszymi prośbami zdecydowaliśmy się wydłużyć możliwość archiwizacji blogów do 28.02.2018 r

[In response to your requests, we have decided to extend the option of archiving blogs until 28.02.2018]

As of 2018-02-06, all blogs were still online, even though the relevant FAQ entry on their website still mentioned 2018-01-31 as the shutdown date of the blog subdomains[2].

On 2018-03-01, blog.pl started redirecting to Onet.pl.

Site structure

Every blog has its own subdomain under a number of domains, including blog.pl, blog.onet.pl, piszecomysle.pl, and crazylife.pl. There is a catalogue of the blogs registered in the system. The catalogue can be refined through categories and tags. However, it does not appear to be possible to discover all blogs through the catalogue as each query is limited to 100 pages of results.
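
The subdomain layout described above can be illustrated with a small sketch that splits a blog URL into its blog name and platform domain. This is only an illustration: `KNOWN_DOMAINS` contains just the four domains named here (the real list was longer), `split_blog_host` is a hypothetical helper name, and treating `www` as a non-blog subdomain is an assumption.

```python
from urllib.parse import urlsplit

# Only the domains named in the text; the platform used more than these.
KNOWN_DOMAINS = ("blog.pl", "blog.onet.pl", "piszecomysle.pl", "crazylife.pl")


def split_blog_host(url):
    """Return (blog_name, platform_domain), or None if the URL is not a blog subdomain.

    The URL must include a scheme (e.g. "http://"), otherwise urlsplit
    does not recognise the hostname.
    """
    host = (urlsplit(url).hostname or "").lower()
    # Try longer domains first so the most specific suffix wins.
    for domain in sorted(KNOWN_DOMAINS, key=len, reverse=True):
        suffix = "." + domain
        if host.endswith(suffix):
            name = host[: -len(suffix)]
            # Assumption: "www" is the platform itself, not a user blog.
            if name and name != "www":
                return name, domain
    return None
```

For instance, `split_blog_host("http://example.blog.onet.pl/")` yields the blog name `example` on `blog.onet.pl`, while the bare `http://blog.pl/` homepage is not treated as a blog.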

The platform is based on WordPress.

Archival efforts

Due to extremely strict rate limiting (exceeding two requests per second even once results in a persistent IP ban, which can only be lifted by solving a reCAPTCHA) and because warrior projects cannot currently be run, Blog.pl cannot be archived properly. As explained above, it is also not easily possible to discover all blogs on the platform. Nevertheless, there was an attempt at archiving at least the homepage of each blog; roughly 100k of these were retrieved successfully.
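
To illustrate what staying under such a limit involves, here is a minimal sketch of a client-side throttle that enforces a minimum gap between consecutive requests. It is not the tooling that was actually used for the archiving attempt. The `Throttle` class and the one-second default interval are assumptions, chosen to stay well below the two-requests-per-second threshold mentioned above.

```python
import time


class Throttle:
    """Enforce a minimum interval between consecutive calls to wait().

    Hypothetical helper: min_interval=1.0 (one request per second) is an
    assumed safety margin, not a documented limit of the site.
    """

    def __init__(self, min_interval=1.0, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval
        self._clock = clock    # injectable for testing
        self._sleep = sleep    # injectable for testing
        self._last = None      # time of the previous request, if any

    def wait(self):
        """Block until at least min_interval has passed since the last call."""
        now = self._clock()
        if self._last is not None:
            remaining = self.min_interval - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last = now
```

A crawler would call `throttle.wait()` immediately before every request. Since the ban is triggered by a single burst, the throttle has to cover all requests leaving the same IP address, including retries and embedded resources; this sketch only covers a single process.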

References