Difference between revisions of "Indafotó"

From Archiveteam
Jump to navigation Jump to search
(Archiving finished! And the website is still up.)
(Comprehensive archiving restarted and continued while possible)
Line 5: Line 5:
| URL = {{url|1=http://indafoto.hu}}
| URL = {{url|1=http://indafoto.hu}}
| project_status = {{closing}}
| project_status = {{closing}}
| archiving_status = {{saved}}
| archiving_status = {{saved}} (shallow),<br>{{inprogress}} (comprehensive)
| lead = [[user:bzc6p]]
| lead = [[user:bzc6p]]
}}
}}
Line 11: Line 11:
'''Indafotó''' is a large Hungarian image hosting service, launched in 2007, and operated by Inda-Labs, that runs quite popular services in Hungary such as [[Blog.hu]], [[Indavideó]], [[Index Fórum]], and is also related to the #1 Hungarian website [[index.hu]].
'''Indafotó''' is a large Hungarian image hosting service, launched in 2007, and operated by Inda-Labs, that runs quite popular services in Hungary such as [[Blog.hu]], [[Indavideó]], [[Index Fórum]], and is also related to the #1 Hungarian website [[index.hu]].


Late February 2025 they announced that they would shut the service down and delete 13.6 million photos on {{datetime|2025-03-31}}.
Late February 2025 they announced that they would shut the service down and delete 13.6 million photos on {{datetime|2025-03-31}}. As of {{datetime|2025-04-02}}, the website is still operational.


== Site reconnaissance ==
== Site reconnaissance ==
Line 55: Line 55:
Time permitting, he used only one thread, this way also minimalized the risk of being banned. The script went after all pages of a user profile, downloaded all image size variants that might be referenced in embed links, and even downloaded unlisted images if any.
Time permitting, he used only one thread, this way also minimalized the risk of being banned. The script went after all pages of a user profile, downloaded all image size variants that might be referenced in embed links, and even downloaded unlisted images if any.


Starting with users with fewer images, he was able to archive ~96% of the users and ~33% of the image corpus by the time of the announcement of the shutdown. This almost concludes users having less than 1,000 public images.
Starting with users with fewer images, he was able to archive ~96% of the users and ~33% of the image corpus by the time of the announcement of the shutdown. This concludes users having less than 1,000 public images.


=== Phase 3: Shallow archiving to get most possible images ===
=== Phase 3: Shallow archiving to get most possible images ===


The website serves the user profile/image pages relatively slowly, but serves the images themselves fairly quickly. Considering this and the short deadline, starting {{datetime|2025-03-02}}, for users with more than 1,000 public images, the archiving switched to a more shallow approach, which targets saving only the following:
The website serves the user profile/image pages relatively slowly, but serves the images themselves fairly quickly. Considering this and the short deadline, starting {{datetime|2025-03-02}}, for users with more than 1,000 public images, the archiving switched to a more shallow approach, which targeted saving only the following:
* main image listing pages
* main image listing pages
* album pages
* album pages
Line 65: Line 65:
* the highest resolution version of all these images
* the highest resolution version of all these images


This way the most meaningful content of the user – the images themselves – gets saved, also it'll be possible to simply browse through the user's images when playing back the WARC. However, since images are not simply embedded in HTML in image browsing pages, it'll need some tricks or detailed guidance to view full-size images on playback, but this is something to think about later. The goal is to save the most possible content.
This way the most meaningful content of the user – the images themselves – got saved, also it'll be possible to simply browse through the user's images when playing back the WARC. However, since images are not simply embedded in HTML in image browsing pages, it'll need some tricks or detailed guidance to view full-size images on playback, but this is something to think about later. The goal was to save the most possible content.


The reamining ~200 users from the "less than 1,000 images" set, however, continue to be archived in entirety.
By the announced shutdown date, all users with more than 1,000 images (those not covered in Phase 2) were saved with this shallow approach, so technially all (public) images got saved.


=== Progress ===
=== Phase 4: Continued comprehensive archival until the actual end ===


* As of 2025-03-02, approximately 96.8% of the users and 33.7% of the images have been downloaded. The rate of progress is expected to significantly rise with the multiplied efforts and the introduction of the aforementioned shallow approach.
With the website still up on 2025-04-02, the comprehensive archiving scripts have been restarted for the 1,000+ image users, to complement the existing archives with all user pages, as well as with all size variants of images, so that whole user accounts are browsable and all links to embedded images work.
* As of 2025-03-08, approximately 98.3% of the users and 45.5% of the images have been downloaded. With this rate, a 91% coverage would be expected, but a further increase in speed is coming shortly, so it seems possible to download all images by the deadline.
* As of 2025-03-15, approximately 98.9% of the users and 62.4% of the images have been downloaded. With this rate, it is expected that all images get saved just in time.
* As of 2025-03-22, approximately 99.4% of the users and 77.2% of the images have been downloaded. With this rate, it is ''likely'' that all images get saved in time.
* As of 2025-03-29, approximately 99.7% of the users and 92.1% of the images have been downloaded. This means that ''almost'' all images will be saved by the shutdown.


'''At 2025-04-01 01:50 CEST, the planned archival of the last user completed, so the archiving effort finished successfully.''' In the coming days, statistics are being made about the exact number of users and images saved.
This effort is continued until the website gets actually switched off.


=== Archives ===
=== Archives ===

Revision as of 17:33, 2 April 2025

Indafotó is a large Hungarian image hosting service, launched in 2007, and operated by Inda-Labs, that runs quite popular services in Hungary such as Blog.hu, Indavideó, Index Fórum, and is also related to the #1 Hungarian website index.hu.

Late February 2025 they announced that they would shut the service down and delete 13.6 million photos on 2025-03-31. As of 2025-04-02, the website is still operational.

Site reconnaissance

As of February 2023, according to the search page, it is hosting more than 8,900,000 photos, but according to the image ids, it is, or has hosted more than 27 million images (it might be that not all images are public).

Most of the images seem to be photos of good quality, with little sexually explicit content.

Through the search function, even the oldest pictures are available, dating back to 2007.

58,482 users belong to the above mentioned images (i.e. that many users have at least one image uploaded); some users have tens of thousands of images uploaded.

Shutdown

Sometime in February 2025, a notice appeared on the top of image pages, reading:

Original Hungarian Translated English
Szolgáltatásunk hamarosan megszűnik, már csak korlátozott funkcionalitással érhető el. Részletekkel hamarosan emailben és itt a felületen jelentkezünk! Our service is soon to be discontinued, now it is available only with limited functionality. We will follow up with details via email and on this interface shortly.
Detailed shutdown notice

In the last days of February, they specified the closure date in 2025-03-31 and gave the exact number of images to be deleted: 13,641,979. Logged-in users are able to see a more detailed notice (depicted on the right), which also states that uploading of images is not possible since 2025-02-01, and that users are able to download their uploaded images, browsable in folders representing the albums, however, metadata such as description and comments of the images are not included.

As of 2025-04-01 07:00 CEST, the website is still operational.

Archiving

Phase 1: Planning and design

user:bzc6p first conceived the idea of archiving Indafotó in 2020, with the following manifesto:

With a strong business and technological background, one would think the website is not in danger. However, user:bzc6p considers the time has arrived for archiving the content, for the following reasons:
– Technical issues (broken search not showing images since April 2019, no https on the main page etc.)
– Seems to be abandoned by staff
– The world has changed a lot in 13 years, and while back then blog.hu and index.hu themselves seem to have used Indafotó to store images, they don't any more
– There is some turmoil around the flagship service index.hu, this might have further consequences
– Image hosting services are incremental. That is, already uploaded images can be continuously saved – one day they will have to be, anyway.

Phase 2: Slow but steady comprehensive archiving

User:bzc6p discovered 58,482 users and their corresponding 8,914,482 images, the archival of which he started in June 2023.

Time permitting, he used only one thread, this way also minimalized the risk of being banned. The script went after all pages of a user profile, downloaded all image size variants that might be referenced in embed links, and even downloaded unlisted images if any.

Starting with users with fewer images, he was able to archive ~96% of the users and ~33% of the image corpus by the time of the announcement of the shutdown. This concludes users having less than 1,000 public images.

Phase 3: Shallow archiving to get most possible images

The website serves the user profile/image pages relatively slowly, but serves the images themselves fairly quickly. Considering this and the short deadline, starting 2025-03-02, for users with more than 1,000 public images, the archiving switched to a more shallow approach, which targeted saving only the following:

  • main image listing pages
  • album pages
  • preview (medium) size images displayed on these pages
  • the highest resolution version of all these images

This way the most meaningful content of the user – the images themselves – got saved, also it'll be possible to simply browse through the user's images when playing back the WARC. However, since images are not simply embedded in HTML in image browsing pages, it'll need some tricks or detailed guidance to view full-size images on playback, but this is something to think about later. The goal was to save the most possible content.

By the announced shutdown date, all users with more than 1,000 images (those not covered in Phase 2) were saved with this shallow approach, so technially all (public) images got saved.

Phase 4: Continued comprehensive archival until the actual end

With the website still up on 2025-04-02, the comprehensive archiving scripts have been restarted for the 1,000+ image users, to complement the existing archives with all user pages, as well as with all size variants of images, so that whole user accounts are browsable and all links to embedded images work.

This effort is continued until the website gets actually switched off.

Archives

Uploading of archived user content to the following items has been started: https://archive.org/details/@bzc6p?query=indafoto_hu_users_u1k. Items get populated with WARCs slowly (as IA upload speed permits...) throughout 2025.



Notable websites operated by Indamedia Group
blog.hu  · Indavideó  · Indafotó  · Index Fórum  · Index.hu
     Hungarian websites     
Red entries indicate websites which don't have an article on this wiki yet. Striked-through entries indicate websites that have already been shut down.
Archives & Digital Libraries mek.oszk.hu  · epa.oszk.hu  · dka.oszk.hu  · webarchivum.oszk.hu  · NAVA  · Fortepan  · fentrol.hu
Blogging Blog.hu  · Blogter  · Freeblog  · Blogger.hu  · reblog.hu  · xfree.hu  · cafeblog.hu
Social networks iWiW  · myVIP  · hotdog.hu  · Baratikor.com  · network.hu  · Mommo  · privi.hu
Webhosting Extra  · tar.hu  · ATW  · Ingyenweb  · Freeweb  · Ultraweb  · x3.hu  · ini.hu  · ininet.hu  · G-Portál  · uCoz  · eOldal  · ewk  · 5mp.eu  · mindenkilapja  · Webnode
Forums, message boards* Index  · SG  · Nők Lapja Cafe  · Hoxa
Video hosting Indavideó  · Videa  · videoplayer.hu  · xfree.hu  · videok.hu
Image hosting Kepfeltoltes.hu  · Fotoalbum.hu  · Indafotó  · Kephost.com  · pics.coldline.hu  · kep.tar.hu  · noob.hu  · PSharing (a.k.a. ivPicture)  · Kephost.hu  · kepfeltoltes.eu  · kephost.net  · kepkuldes.com  · xfree.hu  · GTF Képhost  · fotozz.hu  · Kepkezelo.com  · keptarad.hu  · darkweb.hu  · fos.hu
Questions and Answers gyakorikerdesek.hu  · tudjatok.hu
File sharing data.hu  · toldacuccot.hu  · hellshare.hu  · addat.hu  · fileposta.hu
Document sharing doksi.hu  · Docplayer
Fun Demotiváló  · keptelenseg.hu  · csubakka.hu  · funpic.hu  · nemkutya.com  · legalja.hu  · szanalmas.hu  · trollfesz.cc  · gumicsizma.hu
Trash napiszar.com  · napiszar.hu  · netszar.com  · napiszar.org
Other .hu domains seed  · News+C  · moly.hu  · gyertyalang.hu  · Volán websites  · Szuperinfó