Difference between revisions of "Kepfeltoltes.hu"

From Archiveteam
Jump to navigation Jump to search
m (→‎Data deletion: they definitely do delete)
(Lesson learned: Google doesn't show everything.)
Line 5: Line 5:
| URL = {{url|1=http://kepfeltoltes.hu}}
| URL = {{url|1=http://kepfeltoltes.hu}}
| project_status = {{online}}
| project_status = {{online}}
| archiving_status = {{Blue|Continuous}}
| archiving_status = Some saved continuously (see below)
}}
}}


Line 38: Line 38:
From 2014-11-13, [[user:bzc6p]] exports a daily index of uploaded pictures. (The last day's image URLs can be parsed from the list of last uploaded images, see above.) Later, after at least 30 days, he downloads the actual images and uploads them to the [[Internet Archive]]. (The 30-days is some kind of "deletion-by-user" window, should the uploader get their pictures be deleted soon after uploading. Not many users may do that, though. However, bzc6p respects this will. And it's not likely that kepfeltoltes.hu deletes images so quickly.)
From 2014-11-13, [[user:bzc6p]] exports a daily index of uploaded pictures. (The last day's image URLs can be parsed from the list of last uploaded images, see above.) Later, after at least 30 days, he downloads the actual images and uploads them to the [[Internet Archive]]. (The 30-days is some kind of "deletion-by-user" window, should the uploader get their pictures be deleted soon after uploading. Not many users may do that, though. However, bzc6p respects this will. And it's not likely that kepfeltoltes.hu deletes images so quickly.)


A Google discovery is also being made for images uploaded before the indexing started. A small amount of older images can be saved that way.
=== Older pictures ===
One can't find pictures with the <code>site:kepfeltoltes.hu</code> Google search term. One must search for kepfeltoltes links inserted into websites, and copy-paste the links from there, or, write a script that parses them out automatically.
 
Problem is that Google doesn't show all the results. Not at all. It seems that it leaves out some big websites that have a ton of these links. However, if search is done only for those sites (with <code>site:</code>), they are found.
 
So, besides the general Google search, directed searching must be done for major websites. This has been done for the following sites:
 
* http://audikklub.hu
* http://gyakorikerdesek.hu (This one shows links only in questions, so Google finds only them. This is a new thing, and may be reverted in the future.)
 
More to come.


== Archives ==
== Archives ==

Revision as of 11:01, 25 December 2014

Kepfeltoltes.hu
Kepfeltoltes.hu logo
Kepfeltoltes screenshot.png
URL http://kepfeltoltes.hu[IAWcite.todayMemWeb]
Status Online!
Archiving status Some saved continuously (see below)
Archiving type Unknown
IRC channel #archiveteam-bs (on hackint)

Kepfeltoltes.hu is probably the most popular one-click image sharing service in Hungary. Since its start in 2004, it has hosted millions of images. However, the maintainers delete old pictures after an uncertain amount of time.

Site structure

Users can select files (up to 20) from their computers and upload them very easily. Then they get the URLs of their pictures, which they can insert to forums, chats or anywhere. Uploaders can decide whether their pictures shall be public, that is, appear on the list of pictures recently uploaded by users (http://kepfeltoltes.hu/public.php?ok=ok&btn_tovabb=Tov%E1bb; NSFW alert!).

Image page URLs are all unique, they are like the following: http://kepfeltoltes.hu/view/YYMMDD/FILENAME_www.kepfeltoltes.hu_.EXT, where YYMMDD is the date in this format, FILENAME is the name of the original file, EXT is the file extension (jpg/jpeg, png and gif files are accepted). The actual image URLs are like http://kepfeltoltes.hu/YYMMDD/FILENAME_www.kepfeltoltes.hu_.EXT, but those are, of course, linked from the image pages. What is not linked is the thumbnail version: http://kepfeltoltes.hu/thumb/YYMMDD/FILENAME_www.kepfeltoltes.hu_.EXT

The way the URL is generated prevents others to find the images not made public. Thus, in the following, we talk about public pictures only.

The site lists the last 6000 uploaded public pictures on http://kepfeltoltes.hu/public.php?ok=ok&btn_tovabb=Tov%E1bb. There is no index for older pictures, and, as seen above, they can't be efficiently discovered with brute force.

Content

Although some percent of the image corpus is NSFW content, most of the files are screenshots and photos of objects and people.

Data growth

The number of public pictures grows by 3 to 5 thousand a day; more are uploaded at weekends than at weekdays.

Data deletion

There is no certain information about the image deletion mechanism used by the maintainers. Some people say that pictures may be deleted after some months, however, some testing shows that several years old pictures are still up. It is also said that staff removes users' uploaded pictures on request (uploader can optionally enter their email address when uploading). What is for sure that all images uploaded before July 2010 seem to be gone (as of December 2014).

So it's unknown when and what, but they regularly delete some older pictures to free up space.

Archiving

From 2014-11-13, user:bzc6p exports a daily index of uploaded pictures. (The last day's image URLs can be parsed from the list of last uploaded images, see above.) Later, after at least 30 days, he downloads the actual images and uploads them to the Internet Archive. (The 30-days is some kind of "deletion-by-user" window, should the uploader get their pictures be deleted soon after uploading. Not many users may do that, though. However, bzc6p respects this will. And it's not likely that kepfeltoltes.hu deletes images so quickly.)

Older pictures

One can't find pictures with the site:kepfeltoltes.hu Google search term. One must search for kepfeltoltes links inserted into websites, and copy-paste the links from there, or, write a script that parses them out automatically.

Problem is that Google doesn't show all the results. Not at all. It seems that it leaves out some big websites that have a ton of these links. However, if search is done only for those sites (with site:), they are found.

So, besides the general Google search, directed searching must be done for major websites. This has been done for the following sites:

More to come.

Archives

Coming soon.


     Hungarian websites     
Red entries indicate websites which don't have an article on this wiki yet. Striked-through entries indicate websites that have already been shut down.
Archives & Digital Libraries mek.oszk.hu  · epa.oszk.hu  · dka.oszk.hu  · webarchivum.oszk.hu  · NAVA  · Fortepan  · fentrol.hu
Blogging Blog.hu  · Blogter  · Freeblog  · Blogger.hu  · reblog.hu  · xfree.hu  · cafeblog.hu
Social networks iWiW  · myVIP  · hotdog.hu  · Baratikor.com  · network.hu  · Mommo  · privi.hu
Webhosting Extra  · tar.hu  · ATW  · Ingyenweb  · Freeweb  · Ultraweb  · x3.hu  · ini.hu  · ininet.hu  · G-Portál  · uCoz  · eOldal  · ewk  · 5mp.eu  · mindenkilapja  · Webnode
Forums, message boards* Index  · SG  · Nők Lapja Cafe  · Hoxa
Video hosting Indavideó  · Videa  · videoplayer.hu  · xfree.hu  · videok.hu
Image hosting Kepfeltoltes.hu  · Fotoalbum.hu  · Indafotó  · Kephost.com  · pics.coldline.hu  · kep.tar.hu  · noob.hu  · PSharing (a.k.a. ivPicture)  · Kephost.hu  · kepfeltoltes.eu  · kephost.net  · kepkuldes.com  · xfree.hu  · GTF Képhost  · fotozz.hu  · Kepkezelo.com  · keptarad.hu  · darkweb.hu  · fos.hu
Questions and Answers gyakorikerdesek.hu  · tudjatok.hu
File sharing data.hu  · toldacuccot.hu  · hellshare.hu  · addat.hu  · fileposta.hu
Document sharing doksi.hu  · Docplayer
Fun Demotiváló  · keptelenseg.hu  · csubakka.hu  · nemkutya.com  · legalja.hu  · szanalmas.hu  · trollfesz.cc  · gumicsizma.hu
Trash napiszar.com  · napiszar.hu  · netszar.com  · napiszar.org
Other News+C  · moly.hu  · gyertyalang.hu  · Volán websites  · Szuperinfó