User:bzc6p

From Archiveteam
Revision as of 15:15, 26 December 2024 by Bzc6p (talk | contribs) (→‎My toolbox: Update)

archiveteam.hu has launched!

Information in Hungarian about ArchiveTeam's activities and the fate of Hungarian websites!

The RSZI Hungarian web archive has launched!

Nearly 2.5 million images from 3 image hosting services, unavailable even from the Wayback Machine, are accessible again!

     Hungarian websites     
Red entries indicate websites which don't have an article on this wiki yet. Struck-through entries indicate websites that have already been shut down.
Archives & Digital Libraries mek.oszk.hu  · epa.oszk.hu  · dka.oszk.hu  · webarchivum.oszk.hu  · NAVA  · Fortepan  · fentrol.hu
Blogging Blog.hu  · Blogter  · Freeblog  · Blogger.hu  · reblog.hu  · xfree.hu  · cafeblog.hu
Social networks iWiW  · myVIP  · hotdog.hu  · Baratikor.com  · network.hu  · Mommo  · privi.hu
Webhosting Extra  · tar.hu  · ATW  · Ingyenweb  · Freeweb  · Ultraweb  · x3.hu  · ini.hu  · ininet.hu  · G-Portál  · uCoz  · eOldal  · ewk  · 5mp.eu  · mindenkilapja  · Webnode
Forums, message boards* Index  · SG  · Nők Lapja Cafe  · Hoxa
Video hosting Indavideó  · Videa  · videoplayer.hu  · xfree.hu  · videok.hu
Image hosting Kepfeltoltes.hu  · Fotoalbum.hu  · Indafotó  · Kephost.com  · pics.coldline.hu  · kep.tar.hu  · noob.hu  · PSharing (a.k.a. ivPicture)  · Kephost.hu  · kepfeltoltes.eu  · kephost.net  · kepkuldes.com  · xfree.hu  · GTF Képhost  · fotozz.hu  · Kepkezelo.com  · keptarad.hu  · darkweb.hu  · fos.hu
Questions and Answers gyakorikerdesek.hu  · tudjatok.hu
File sharing data.hu  · toldacuccot.hu  · hellshare.hu  · addat.hu  · fileposta.hu
Document sharing doksi.hu  · Docplayer
Fun Demotiváló  · keptelenseg.hu  · csubakka.hu  · funpic.hu  · nemkutya.com  · legalja.hu  · szanalmas.hu  · trollfesz.cc  · gumicsizma.hu
Trash napiszar.com  · napiszar.hu  · netszar.com  · napiszar.org
Other .hu domains seed  · News+C  · moly.hu  · gyertyalang.hu  · Volán websites  · Szuperinfó


bzc6p is a Hungarian amateur archivist who joined the efforts of ArchiveTeam, "specialized" in watching and saving Hungarian websites.

Contact: vichratimot (at) archiveteam (dot) hu

I have not been doing anything spectacular recently, but I am still running my long-term projects in my free time, of which I now have much less. You can still reach me on my talk page or via email if necessary.

See what I'm archiving.


My projects

Websites that I've archived, am archiving, or whose archival I've helped organize, in reverse chronological order within each category. If the website has an entry on this wiki, consult that page for the archives. If not, a link to the archives should be found in the appropriate line.

Large websites


Medium-sized websites

Small websites

Non-web stuff

I'm also archiving some Hungarian TV and radio programmes, magazines and shop flyers.

Archiving schedule

This is a list of my ongoing and planned projects. They are usually preemptive efforts targeting websites that are fine at the moment but seem to be approaching the end (abandonment, read-only state, operational issues, change of operator etc.), or websites that are easy to archive with an incremental approach.

Continuous

Already started

2025

  1. blogger.hu
  2. cafeblog.hu
  3. fotozz.hu

2026

  1. G-Portál
  2. Ultraweb.hu
  3. gyertyalang.hu

As needed (keeping an eye on them)

Nothing is safe! We have seen multi-terabyte websites go down immediately or with only a few months' notice!

However, the websites in this category might be difficult to archive, too large to archive, not of high historical importance, run by stable operators (rare!), or a combination of these, which keeps them out of focus.

Philosophy

My experience with my few website archiving endeavours so far suggests that very few websites today can be mirrored completely by automated means, without human control and intervention. Thus, making a quality archive even of a small website takes some attention, often additional work, or several supplementary runs of the archiving tools.

These archiving tools (wget, wpull, ArchiveBot etc.) are very important and useful, but in most cases they cannot produce complete archives on their own. My philosophy is that once we set out to archive a website, we should make the archive as complete and high-quality as possible, so we cannot rely solely on these tools. Of course, constrained by time and resources, we must make compromises. Otherwise, however, the above applies. At least for me. This is how I archive.

My toolbox

Archiving websites

  • Chfoo's wpull: No longer maintained, but it's still my favorite tool for archiving websites
    • I'm running Debian 8 (EOL 2020) in VirtualBox in 2025 just for wpull to work... 😅
  • wget: Old but gold, now also with WARC support. Very fast, but it lacks some handy features that wpull has, and the same is true the other way around.
    • Notably, it can also save POST requests to WARC, which wpull can't
    • Otherwise I use it for website discovery in my archiving scripts. I do the actual WARCing with wpull.
  • Internet Archive's warcprox: provides a proxy for your web browser, so you can easily create WARCs as you browse. Very useful for the News+C project when combined with an automated web browser.
  • Bash scripts for website discovery, as well as for collecting URLs in archiving scripts. Simple and fast.
  • Python scripts for more sophisticated tasks (rare).
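
The discovery step mentioned in the list above can be illustrated with a small Python sketch. This is not one of my actual scripts, only a minimal example under stated assumptions: the page HTML is already downloaded, only same-host http(s) links are wanted, and the names `LinkCollector` and `discover_urls` are made up for this illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urldefrag, urlsplit


class LinkCollector(HTMLParser):
    """Collect the values of href and src attributes (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if name in ("href", "src") and value:
                self.links.append(value)


def discover_urls(html, base_url):
    """Return the sorted, de-duplicated same-host http(s) URLs found in html."""
    parser = LinkCollector()
    parser.feed(html)
    host = urlsplit(base_url).netloc
    found = set()
    for link in parser.links:
        # Resolve relative links and drop the #fragment part.
        absolute, _fragment = urldefrag(urljoin(base_url, link))
        parts = urlsplit(absolute)
        if parts.scheme in ("http", "https") and parts.netloc == host:
            found.add(absolute)
    return sorted(found)


if __name__ == "__main__":
    sample = (
        '<a href="/a.html#top">A</a>'
        '<img src="img/b.png">'
        '<a href="https://other.example/">off-site</a>'
        '<a href="mailto:x@example.hu">mail</a>'
    )
    for url in discover_urls(sample, "http://example.hu/blog/"):
        print(url)
```

Writing the resulting list to a text file, one URL per line, gives something that can be handed to the actual WARC-writing tool as its input list (for example via wget's `--input-file` option).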

Replaying WARCs

  • ReplayWebPage: Very convenient, and similar to how the Wayback Machine works.

Uploading to IA

Further plans

I hope one day I can re-host Hungarian websites that are dead now but have been archived. Or, at least, create a Wayback Machine for Hungarian websites, which would also serve as a mirror of the corresponding Internet Archive items.

As for the URL Team project, given that the discovered URLs have not been saved in WARC format (yet) but in a format difficult to access and read, a shorturl-resolver service for already gone URL shorteners would be useful. It would be kind of a Wayback Machine for URL shorteners. It wouldn't even be difficult to set up, based on URL Team databases.
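
The core of such a resolver could look something like the sketch below. It is only an illustration: it assumes the mappings are available as text lines of the form `shortcode|long URL`, which may not match the actual URL Team export format, and the function names and sample data are invented for this example.

```python
from urllib.parse import urlsplit


def load_mappings(lines, sep="|"):
    """Build a shortcode -> long URL table from 'code|url' lines.

    The line format is an assumption made for this sketch.
    """
    table = {}
    for line in lines:
        line = line.rstrip("\n")
        if not line or sep not in line:
            continue  # skip blank or malformed lines
        # Split on the first separator only, so the long URL may contain it.
        code, target = line.split(sep, 1)
        table[code] = target
    return table


def resolve(table, short_url):
    """Return the archived target of short_url, or None if it is unknown."""
    code = urlsplit(short_url).path.lstrip("/")
    return table.get(code)


if __name__ == "__main__":
    # Made-up sample data, not taken from any real shortener.
    dump = [
        "abc|http://example.hu/some/long/page\n",
        "x9|http://example.hu/search?q=a|b\n",
        "\n",
    ]
    table = load_mappings(dump)
    print(resolve(table, "http://sho.rt/abc"))
    print(resolve(table, "http://sho.rt/missing"))
```

A real service would wrap `resolve` in a small web server that answers with a redirect, and would keep one table per shortener, since the same shortcode can exist on several of them.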

I would also be glad to record the programmes of Hungarian radio and television channels 24/7, but that would require a vast amount of resources. Until then, or instead of that, I'm collecting recordings of notable Hungarian TV and radio programmes and moments from YouTube (and, of course, I'm uploading them to the Archive).

Hungarian articles about Archive Team

Below I've collected the online Hungarian news articles about Archive Team that I've been able to find. The list is in reverse chronological order.