User:bzc6p
bzc6p is a Hungarian amateur archivist who joined the efforts of ArchiveTeam and "specializes" in watching and saving Hungarian websites.
Contact: vichratimot (at) archiveteam (dot) hu
I also check this wiki once a week, so you can contact me on my talk page as well.
See what I'm archiving.
My projects
Websites that I have archived, am archiving, or whose archival I have helped organize, in reverse chronological order within each category. If a website has an entry on this wiki, consult that page for the archives. If not, a link to the archives should appear on the corresponding line.
Large websites
The archive of each is in the terabyte range.
Medium-sized websites
The archive of each ranges from a few gigabytes to a few hundred gigabytes.
- cafeblog.hu
- mavcsoport.hu
- volanbusz.hu
- vadhajtasok.hu, as part of News+C/hu (continuous)
- kephost.net (continuous)
- videok.hu
- ingyenweb.hu
- kepfeltoltes.eu (continuous)
- Szuperinfó
- 444.hu, as part of News+C/hu
- hvg.hu, as part of News+C/hu (continuous)
- kuruc.info, as part of News+C/hu (continuous)
- nol.hu (archive)
- Wikispot
- PSharing
- tudjatok.hu
- nolblog.hu
- legalja.hu
- netszar.com
- Volán websites
- keptarad.hu
- kepkezelo.com
- noob.hu
- GTF Képhost
- Demotivalo.net
Non-web stuff
I'm also archiving some Hungarian TV and radio programs, magazines and shop flyers.
Archiving schedule
This is a list of my ongoing and planned projects. They are usually preemptive efforts targeting websites that are fine at the moment but seem to be approaching the end (abandonment, read-only state, operational issues, change in operator etc.), or that are easy to archive with an incremental approach.
Continuous
- kephost.net
- kepfeltoltes.eu
- hvg.hu as part of News+C
- kuruc.info as part of News+C
- vadhajtasok.hu as part of News+C
Already started
2025
2026
As needed (keeping an eye on them)
Nothing is safe! We have seen multi-terabyte websites go down immediately or with only a few months' notice!
However, the websites listed here may be difficult to archive, too large to archive, of lesser historical importance, run by stable operators (rare!), or a combination of these, which keeps them out of focus.
Restored websites
I'm hunting for Hungarian domain names whose websites have been completely archived but which are currently parked. The goal is to restore the content at its original location, thus reviving lots of dead links and providing a near-perfect browsing experience (the Wayback Machine is sometimes unable to correctly reproduce links to other pages and page requisites).
Websites successfully restored so far:
These archives are entirely self-hosted; they don't rely on the Internet Archive. Due to the way they are served, there may be some lag on occasion, but they are still usable.
archiveteam.hu
On 2021-01-01, I started a Hungarian website for ArchiveTeam, archiveteam.hu, with the most important information about ArchiveTeam in general, and archiving efforts of Hungarian websites, for Hungarian readers. (The design of the website is intentionally minimalistic. What I hate about the web these days is that it's full of bloat!) It has also been hosting some interactive services, see below.
I have various plans to make this website better, but at the moment my focus is on saving content and uploading already saved content to the Internet Archive. The archiveteam.org wiki and its pages continue to be the most comprehensive and up-to-date source of information about Hungarian websites.
RSZI
A Wayback Machine-like search tool for retrieving certain archived files (currently, images saved from image hosting websites). It was launched at the same time as archiveteam.hu itself. The motivation was that the WARCs I have archived recently (since ~2016) have not been ingested into the Wayback Machine, so after the websites went down, there was no easy way to access a given file by URL. Currently, the RSZI service provides access to approximately 2.5 million images from three image hosting websites, relying on WARC files hosted by the Internet Archive.
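How RSZI works internally is not described here. As a rough illustration only, one common way to serve a single file out of a remotely hosted WARC is to keep a small index that maps each original URL to a WARC file, a byte offset and a record length, and then fetch just that record with an HTTP Range request. The sketch below assumes such an index exists in a simple whitespace-separated text form; the index layout, the function names and the example URLs are all hypothetical and not taken from RSZI.

```python
from typing import NamedTuple, Optional


class IndexEntry(NamedTuple):
    warc_url: str  # hypothetical location of the WARC file
    offset: int    # byte offset of the record inside the WARC
    length: int    # length of the (compressed) record in bytes


def build_index(lines):
    """Parse 'url warc_url offset length' lines into a lookup table.

    The line format is an assumption made for this sketch.
    """
    index = {}
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        url, warc_url, offset, length = line.split()
        index[url] = IndexEntry(warc_url, int(offset), int(length))
    return index


def range_header(entry):
    """Value of an HTTP Range header selecting exactly one record.

    Byte ranges are inclusive, hence the -1 on the end position.
    """
    return f"bytes={entry.offset}-{entry.offset + entry.length - 1}"


def lookup(index, url) -> Optional[tuple]:
    """Return (warc_url, range_header_value) for a URL, or None if unknown."""
    entry = index.get(url)
    if entry is None:
        return None
    return entry.warc_url, range_header(entry)
```

A caller would send a GET request for the returned WARC URL with the returned Range value and then decompress and parse the single record it gets back. This only works if each record in the WARC is compressed separately, which is the usual convention for `.warc.gz` files.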
Lecsű
A short-lived (August 2021 – January 2022) on-demand semi-automated YouTube video archiving service. Internet Archive didn't like me uploading thousands of random YouTube videos, so the service got discontinued. Fortunately, now there's ArchiveTeam's own service that can be used for this purpose.
Philosophy
My experience with my few website archiving endeavours so far suggests that very few websites today can be mirrored completely in an automated way, without human control and intervention. Thus, if one wants to make a quality archive even of a small website, it needs some attention, often additional work, or several supplemental runs of archiving tools.
These archiving tools (wget, wpull, ArchiveBot etc.) are very important and useful, but in most cases they are incapable of making complete archives on their own. My philosophy is that once we set off on the journey of archiving a website, we should make the archive as complete and high-quality as possible, so we cannot rely solely on these tools. Of course, constrained by time and resources, we must make compromises. Something is better than nothing. Otherwise, however, the above applies. At least for me. This is how I archive.
My toolbox
Archiving websites
- Chfoo's wpull: No longer maintained, but it's still my favorite tool for archiving websites
- I'm running Debian 8 (EOL 2020) in VirtualBox in 2025 just for wpull to work... 😅
- wget: Old but gold, now also with WARC support. Very fast. It lacks some handy features that wpull has, but the reverse is true as well.
- Notably, it can also save POST requests to WARC, which wpull can't
- Otherwise I use it for website discovery in my archiving scripts. I do the actual WARCing with wpull.
- Internet Archive's warcprox: provides a proxy to your web browser, so you can easily create WARCs as you browse. Very useful for the News+C project combined with automating a web browser.
- Bash scripts for website discovery, as well as for collecting URLs in archiving scripts. Simple and fast.
- Python scripts for more sophisticated tasks (rare).
Replaying WARCs
- ReplayWebPage: Very convenient, and is similar to how the Wayback Machine works.
Uploading to IA
- Direct use of the Internet Archive S3 API with the curl program and custom scripts.
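For readers unfamiliar with the Internet Archive's S3-like API, the sketch below shows roughly what such an upload request looks like. It is written in Python rather than curl purely for illustration and is not my actual script. The endpoint `s3.us.archive.org`, the `LOW accesskey:secret` authorization scheme, the `x-amz-auto-make-bucket` header and the `x-archive-meta-*` metadata headers come from the Internet Archive's documentation; the function name, the item identifier and the metadata values are made up for the example.

```python
import urllib.request
from urllib.parse import quote

IA_S3_ENDPOINT = "https://s3.us.archive.org"


def build_upload_request(identifier, filename, data, access_key, secret_key,
                         metadata=None):
    """Build (but do not send) a PUT request that uploads one file to an item.

    identifier -- the item ("bucket") name on archive.org
    filename   -- the name the file will have inside the item
    data       -- the file contents as bytes
    metadata   -- optional dict, e.g. {"mediatype": "web"}; ASCII values only
                  in this sketch
    """
    url = f"{IA_S3_ENDPOINT}/{quote(identifier)}/{quote(filename)}"
    headers = {
        "Authorization": f"LOW {access_key}:{secret_key}",
        # Ask the server to create the item if it does not exist yet.
        "x-amz-auto-make-bucket": "1",
    }
    for key, value in (metadata or {}).items():
        headers[f"x-archive-meta-{key}"] = value
    return urllib.request.Request(url, data=data, headers=headers, method="PUT")
```

Sending the request is then a matter of passing it to `urllib.request.urlopen`. For multi-gigabyte WARCs a streaming upload (which is what curl's `--upload-file` does) is preferable to holding the whole file in memory as this sketch does.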
Further plans
As for the URL Team project: since the discovered URLs have not been saved in WARC format but in a format that is difficult to access and read, a short-URL resolver service for defunct URL shorteners would be useful. It would be a kind of Wayback Machine for URL shorteners. It wouldn't even be difficult to set up, based on the URL Team databases.
For the Hungarian ones, until the corresponding domain names can be acquired, this could be a new feature on archiveteam.hu. This is a future project, though.
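To show how little is needed for the core of such a resolver, here is a minimal sketch. It assumes the mappings for each shortener are available as text lines of the form `shortcode|long_url`; that layout is an assumption for this example and should be checked against the actual URL Team dumps. The function names and the shortener host `examp.le` are hypothetical.

```python
from urllib.parse import urlsplit


def load_mappings(lines):
    """Parse 'shortcode|long_url' lines into a dict.

    Only the first '|' separates the fields, so long URLs that themselves
    contain '|' are preserved. Lines without a separator are skipped.
    """
    mappings = {}
    for line in lines:
        line = line.rstrip("\n")
        if "|" not in line:
            continue
        code, long_url = line.split("|", 1)
        mappings[code] = long_url
    return mappings


def resolve(tables, short_url):
    """Look up a short URL in per-shortener tables.

    tables -- dict mapping a shortener host name to its {code: long_url} dict
    Returns the long URL, or None if the host or the code is unknown.
    """
    if "://" not in short_url:
        short_url = "http://" + short_url  # accept bare 'host/code' input
    parts = urlsplit(short_url)
    host = parts.netloc.lower()
    code = parts.path.lstrip("/")
    return tables.get(host, {}).get(code)
```

A web front end would only need to call `resolve` and answer with a redirect or a "not found" page. For the full data sets an on-disk key-value store would be more practical than in-memory dicts.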
Hungarian articles about Archive Team
Below I've collected the online Hungarian news articles about Archive Team that I've been able to find. The list is in reverse chronological order.
- I've proudly discovered that Archive Team got its own article (among Organizations) on the knowledge base of the Hungarian Internet Archive, that is, the Web Archiving Department of National Széchényi Library, the national library of Hungary! (Date: 2017-07-25).
- In 2021, I was approached by them and we started conversations that seemed likely to become fruitful (e.g. them keeping a copy of everything I have archived), but after a change in their contact person, I got ignored.
- Péter Szűcs: Az internet nem felejt (The internet doesn't forget). itcafe.hu, 2015-03-05. (About ArchiveTeam's activity in general.)
- Dániel Dojcsák: Elpusztulhat a nem profitképes online tartalom (Non-profitable online content may vanish). hwsw.hu, 2013-12-03. (Mentions ArchiveTeam saving Blip videos.)
- Mit szóltok filmletöltők? Két héttel a bezárása után ismét működik a népszerű torrentoldal (What do you say, movie leechers? Two weeks after its closure popular torrent site runs again). hvg.hu, 2013-10-30. (About IsoHunt restoration.)
- Lementik a legnagyobb torrentkeresőt (They download the biggest torrent search site). index.hu, 2013-10-21. (About saving IsoHunt.)
- Ádám Szedlák: Új otthont kaptak az őshonlapok (The ancient websites got a new home). origo.hu, 2009-11-02. (About Geocities.)
- Ádám Szedlák: Megmentik az őshonlapokat (They are saving the ancient websites). origo.hu, 2009-05-13. (About Geocities.)
- Sándor Berta: Archiválják a GeoCities-tartalmakat (They archive GeoCities' contents). sg.hu, 2009-05-04.