Difference between revisions of "User:Bzc6p"

From Archiveteam
Jump to navigation Jump to search
m
(what about my dreams)
Line 1: Line 1:
=== What I'm trying to look smart with ===
== What I'm trying to look smart with ==
::''That era of the web is far behind us when a single'' <code>wget -r -p</code> ''command could mirror a website in its entirety. Nowadays each and every website has its own soul, its own hideous Javascript-linked content, not to mention the various file formats and ways of embedding content. Thus, if one is serious about web archiving, each and every website must be discovered carefully, often painstakingly, which is in too few cases possible in any automated ways.''
::''That era of the web is far behind us when a single'' <code>wget -r -p</code> ''command could mirror a website in its entirety. Nowadays each and every website has its own soul, its own hideous Javascript-linked content, not to mention the various file formats and ways of embedding content. Thus, if one is serious about web archiving, each and every website must be discovered carefully, often painstakingly, which is in too few cases possible in any automated ways.''


=== What I probably shouldn't have archived ===
== What I probably shouldn't have archived ==
<center><div style="text-align:center; width:90%; border: 2px solid red; font-size:120%;">
<center><div style="text-align:center; width:90%; border: 2px solid red; font-size:120%;">
<font color="red">'''Általam feltöltött tartalom eltávolításával kapcsolatos kéréseket a <code>vichra</code><code>timot</code><code>@euro</code><code>mail.</code><code>hu</code> címre kell küldeni.'''</font>
<font color="red">'''Általam feltöltött tartalom eltávolításával kapcsolatos kéréseket a <code>vichra</code><code>timot</code><code>@euro</code><code>mail.</code><code>hu</code> címre kell küldeni.'''</font>
Line 9: Line 9:
</div></center>
</div></center>


=== Who I am ===
== Who I am ==
[http://en.wikipedia.org/wiki/Hungary Hungarian] amateur who joined the efforts of ArchiveTeam. "Specialized" in watching and saving Hungarian websites.
[http://en.wikipedia.org/wiki/Hungary Hungarian] amateur who joined the efforts of ArchiveTeam. "Specialized" in watching and saving Hungarian websites.


=== What I've done ===
== What I've done ==
[[File:Keep_calm_and_hate_javascript.png|thumb|This user is on a tin can connected to a windmill,<ref>http://archive.fart.website/bin/irclogger_log/archiveteam?date=2015-07-05,Sun&sel=163#l159</ref> likes simplicity, likes archiving websites, therefore '''hates Javascript''' being used to just show components of websites.]]
[[File:Keep_calm_and_hate_javascript.png|thumb|This user is on a tin can connected to a windmill,<ref>http://archive.fart.website/bin/irclogger_log/archiveteam?date=2015-07-05,Sun&sel=163#l159</ref> likes simplicity, likes archiving websites, therefore '''hates Javascript''' being used to just show components of websites.]]
*What I've saved or am saving:
*What I've saved or am saving:
Line 27: Line 27:
I sometimes also take part in [[Warrior]] projects, in those rare cases when the tracker limit is not already saturated.
I sometimes also take part in [[Warrior]] projects, in those rare cases when the tracker limit is not already saturated.


=== What I recommend ===
== What I recommend ==
*Chfoo's [http://github.com/chfoo/wpull Wpull] for saving to [[WARC]]
*Chfoo's [http://github.com/chfoo/wpull Wpull] for saving to [[WARC]]
* Ikreymer's [http://webrecorder.io webrecorder.io] when things are too difficult
* Ikreymer's [http://webrecorder.io webrecorder.io] when things are too difficult
Line 36: Line 36:
**No, I tried but didn't like the [https://pypi.python.org/pypi/internetarchive internetarchive] tool.
**No, I tried but didn't like the [https://pypi.python.org/pypi/internetarchive internetarchive] tool.


=== What I've found in Hungarian about ArchiveTeam ===
== My dreams ==
=== Short term (will maybe realized one day) ===
I'd like to create a small Hungarian Internet Archive, collecting and presenting archives of websites saved by me or others, just like the [[Internet Archive]] and the Wayback Machine does. (Don't think of large scale, just starting with a few second-hand 1 TB hard disks connected to a home server.) Also, I'd restore died websites on their original location (domain) if possible, so rotten links could resurrect.
 
This could also act as a mirror of some websites uploaded to IA.
 
=== Long term (isn't likely to be realized ever) ===
Recording main Hungarian radio and television channels' complete program (0–24), and also give the public restricted access to these archives. (There is [http://nava.hu/what-is-nava/ NAVA], but that doesn't record everything, and is often a bit difficult to access.)
 
== What I've found in Hungarian about ArchiveTeam ==
*Sándor Berta: ''[http://sg.hu/cikkek/67175/archivaljak-a-geocities-tartalmakat Archiválják a GeoCities-tartalmakat]'' (''They archive GeoCities' contents''). sg.hu, 2009-05-04.
*Sándor Berta: ''[http://sg.hu/cikkek/67175/archivaljak-a-geocities-tartalmakat Archiválják a GeoCities-tartalmakat]'' (''They archive GeoCities' contents''). sg.hu, 2009-05-04.
*Ádám Szedlák: ''[http://www.origo.hu/techbazis/internet/20090513-geocities-freeweb-archivalokra-varnak-az-ingyenes-tarhelyek.html Megmentik az őshonlapokat]'' (''They are saving the ancient websites''). origo.hu, 2009-05-13. (About [[Geocities]].)
*Ádám Szedlák: ''[http://www.origo.hu/techbazis/internet/20090513-geocities-freeweb-archivalokra-varnak-az-ingyenes-tarhelyek.html Megmentik az őshonlapokat]'' (''They are saving the ancient websites''). origo.hu, 2009-05-13. (About [[Geocities]].)
Line 45: Line 54:
*Péter Szűcs: ''[http://itcafe.hu/hir/az_internet_nem_felejt.html Az internet nem felejt]'' (''The internet doesn't forget''). itcafe.hu, 2015-03-05. (About ArchiveTeam's activity in general.)
*Péter Szűcs: ''[http://itcafe.hu/hir/az_internet_nem_felejt.html Az internet nem felejt]'' (''The internet doesn't forget''). itcafe.hu, 2015-03-05. (About ArchiveTeam's activity in general.)


=== References ===
== References ==
<references/>
<references/>


{{DISPLAYTITLE:User&#58;bzc6p}}__NOTOC__
{{DISPLAYTITLE:User&#58;bzc6p}}

Revision as of 18:11, 11 September 2015

What I'm trying to look smart with

That era of the web is far behind us when a single wget -r -p command could mirror a website in its entirety. Nowadays each and every website has its own soul, its own hideous Javascript-linked content, not to mention the various file formats and ways of embedding content. Thus, if one is serious about web archiving, each and every website must be discovered carefully, often painstakingly, which is in too few cases possible in any automated ways.

What I probably shouldn't have archived

Általam feltöltött tartalom eltávolításával kapcsolatos kéréseket a vichratimot@euromail.hu címre kell küldeni.

Requests for removal of content uploaded by me should be sent to vichratimot@euromail.hu.

Who I am

Hungarian amateur who joined the efforts of ArchiveTeam. "Specialized" in watching and saving Hungarian websites.

What I've done

This user is on a tin can connected to a windmill,[1] likes simplicity, likes archiving websites, therefore hates Javascript being used to just show components of websites.

Also saved a few wikis in the beginning.

I sometimes also take part in Warrior projects, in those rare cases when the tracker limit is not already saturated.

What I recommend

  • Chfoo's Wpull for saving to WARC
  • Ikreymer's webrecorder.io when things are too difficult
    • wget still lacks some handy features wpull already has got
    • No, I don't prefer ArchiveBot as most websites can't be saved automatically, also one can't really fine-tune a specific ArchiveBot job. Can be useful and powerful, but it's still quite dull under development. Use with caution.
  • Alard's warc-proxy or Ikreymer's webarchiveplayer for testing WARCs
  • Kngenie's ias3upload for uploading to IA

My dreams

Short term (will maybe realized one day)

I'd like to create a small Hungarian Internet Archive, collecting and presenting archives of websites saved by me or others, just like the Internet Archive and the Wayback Machine does. (Don't think of large scale, just starting with a few second-hand 1 TB hard disks connected to a home server.) Also, I'd restore died websites on their original location (domain) if possible, so rotten links could resurrect.

This could also act as a mirror of some websites uploaded to IA.

Long term (isn't likely to be realized ever)

Recording main Hungarian radio and television channels' complete program (0–24), and also give the public restricted access to these archives. (There is NAVA, but that doesn't record everything, and is often a bit difficult to access.)

What I've found in Hungarian about ArchiveTeam

References