Difference between revisions of "PDF 2016"

From Archiveteam


In March 2016, user '''davidar''' informed ArchiveTeam on IRC that he had obtained a list of hundreds of millions of links to PDF files from around the Web.<ref>http://archive.fart.website/bin/irclogger_log/archiveteam?date=2016-03-03,Thu&sel=67#l63</ref><ref>http://archive.fart.website/bin/irclogger_log/archiveteam?date=2016-03-09,Wed&sel=134#l130</ref><ref>http://archive.fart.website/bin/irclogger_log/archiveteam?date=2016-03-13,Sun&sel=133#l129</ref> ArchiveTeam decided to make a [[Warrior]] project for downloading these files.
Most of the files are open-access scientific documents. Besides being uploaded to the [[Internet Archive]], the files will also be hosted and indexed by CiteSeerX (http://citeseerx.ist.psu.edu/).


To follow or take part in the project discussion, join the {{IRC|pdflush}} channel on EFNet.

Revision as of 12:55, 13 March 2016

PDF 2016
Status: Online!
Archiving status: Upcoming...
Archiving type: Unknown
IRC channel: #pdflush (on hackint)

PDF (Portable Document Format) is a file format used to present documents in a manner independent of application software, hardware, and operating systems.[1]

PDF 2016 is the codename for an ArchiveTeam project to save a large number of PDF files.

In March 2016, user davidar informed ArchiveTeam on IRC that he had obtained a list of hundreds of millions of links to PDF files from around the Web.[2][3][4] ArchiveTeam decided to make a Warrior project for downloading these files.

Most of the files are open-access scientific documents. Besides being uploaded to the Internet Archive, the files will also be hosted and indexed by CiteSeerX (http://citeseerx.ist.psu.edu/).

To follow or take part in the project discussion, join the #pdflush channel on EFNet.

References