Difference between revisions of "The Pirate Bay"

From Archiveteam
Jump to navigation Jump to search
Line 16: Line 16:
Since the Pirate Bay is only a search engine nowadays, it would be preferable to simply scrape the magnet links, descriptions, and comments. Since the magnet links also include a tracker address, torrent files might just be a thing of the past (might be wrong though, but [https://torrentfreak.com/the-pirate-bay-will-stop-serving-torrents-120112/ the Pirate Bay itself] seems to be betting on it)
Since the Pirate Bay is only a search engine nowadays, it would be preferable to simply scrape the magnet links, descriptions, and comments. Since the magnet links also include a tracker address, torrent files might just be a thing of the past (might be wrong though, but [https://torrentfreak.com/the-pirate-bay-will-stop-serving-torrents-120112/ the Pirate Bay itself] seems to be betting on it)


==Archival Tools==
== Archival Tools ==


We're going to have to make our own tools if we ever want to backup the entire Pirate Bay. To make it usable, we would also have to make it easily editable and searchable, perhaps with website scripts...
* [http://pastebin.com/8RXXthXB Magnet link Dumper]: A perl script that dumps magnet links into a single text file. It was used to make the below magnet archive.


A good idea would be to put magnet links, comments, and the descriptions into a Markdown-formatted wikipage, and then check the files into a git repository. Then, we could simply run the [https://github.com/github/gollum Gollum] wiki engine on it, and voila, Pirate Bay on the go.
== Backups ==


Of course, we would have to make a few modifications to Gollum, such as getting it to support tags. catergories, and other metadata essential to searching such a huge database. It shouldn't be too hard, though.
* [http://thepiratebay.se/torrent/7016365/The_whole_Pirate_Bay_magnet_archive The entire Pirate Bay Magnet Archive]: Every magnet link on the Pirate Bay, all in a tiny little text file. No comments, though.
* [http://thepiratebay.se/torrent/7046494/Pirate_bay_Magnet_Archive_viewer Magnet Archive Viewer]: Parsing text files can be a pain, so this program makes it easy to search and look at the magnet links.


===Existing Archival Tools===
* [http://thepiratebay.se/torrent/7028505/The_Pirate_Bay_full_siterip_2012 Siterip]: This 3GB archive saves all the html pages from the Piratebay, including comments. No torrent files, as usual.
 
* [http://kat.ph/the-pirate-bay-scraper-t2833005.html The Pirate Bay Scraper] - An old set of scripts that saves torrents and their metadata from the Pirate Bay into PHP websites, complete with a SQL database and bzipped torrents. It was instrumental to the creation of the [https://thepiratebay.se/torrent/5051418/Pirate_Bay_Archive_-_884004_torrents___MySQL_db___PHP_demo_site 2009 Pirate Bay Archive]


{{Navigation box}}
{{Navigation box}}

Revision as of 02:31, 23 February 2012

The Pirate Bay
ThePirateBay.png
URL http://www.thepiratebay.org/
Status Online!
Archiving status Not saved yet
Archiving type Unknown
IRC channel #archiveteam-bs (on hackint)

Template:Expand

The Pirate Bay is one of the largest and most popular torrent search engines. It's still having persistent legal problems. The tracker went down in November, but the site still serves torrents and magnet links. If a torrent is lost, it becomes impossible to connect to other computers distributing the shared files. Considering that there are links to TPB all over this wiki, this site is pretty dang important.

Archival Methods

Since the Pirate Bay is only a search engine nowadays, it would be preferable to simply scrape the magnet links, descriptions, and comments. Since the magnet links also include a tracker address, torrent files might just be a thing of the past (might be wrong though, but the Pirate Bay itself seems to be betting on it)

Archival Tools

  • Magnet link Dumper: A perl script that dumps magnet links into a single text file. It was used to make the below magnet archive.

Backups

  • Siterip: This 3GB archive saves all the html pages from the Piratebay, including comments. No torrent files, as usual.