https://wiki.archiveteam.org/api.php?action=feedcontributions&user=Fenn&feedformat=atomArchiveteam - User contributions [en]2024-03-29T06:29:29ZUser contributionsMediaWiki 1.37.1https://wiki.archiveteam.org/index.php?title=GitHub&diff=30605GitHub2018-06-20T22:25:13Z<p>Fenn: /* List of Repositories */ fix link</p>
<hr />
<div>{{Infobox project<br />
| title = GitHub<br />
| logo = GitHub_logo.png<br />
| image = GitHub 1303511667338.png<br />
| description = A screen shot of the GitHub home page taken on 2015-11-08<br />
| URL = {{url|1=https://github.com/|2=GitHub}}<br />
| project_status = {{online}}<br />
| archiving_status = {{nosavedyet}}<br />
| irc = getgit<br />
}}<br />
<br />
:''See also [[GitHub Downloads]]''<br />
<br />
'''GitHub''' is a software repository powered by Git. Does not seem to have any site issues, often 24 hours uptime (see [http://status.github.com/ site status]). Looks pretty sunny at the moment, but when disaster strikes it would be a problem archiving the private repositories.<br />
<br />
As of 12th August 2012: 1,963,652 people hosting over 3,460,582 repositories [https://github.com/search?type=Repositories&q=fork%3Atrue 1,117,147 public repositories] are forks, which greatly reduces the amount of data required to archive it.<br />
As of 22 November 2015: There are 32,000,000 repositories, with a similar fork ratio. Back-of-the-envelope calculations suggest 120TB of data in git repositories.<br />
<br />
== Acquisition by Microsoft ==<br />
<br />
It was [https://www.bloomberg.com/news/articles/2018-06-03/microsoft-is-said-to-have-agreed-to-acquire-coding-site-github reported by Bloomberg] and [https://news.microsoft.com/2018/06/04/microsoft-to-acquire-github-for-7-5-billion/ confirmed on June 4, 2018], that Microsoft bought GitHub for 7.5 billion dollars.<br />
<br />
A discussion into the feasibility of archiving GitHub has commenced in {{IRC|getgit}}.<br />
* Users in the FOSS community fear Microsoft's "embrace, extend, extinguish" schemes in the 1990s and 2000s and many called for a move to rival [[GitLab]] in the wake of the news.<br />
* [[LinkedIn]] shows how user content can be gradually taken away (by means of paywalls and login walls).<br />
<br />
== Backup tools ==<br />
=== git itself ===<br />
<tt>git clone</tt> is the simplest one (and also works outside of GitHub, obviously). However, it does not get some project data that is not stored in git, including issue reports, comments, pull requests.<br />
<br />
When cloning a repository for archival, it is best to use the <tt>--mirror</tt> option. This mirror will include all branches and even the code associated with pull requests. (Note however that the PR code will get purged eventually by Git's GC when you create a clone from this mirror as the PR commits aren't referenced by any branches, though this can be solved by adding a line like <tt>fetch = +refs/pull/*/head:refs/remotes/origin/pr/*</tt> to the repository config file.)<br />
<br />
To pack a clone/mirror into a single, easily handleable file, use <tt>git bundle create FILE --all</tt> inside the clone/mirror.<br />
<br />
=== Other tools ===<br />
<br />
[https://github-backup.branchable.com/ github-backup] runs in a git repository and chases down that information, committing it to a "github" branch. It also chases down the forks and efficiently downloads them as well.<br />
<br />
[http://www.githubarchive.org/ githubarchive.org] and [http://ghtorrent.org/ GHTorrent] are both creating archives of the GitHub "timeline", that is, all events like git pushes, forks, created issues, pull requests, etc.<br />
<br />
[http://codearchive.org codearchive.org] Effort to backup all the versions of all the repos on GitHub and other sources. [https://speakerdeck.com/filosottile/the-code-archive-hope-xi Slides from a talk about it].<br />
<br />
See also [[Software Heritage]].<br />
<br />
== GitHub Replacement Engines ==<br />
<br />
If we ever have to archive the data out of GitHub, the data will need to be exportable to a GitHub-style engine.<br />
<br />
Currently<sup>[when?]</sup>, the best GitHub-style engine that has a Wiki, issues, Git Repo hosting, and is free and open source to use is [http://gitlab.com GitLab]. The engine is used by and paid for by many major organizations, so it is likely to live on in a stable way. Other popular FOSS alternatives to GitHub include [https://gitea.io/en-US/ Gitea] and [https://gogs.io/ Gogs].<br />
<br />
We will need a complete migration system to move a git repository and all related GitHub service information of a repository to GitLab.<br />
<br />
== Things to Scrape ==<br />
<br />
In case of emergency, these are the items we need to grab.<br />
<br />
* Git Repository - Accomplished by github-backup<br />
** Forked Repositories - Accomplished by github-backup<br />
** '''Notes on Commits/Lines of Code''' - Not supported by github-backup yet. GitHub API support exists since ca. 2011.<br />
* '''GitHub Gollum Wiki''' - No tool yet, but just clone the whole thing, and then push it to GitLab.<br />
** The wiki is a full-blown git repository, though only few features are exposed on the user interfaces (e.g. no branches). The clone URL is shown on wiki pages and is <tt>https://github.com/owner/repository.wiki.git</tt>.<br />
* '''Releases''' - Tags on GitHub can have binaries attached. These are of high priority to archive.<br />
* Issues + Comments - Accomplished by github-backup<br />
** '''Milestones''' - ''github-backup currently does not archive this yet.''<br />
** '''Labels''' - ''github-backup currently does not archive this yet.''<br />
* '''Hooks''' - Needs some kind of tool to archive GitHub Hooks<br />
<br />
== List of Repositories ==<br />
<br />
A list of repositories from GitHub API data are maintained by an archive team member at [https://za3k.com/github/ za3k.com]. It scrapes continuously. Public downloads are updated once a day. This list does not include gists.<br />
<br />
== GitHub Archive ==<br />
<br />
The metadata generated by the GitHub API are archived to Google BigQuery every hour by [https://www.githubarchive.org/ GithubArchive]. <br />
<br />
It obviously doesn't grab events '''dating before 2011''', so a targeted repository scrape may still be ideal.<br />
<br />
But at least it could be possible to grab all info about a single repository using Google BigQuery's free version, since it would use a low amount of CPU power. However, we need to create such an export script for it when the time comes.<br />
<br />
== External links ==<br />
* {{url|1=https://github.com/|2=GitHub}}<br />
<br />
{{Navigation box}}</div>Fennhttps://wiki.archiveteam.org/index.php?title=Woohoo&diff=22545Woohoo2015-03-23T19:37:57Z<p>Fenn: /* Sites */ tumblr stats</p>
<hr />
<div>Over the years, [[Yahoo]] has discontinued many of its services and products. As a result, unknown amounts of user data have been deleted or are endangered. Archive Team has [[Projects|rescued]] some of the data on Yahoo-owned sites, but not all. Therefore, Archive Team has decided to take a census of Yahoo services, to see what has and hasn't been saved.<br />
<br />
Join the discussion in [irc://irc.efnet.org/woohoo #woohoo].<br />
<br />
<!-- Feel free to rewrite this description, I just took Froogle's and replaced Google with Yahoo. --><br />
<br />
== Sites ==<br />
<br />
{| class="wikitable"<br />
|-<br />
! Site<br />
! Status<br />
! Saved?<br />
! Number of Users<br />
! Number of Items<br />
! Size (bytes)<br />
|-<br />
| [[Flickr]]<br />
| Online<br />
| No<br />
| 92 million<ref>http://expandedramblings.com/index.php/flickr-stats/</ref><br />
|<br />
|<br />
|-<br />
| [[Tumblr]]<br />
| Online<br />
| No<br />
| 420 million<ref name="tumblr">http://expandedramblings.com/index.php/tumblr-user-stats-fact/</ref><br />
| 217 million blogs, 99 billion posts<ref name="tumblr"/><br />
|<br />
|-<br />
| [[Yahoo! Answers]]<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Yahoo! Groups<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Yahoo! News<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Yahoo! Screen<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Yahoo! Search<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|}<br />
<br />
== References ==<br />
<references /><br />
<br />
== See also ==<br />
<br />
* [[Froogle]]<br />
<br />
== External links ==<br />
<br />
* [http://en.wikipedia.org/wiki/List_of_Yahoo!-owned_sites_and_services List of Yahoo!-owned sites and services] (Wikipedia)</div>Fennhttps://wiki.archiveteam.org/index.php?title=Froogle&diff=22353Froogle2015-03-13T11:56:53Z<p>Fenn: /* Sites */ freebase</p>
<hr />
<div>Over the years, [[Google]] has discontinued many of its services and products. As a result, unknown amounts of user data have been deleted or are endangered. Archive Team has [[Projects|rescued]] some of the data on Google-owned sites, but not all. Therefore, Archive Team has decided to take a census of Google services, to see what has and hasn't been saved.<br />
<br />
Join the discussion in [irc://irc.efnet.org/froogle #froogle].<br />
<br />
== Sites ==<br />
<br />
{| class="wikitable"<br />
|-<br />
! Site<br />
! Status<br />
! Saved?<br />
! Number of Users<br />
! Number of Items<br />
! Size (bytes)<br />
|-<br />
| [[Blogger]]<br />
| Online<br />
| In progress<br />
|<br />
|<br />
|<br />
|-<br />
| FeedBurner<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| FreeBase<br />
| Read-only<br />
| migrating<ref>https://plus.google.com/109936836907132434202/posts/bu3z2wVqcQc</ref><br />
|<br />
| <br />
| 25GB<br />
|-<br />
| Goo.gl<br />
| Online<br />
| [[URLTeam#New_table|started]]<br />
|<br />
|<br />
|<br />
|-<br />
| Google+<br />
| Online<br />
| No<br />
| ~2.2 billion<ref>http://www.businessinsider.com/google-active-users-2015-1</ref>, ~200 million with any public content<br />
|<br />
|<br />
|-<br />
| Google Account (user profiles)<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Answers]]<br />
| Offline<br />
| [https://archive.org/details/google-answers-archive Yes]<br />
|<br />
|<br />
| 1.5GB compressed<br />
|-<br />
| Google Bookmarks<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Books<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Books Ngram]]<br />
| Online<br />
| In progress<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Business Sitebuilder]]<br />
| Closing<br />
| Yes<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Code]]<br />
| Closing<br />
| In progress<br />
|<br />
|<br />
|<br />
|-<br />
| Google Contacts<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Catalogs<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Earth<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Fonts<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Groups<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Helpouts]]<br />
| Closing<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Keep<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Knol<br />
| Offline<br />
| ?<br />
|<br />
|<br />
|<br />
|-<br />
| Google Play<br />
| Online<br />
| No<br />
|<br />
| 1.43 million apps<ref>http://blog.appfigures.com/app-stores-growth-accelerates-in-2014/</ref><br />
|<br />
|-<br />
| Google Map Maker<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Maps<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Moderator<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google News Archive<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Questions and Answers]]<br />
| Closing<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Voice<br />
| Online<br />
| No<br />
| 3.5 million ish<br />
|<br />
|<br />
|-<br />
| Google Web History<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Orkut]]<br />
| [[http://orkut.google.com/en.html Read-only]]<br />
| No<br />
| 66 million<br />
| >1 billion<br />
|<br />
|-<br />
| [[Panoramio]]<br />
| Closing<ref>https://www.change.org/p/google-larry-and-sergey-google-keep-the-panoramio-community-alive</ref><br />
| No<br />
| 1 to 5 million<br />
| 72 million photos<ref>http://stackoverflow.com/questions/20291451/how-to-obtain-the-total-number-of-photos-in-one-area-using-the-panoramio-data-ap</ref><br />
|<br />
|-<br />
| [[Picasa|Picasa Web Albums]]<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[YouTube]]<br />
| Online<br />
| No<br />
| >1 billion<ref>https://www.youtube.com/yt/press/statistics.html</ref><br />
|<br />
|<br />
|-<br />
| Zagat<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|}<br />
<br />
== Google Apps ==<br />
<br />
Many different Google sites have been "featurized" into Google Apps, including:<br />
<br />
{| class="wikitable"<br />
|-<br />
! Site<br />
! Status<br />
! Saved?<br />
! Number of Users<br />
! Number of Items<br />
! Size (bytes)<br />
|-<br />
| Google Calendar<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Classroom<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Docs<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Drive<br />
| Online<br />
| No<br />
| >190 million users<ref>http://9to5google.com/2014/06/25/google-drive-hits-190-million-30-day-active-users/</ref><br />
|<br />
|<br />
|-<br />
| Google Sites<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|}<br />
<br />
== References ==<br />
<references /><br />
<br />
== External links ==<br />
<br />
* [https://twitter.com/textfiles/status/576220473131868162 @textfiles: But no, at this point, it's obvious archive team should audit every Google information store...]<br />
* [https://en.wikipedia.org/wiki/List_of_Google_products List of Google products] (Wikipedia)</div>Fennhttps://wiki.archiveteam.org/index.php?title=Froogle&diff=22352Froogle2015-03-13T10:49:01Z<p>Fenn: /* Sites */ panoramio</p>
<hr />
<div>Over the years, [[Google]] has discontinued many of its services and products. As a result, unknown amounts of user data have been deleted or are endangered. Archive Team has [[Projects|rescued]] some of the data on Google-owned sites, but not all. Therefore, Archive Team has decided to take a census of Google services, to see what has and hasn't been saved.<br />
<br />
Join the discussion in [irc://irc.efnet.org/froogle #froogle].<br />
<br />
== Sites ==<br />
<br />
{| class="wikitable"<br />
|-<br />
! Site<br />
! Status<br />
! Saved?<br />
! Number of Users<br />
! Number of Items<br />
! Size (bytes)<br />
|-<br />
| [[Blogger]]<br />
| Online<br />
| In progress<br />
|<br />
|<br />
|<br />
|-<br />
| FeedBurner<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Goo.gl<br />
| Online<br />
| [[URLTeam#New_table|started]]<br />
|<br />
|<br />
|<br />
|-<br />
| Google+<br />
| Online<br />
| No<br />
| ~2.2 billion<ref>http://www.businessinsider.com/google-active-users-2015-1</ref>, ~200 million with any public content<br />
|<br />
|<br />
|-<br />
| Google Account (user profiles)<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Answers]]<br />
| Offline<br />
| [https://archive.org/details/google-answers-archive Yes]<br />
|<br />
|<br />
| 1.5GB compressed<br />
|-<br />
| Google Bookmarks<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Books<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Books Ngram]]<br />
| Online<br />
| In progress<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Business Sitebuilder]]<br />
| Closing<br />
| Yes<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Code]]<br />
| Closing<br />
| In progress<br />
|<br />
|<br />
|<br />
|-<br />
| Google Contacts<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Catalogs<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Earth<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Fonts<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Groups<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Helpouts]]<br />
| Closing<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Keep<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Knol<br />
| Offline<br />
| ?<br />
|<br />
|<br />
|<br />
|-<br />
| Google Play<br />
| Online<br />
| No<br />
|<br />
| 1.43 million apps<ref>http://blog.appfigures.com/app-stores-growth-accelerates-in-2014/</ref><br />
|<br />
|-<br />
| Google Map Maker<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Maps<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Moderator<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google News Archive<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Questions and Answers]]<br />
| Closing<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Voice<br />
| Online<br />
| No<br />
| 3.5 million ish<br />
|<br />
|<br />
|-<br />
| Google Web History<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Orkut]]<br />
| [[http://orkut.google.com/en.html Read-only]]<br />
| No<br />
| 66 million<br />
| >1 billion<br />
|<br />
|-<br />
| [[Panoramio]]<br />
| Closing<ref>https://www.change.org/p/google-larry-and-sergey-google-keep-the-panoramio-community-alive</ref><br />
| No<br />
| 1 to 5 million<br />
| 72 million photos<ref>http://stackoverflow.com/questions/20291451/how-to-obtain-the-total-number-of-photos-in-one-area-using-the-panoramio-data-ap</ref><br />
|<br />
|-<br />
| [[Picasa|Picasa Web Albums]]<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[YouTube]]<br />
| Online<br />
| No<br />
| >1 billion<ref>https://www.youtube.com/yt/press/statistics.html</ref><br />
|<br />
|<br />
|-<br />
| Zagat<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|}<br />
<br />
== Google Apps ==<br />
<br />
Many different Google sites have been "featurized" into Google Apps, including:<br />
<br />
{| class="wikitable"<br />
|-<br />
! Site<br />
! Status<br />
! Saved?<br />
! Number of Users<br />
! Number of Items<br />
! Size (bytes)<br />
|-<br />
| Google Calendar<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Classroom<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Docs<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Drive<br />
| Online<br />
| No<br />
| >190 million users<ref>http://9to5google.com/2014/06/25/google-drive-hits-190-million-30-day-active-users/</ref><br />
|<br />
|<br />
|-<br />
| Google Sites<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|}<br />
<br />
== References ==<br />
<references /><br />
<br />
== External links ==<br />
<br />
* [https://twitter.com/textfiles/status/576220473131868162 @textfiles: But no, at this point, it's obvious archive team should audit every Google information store...]<br />
* [https://en.wikipedia.org/wiki/List_of_Google_products List of Google products] (Wikipedia)</div>Fennhttps://wiki.archiveteam.org/index.php?title=Froogle&diff=22351Froogle2015-03-13T10:23:27Z<p>Fenn: /* Sites */ orkut</p>
<hr />
<div>Over the years, [[Google]] has discontinued many of its services and products. As a result, unknown amounts of user data have been deleted or are endangered. Archive Team has [[Projects|rescued]] some of the data on Google-owned sites, but not all. Therefore, Archive Team has decided to take a census of Google services, to see what has and hasn't been saved.<br />
<br />
Join the discussion in [irc://irc.efnet.org/froogle #froogle].<br />
<br />
== Sites ==<br />
<br />
{| class="wikitable"<br />
|-<br />
! Site<br />
! Status<br />
! Saved?<br />
! Number of Users<br />
! Number of Items<br />
! Size (bytes)<br />
|-<br />
| [[Blogger]]<br />
| Online<br />
| In progress<br />
|<br />
|<br />
|<br />
|-<br />
| FeedBurner<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Goo.gl<br />
| Online<br />
| [[URLTeam#New_table|started]]<br />
|<br />
|<br />
|<br />
|-<br />
| Google+<br />
| Online<br />
| No<br />
| ~2.2 billion<ref>http://www.businessinsider.com/google-active-users-2015-1</ref>, ~200 million with any public content<br />
|<br />
|<br />
|-<br />
| Google Account (user profiles)<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Answers]]<br />
| Offline<br />
| [https://archive.org/details/google-answers-archive Yes]<br />
|<br />
|<br />
| 1.5GB compressed<br />
|-<br />
| Google Bookmarks<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Books<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Books Ngram]]<br />
| Online<br />
| In progress<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Business Sitebuilder]]<br />
| Closing<br />
| Yes<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Code]]<br />
| Closing<br />
| In progress<br />
|<br />
|<br />
|<br />
|-<br />
| Google Contacts<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Catalogs<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Earth<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Fonts<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Groups<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Helpouts]]<br />
| Closing<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Keep<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Knol<br />
| Offline<br />
| ?<br />
|<br />
|<br />
|<br />
|-<br />
| Google Play<br />
| Online<br />
| No<br />
|<br />
| 1.43 million apps<ref>http://blog.appfigures.com/app-stores-growth-accelerates-in-2014/</ref><br />
|<br />
|-<br />
| Google Map Maker<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Maps<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Moderator<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google News Archive<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Questions and Answers]]<br />
| Closing<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Voice<br />
| Online<br />
| No<br />
| 3.5 million ish<br />
|<br />
|<br />
|-<br />
| Google Web History<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Orkut]]<br />
| [[http://orkut.google.com/en.html Read-only]]<br />
| No<br />
| 66 million<br />
| >1 billion<br />
|<br />
|-<br />
| [[Picasa|Picasa Web Albums]]<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[YouTube]]<br />
| Online<br />
| No<br />
| >1 billion<ref>https://www.youtube.com/yt/press/statistics.html</ref><br />
|<br />
|<br />
|-<br />
| Zagat<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|}<br />
<br />
== Google Apps ==<br />
<br />
Many different Google sites have been "featurized" into Google Apps, including:<br />
<br />
{| class="wikitable"<br />
|-<br />
! Site<br />
! Status<br />
! Saved?<br />
! Number of Users<br />
! Number of Items<br />
! Size (bytes)<br />
|-<br />
| Google Calendar<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Classroom<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Docs<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Drive<br />
| Online<br />
| No<br />
| >190 million users<ref>http://9to5google.com/2014/06/25/google-drive-hits-190-million-30-day-active-users/</ref><br />
|<br />
|<br />
|-<br />
| Google Sites<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|}<br />
<br />
== References ==<br />
<references /><br />
<br />
== External links ==<br />
<br />
* [https://twitter.com/textfiles/status/576220473131868162 @textfiles: But no, at this point, it's obvious archive team should audit every Google information store...]<br />
* [https://en.wikipedia.org/wiki/List_of_Google_products List of Google products] (Wikipedia)</div>Fennhttps://wiki.archiveteam.org/index.php?title=Froogle&diff=22350Froogle2015-03-13T10:00:26Z<p>Fenn: /* Sites */ google voice</p>
<hr />
<div>Over the years, [[Google]] has discontinued many of its services and products. As a result, unknown amounts of user data have been deleted or are endangered. Archive Team has [[Projects|rescued]] some of the data on Google-owned sites, but not all. Therefore, Archive Team has decided to take a census of Google services, to see what has and hasn't been saved.<br />
<br />
Join the discussion in [irc://irc.efnet.org/froogle #froogle].<br />
<br />
== Sites ==<br />
<br />
{| class="wikitable"<br />
|-<br />
! Site<br />
! Status<br />
! Saved?<br />
! Number of Users<br />
! Number of Items<br />
! Size (bytes)<br />
|-<br />
| [[Blogger]]<br />
| Online<br />
| In progress<br />
|<br />
|<br />
|<br />
|-<br />
| FeedBurner<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Goo.gl<br />
| Online<br />
| [[URLTeam#New_table|started]]<br />
|<br />
|<br />
|<br />
|-<br />
| Google+<br />
| Online<br />
| No<br />
| ~2.2 billion<ref>http://www.businessinsider.com/google-active-users-2015-1</ref>, ~200 million with any public content<br />
|<br />
|<br />
|-<br />
| Google Account (user profiles)<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Answers]]<br />
| Offline<br />
| [https://archive.org/details/google-answers-archive Yes]<br />
|<br />
|<br />
| 1.5GB compressed<br />
|-<br />
| Google Bookmarks<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Books<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Books Ngram]]<br />
| Online<br />
| In progress<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Business Sitebuilder]]<br />
| Closing<br />
| Yes<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Code]]<br />
| Closing<br />
| In progress<br />
|<br />
|<br />
|<br />
|-<br />
| Google Contacts<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Catalogs<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Earth<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Fonts<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Groups<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Helpouts]]<br />
| Closing<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Keep<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Knol<br />
| Offline<br />
| ?<br />
|<br />
|<br />
|<br />
|-<br />
| Google Play<br />
| Online<br />
| No<br />
|<br />
| 1.43 million apps<ref>http://blog.appfigures.com/app-stores-growth-accelerates-in-2014/</ref><br />
|<br />
|-<br />
| Google Map Maker<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Maps<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Moderator<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google News Archive<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Questions and Answers]]<br />
| Closing<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Voice<br />
| Online<br />
| No<br />
| 3.5 million ish<br />
|<br />
|<br />
|-<br />
| Google Web History<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Picasa|Picasa Web Albums]]<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[YouTube]]<br />
| Online<br />
| No<br />
| >1 billion<ref>https://www.youtube.com/yt/press/statistics.html</ref><br />
|<br />
|<br />
|-<br />
| Zagat<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|}<br />
<br />
== Google Apps ==<br />
<br />
Many different Google sites have been "featurized" into Google Apps, including:<br />
<br />
{| class="wikitable"<br />
|-<br />
! Site<br />
! Status<br />
! Saved?<br />
! Number of Users<br />
! Number of Items<br />
! Size (bytes)<br />
|-<br />
| Google Calendar<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Classroom<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Docs<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Drive<br />
| Online<br />
| No<br />
| >190 million users<ref>http://9to5google.com/2014/06/25/google-drive-hits-190-million-30-day-active-users/</ref><br />
|<br />
|<br />
|-<br />
| Google Sites<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|}<br />
<br />
== References ==<br />
<references /><br />
<br />
== External links ==<br />
<br />
* [https://twitter.com/textfiles/status/576220473131868162 @textfiles: But no, at this point, it's obvious archive team should audit every Google information store...]<br />
* [https://en.wikipedia.org/wiki/List_of_Google_products List of Google products] (Wikipedia)</div>Fennhttps://wiki.archiveteam.org/index.php?title=Froogle&diff=22349Froogle2015-03-13T09:50:08Z<p>Fenn: /* Sites */ goo.gl</p>
<hr />
<div>Over the years, [[Google]] has discontinued many of its services and products. As a result, unknown amounts of user data have been deleted or are endangered. Archive Team has [[Projects|rescued]] some of the data on Google-owned sites, but not all. Therefore, Archive Team has decided to take a census of Google services, to see what has and hasn't been saved.<br />
<br />
Join the discussion in [irc://irc.efnet.org/froogle #froogle].<br />
<br />
== Sites ==<br />
<br />
{| class="wikitable"<br />
|-<br />
! Site<br />
! Status<br />
! Saved?<br />
! Number of Users<br />
! Number of Items<br />
! Size (bytes)<br />
|-<br />
| [[Blogger]]<br />
| Online<br />
| In progress<br />
|<br />
|<br />
|<br />
|-<br />
| FeedBurner<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Goo.gl<br />
| Online<br />
| [[URLTeam#New_table|started]]<br />
|<br />
|<br />
|<br />
|-<br />
| Google+<br />
| Online<br />
| No<br />
| ~2.2 billion<ref>http://www.businessinsider.com/google-active-users-2015-1</ref>, ~200 million with any public content<br />
|<br />
|<br />
|-<br />
| Google Account (user profiles)<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Answers]]<br />
| Offline<br />
| [https://archive.org/details/google-answers-archive Yes]<br />
|<br />
|<br />
| 1.5GB compressed<br />
|-<br />
| Google Bookmarks<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Books<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Books Ngram]]<br />
| Online<br />
| In progress<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Business Sitebuilder]]<br />
| Closing<br />
| Yes<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Code]]<br />
| Closing<br />
| In progress<br />
|<br />
|<br />
|<br />
|-<br />
| Google Contacts<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Catalogs<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Earth<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Fonts<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Groups<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Helpouts]]<br />
| Closing<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Keep<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Knol<br />
| Offline<br />
| ?<br />
|<br />
|<br />
|<br />
|-<br />
| Google Play<br />
| Online<br />
| No<br />
|<br />
| 1.43 million apps<ref>http://blog.appfigures.com/app-stores-growth-accelerates-in-2014/</ref><br />
|<br />
|-<br />
| Google Map Maker<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Maps<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Moderator<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google News Archive<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Questions and Answers]]<br />
| Closing<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Web History<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Picasa|Picasa Web Albums]]<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[YouTube]]<br />
| Online<br />
| No<br />
| >1 billion<ref>https://www.youtube.com/yt/press/statistics.html</ref><br />
|<br />
|<br />
|-<br />
| Zagat<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|}<br />
<br />
== Google Apps ==<br />
<br />
Many different Google sites have been "featurized" into Google Apps, including:<br />
<br />
{| class="wikitable"<br />
|-<br />
! Site<br />
! Status<br />
! Saved?<br />
! Number of Users<br />
! Number of Items<br />
! Size (bytes)<br />
|-<br />
| Google Calendar<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Classroom<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Docs<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Drive<br />
| Online<br />
| No<br />
| >190 million users<ref>http://9to5google.com/2014/06/25/google-drive-hits-190-million-30-day-active-users/</ref><br />
|<br />
|<br />
|-<br />
| Google Sites<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|}<br />
<br />
== References ==<br />
<references /><br />
<br />
== External links ==<br />
<br />
* [https://twitter.com/textfiles/status/576220473131868162 @textfiles: But no, at this point, it's obvious archive team should audit every Google information store...]<br />
* [https://en.wikipedia.org/wiki/List_of_Google_products List of Google products] (Wikipedia)</div>Fennhttps://wiki.archiveteam.org/index.php?title=Froogle&diff=22348Froogle2015-03-13T07:45:23Z<p>Fenn: /* Sites */ knol</p>
<hr />
<div>Over the years, [[Google]] has discontinued many of its services and products. As a result, unknown amounts of user data have been deleted or are endangered. Archive Team has [[Projects|rescued]] some of the data on Google-owned sites, but not all. Therefore, Archive Team has decided to take a census of Google services, to see what has and hasn't been saved.<br />
<br />
Join the discussion in [irc://irc.efnet.org/froogle #froogle].<br />
<br />
== Sites ==<br />
<br />
{| class="wikitable"<br />
|-<br />
! Site<br />
! Status<br />
! Saved?<br />
! Number of Users<br />
! Number of Items<br />
! Size (bytes)<br />
|-<br />
| [[Blogger]]<br />
| Online<br />
| In progress<br />
|<br />
|<br />
|<br />
|-<br />
| FeedBurner<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google+<br />
| Online<br />
| No<br />
| ~2.2 billion<ref>http://www.businessinsider.com/google-active-users-2015-1</ref>, ~200 million with any public content<br />
|<br />
|<br />
|-<br />
| Google Account (user profiles)<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Answers]]<br />
| Offline<br />
| [https://archive.org/details/google-answers-archive Yes]<br />
|<br />
|<br />
| 1.5GB compressed<br />
|-<br />
| Google Bookmarks<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Books<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Books Ngram]]<br />
| Online<br />
| In progress<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Business Sitebuilder]]<br />
| Closing<br />
| Yes<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Code]]<br />
| Closing<br />
| In progress<br />
|<br />
|<br />
|<br />
|-<br />
| Google Contacts<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Catalogs<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Earth<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Fonts<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Groups<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Helpouts]]<br />
| Closing<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Keep<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Knol<br />
| Offline<br />
| ?<br />
|<br />
|<br />
|<br />
|-<br />
| Google Play<br />
| Online<br />
| No<br />
|<br />
| 1.43 million apps<ref>http://blog.appfigures.com/app-stores-growth-accelerates-in-2014/</ref><br />
|<br />
|-<br />
| Google Map Maker<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Maps<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Moderator<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google News Archive<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Questions and Answers]]<br />
| Closing<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Web History<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Picasa|Picasa Web Albums]]<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[YouTube]]<br />
| Online<br />
| No<br />
| >1 billion<ref>https://www.youtube.com/yt/press/statistics.html</ref><br />
|<br />
|<br />
|-<br />
| Zagat<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|}<br />
<br />
== Google Apps ==<br />
<br />
Many different Google sites have been "featurized" into Google Apps, including:<br />
<br />
{| class="wikitable"<br />
|-<br />
! Site<br />
! Status<br />
! Saved?<br />
! Number of Users<br />
! Number of Items<br />
! Size (bytes)<br />
|-<br />
| Google Calendar<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Classroom<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Docs<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Drive<br />
| Online<br />
| No<br />
| >190 million users<ref>http://9to5google.com/2014/06/25/google-drive-hits-190-million-30-day-active-users/</ref><br />
|<br />
|<br />
|-<br />
| Google Sites<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|}<br />
<br />
== References ==<br />
<references /><br />
<br />
== External links ==<br />
<br />
* [https://twitter.com/textfiles/status/576220473131868162 @textfiles: But no, at this point, it's obvious archive team should audit every Google information store...]<br />
* [https://en.wikipedia.org/wiki/List_of_Google_products List of Google products] (Wikipedia)</div>Fennhttps://wiki.archiveteam.org/index.php?title=Froogle&diff=22347Froogle2015-03-13T07:41:53Z<p>Fenn: /* Sites */ shoulda used preview</p>
<hr />
<div>Over the years, [[Google]] has discontinued many of its services and products. As a result, unknown amounts of user data have been deleted or are endangered. Archive Team has [[Projects|rescued]] some of the data on Google-owned sites, but not all. Therefore, Archive Team has decided to take a census of Google services, to see what has and hasn't been saved.<br />
<br />
Join the discussion in [irc://irc.efnet.org/froogle #froogle].<br />
<br />
== Sites ==<br />
<br />
{| class="wikitable"<br />
|-<br />
! Site<br />
! Status<br />
! Saved?<br />
! Number of Users<br />
! Number of Items<br />
! Size (bytes)<br />
|-<br />
| [[Blogger]]<br />
| Online<br />
| In progress<br />
|<br />
|<br />
|<br />
|-<br />
| FeedBurner<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google+<br />
| Online<br />
| No<br />
| ~2.2 billion<ref>http://www.businessinsider.com/google-active-users-2015-1</ref>, ~200 million with any public content<br />
|<br />
|<br />
|-<br />
| Google Account (user profiles)<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Answers]]<br />
| Offline<br />
| [https://archive.org/details/google-answers-archive Yes]<br />
|<br />
|<br />
| 1.5GB compressed<br />
|-<br />
| Google Bookmarks<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Books<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Books Ngram]]<br />
| Online<br />
| In progress<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Business Sitebuilder]]<br />
| Closing<br />
| Yes<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Code]]<br />
| Closing<br />
| In progress<br />
|<br />
|<br />
|<br />
|-<br />
| Google Contacts<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Catalogs<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Earth<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Fonts<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Groups<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Helpouts]]<br />
| Closing<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Keep<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Play<br />
| Online<br />
| No<br />
|<br />
| 1.43 million apps<ref>http://blog.appfigures.com/app-stores-growth-accelerates-in-2014/</ref><br />
|<br />
|-<br />
| Google Map Maker<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Maps<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Moderator<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google News Archive<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Questions and Answers]]<br />
| Closing<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Web History<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Picasa|Picasa Web Albums]]<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[YouTube]]<br />
| Online<br />
| No<br />
| >1 billion<ref>https://www.youtube.com/yt/press/statistics.html</ref><br />
|<br />
|<br />
|-<br />
| Zagat<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|}<br />
<br />
== Google Apps ==<br />
<br />
Many different Google sites have been "featurized" into Google Apps, including:<br />
<br />
{| class="wikitable"<br />
|-<br />
! Site<br />
! Status<br />
! Saved?<br />
! Number of Users<br />
! Number of Items<br />
! Size (bytes)<br />
|-<br />
| Google Calendar<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Classroom<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Docs<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Drive<br />
| Online<br />
| No<br />
| >190 million users<ref>http://9to5google.com/2014/06/25/google-drive-hits-190-million-30-day-active-users/</ref><br />
|<br />
|<br />
|-<br />
| Google Sites<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|}<br />
<br />
== References ==<br />
<references /><br />
<br />
== External links ==<br />
<br />
* [https://twitter.com/textfiles/status/576220473131868162 @textfiles: But no, at this point, it's obvious archive team should audit every Google information store...]<br />
* [https://en.wikipedia.org/wiki/List_of_Google_products List of Google products] (Wikipedia)</div>Fennhttps://wiki.archiveteam.org/index.php?title=Froogle&diff=22346Froogle2015-03-13T07:40:58Z<p>Fenn: /* Sites */ web history</p>
<hr />
<div>Over the years, [[Google]] has discontinued many of its services and products. As a result, unknown amounts of user data have been deleted or are endangered. Archive Team has [[Projects|rescued]] some of the data on Google-owned sites, but not all. Therefore, Archive Team has decided to take a census of Google services, to see what has and hasn't been saved.<br />
<br />
Join the discussion in [irc://irc.efnet.org/froogle #froogle].<br />
<br />
== Sites ==<br />
<br />
{| class="wikitable"<br />
|-<br />
! Site<br />
! Status<br />
! Saved?<br />
! Number of Users<br />
! Number of Items<br />
! Size (bytes)<br />
|-<br />
| [[Blogger]]<br />
| Online<br />
| In progress<br />
|<br />
|<br />
|<br />
|-<br />
| FeedBurner<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google+<br />
| Online<br />
| No<br />
| ~2.2 billion<ref>http://www.businessinsider.com/google-active-users-2015-1</ref>, ~200 million with any public content<br />
|<br />
|<br />
|-<br />
| Google Account (user profiles)<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Answers]]<br />
| Offline<br />
| [https://archive.org/details/google-answers-archive Yes]<br />
|<br />
|<br />
| 1.5GB compressed<br />
|-<br />
| Google Bookmarks<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Books<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Books Ngram]]<br />
| Online<br />
| In progress<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Business Sitebuilder]]<br />
| Closing<br />
| Yes<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Code]]<br />
| Closing<br />
| In progress<br />
|<br />
|<br />
|<br />
|-<br />
| Google Contacts<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Catalogs<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Earth<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Fonts<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Groups<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Helpouts]]<br />
| Closing<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Keep<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Play<br />
| Online<br />
| No<br />
|<br />
| 1.43 million apps<ref>http://blog.appfigures.com/app-stores-growth-accelerates-in-2014/</ref><br />
|<br />
|-<br />
| Google Map Maker<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Maps<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Moderator<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google News Archive<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[Google Questions and Answers]]<br />
| Closing<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Web History<br />
| Online<br />
| No<br />
|<br />
|<br />
||-<br />
| [[Picasa|Picasa Web Albums]]<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| [[YouTube]]<br />
| Online<br />
| No<br />
| >1 billion<ref>https://www.youtube.com/yt/press/statistics.html</ref><br />
|<br />
|<br />
|-<br />
| Zagat<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|}<br />
<br />
== Google Apps ==<br />
<br />
Many different Google sites have been "featurized" into Google Apps, including:<br />
<br />
{| class="wikitable"<br />
|-<br />
! Site<br />
! Status<br />
! Saved?<br />
! Number of Users<br />
! Number of Items<br />
! Size (bytes)<br />
|-<br />
| Google Calendar<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Classroom<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Docs<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|-<br />
| Google Drive<br />
| Online<br />
| No<br />
| >190 million users<ref>http://9to5google.com/2014/06/25/google-drive-hits-190-million-30-day-active-users/</ref><br />
|<br />
|<br />
|-<br />
| Google Sites<br />
| Online<br />
| No<br />
|<br />
|<br />
|<br />
|}<br />
<br />
== References ==<br />
<references /><br />
<br />
== External links ==<br />
<br />
* [https://twitter.com/textfiles/status/576220473131868162 @textfiles: But no, at this point, it's obvious archive team should audit every Google information store...]<br />
* [https://en.wikipedia.org/wiki/List_of_Google_products List of Google products] (Wikipedia)</div>Fennhttps://wiki.archiveteam.org/index.php?title=Google_Code&diff=22333Google Code2015-03-13T02:57:20Z<p>Fenn: /* URL lists */ add TODO</p>
<hr />
<div>{{Infobox project<br />
| title = Google Code<br />
| image = Google_Code_1303511937361.png<br />
| description = <br />
| URL = {{url|1=http://code.google.com|2=Google Code}}<br />
| project_status = {{closing}}<br />
| archiving_status = {{upcoming}}<br />
| irc = googlecodeblue<br />
}}<br />
<br />
'''Google Code''' (AKA Project Hosting) is a software repository that is owned by [[Google]]. It hosts only open source software paired with an open source license.<ref>[https://code.google.com/p/support/wiki/FAQ#Hosting_Your_Open_Source_Project_on_Google_Code FAQ - support - Project Hosting on Google Code FAQ - User support for Google Project Hosting - Google Project Hosting]</ref><br />
<br />
Google Code allows people to commit their code into either a Subversion (SVN), Git or Mercurial repository. It has a downloads section for people to upload their software packages (with a quota limit of 4GB, can be increased upon request) and also a wiki for projects to document their work at. There is also an issue tracker to track bugs in the project's software.<br />
<br />
== Vital signs ==<br />
<br />
Closing on 25th January, 2016<ref>[http://google-opensource.blogspot.com/ncr/2015/03/farewell-to-google-code.html Bidding farewell to Google Code]</ref>.<br />
<br />
== Archiving ==<br />
Archiving source code repositories is rather easy (and incremental). Just clone the git/hg repository, or checkout SVN repo. For SVN, make sure that you checkout all branches, not just trunk.<br />
<br />
Archiving bugtrackers and the other stuff will be a bit harder.<br />
<br />
A tool to export a repository to GitHub is available<ref>[http://code.google.com/export-to-github Export to GitHub - Google Code]</ref>.<br />
<br />
=== URL lists ===<br />
Some seeds for site discovery:<br />
* Underway: Scrape Google Code Search<br />
** Enumerate a list of labels, then fetch results for each label.<br />
*** [http://paste.archivingyoursh.it/govetoviko.avrasm '''Phase 2.5'''].<br />
** Google Code search results can be grabbed in packs of 100, just add "&num=100" to the end of the URL.<br />
** [http://paste.archivingyoursh.it/raw/fajesufise.vhdl '''Phase 1''']. Quick grep says 114,262 projects, plus 71,972 labels for further searching.<br />
* [http://paste.archivingyoursh.it/raw/himupisime URLs from ArchiveTeam IRC logs]<br />
* [http://paste.archivingyoursh.it/raw/pehobejoxi List scraped from MediaWiki wikis]<br />
* [http://paste.archivingyoursh.it/raw/yulugedasa List from FlossMole's data] (sorted from a possibly-incomplete survey in November 2012: http://flossdata.syr.edu/data/gc/)<br />
* [http://paste.archivingyoursh.it/raw/jepivocine Links from Open Directory Project]<br />
* TODO: Scrape Google Search<br />
* TODO: Scrape Bing<br />
* TODO: Scrape Twitter<br />
* TODO: Scrape the Common Crawl Index<br />
* TODO: ask chris dibona for a complete list of projects<br />
<br />
===Tools ===<br />
* FlossMole provides [https://code.google.com/p/flossmole/source/browse/#svn%2FFLOSSmoleGoogleCode%2Fsrc a set of tools] to spider projects from GC<br />
<br />
== References ==<br />
<references /><br />
<br />
== External links ==<br />
* {{url|1=http://code.google.com|2=Google Code}}<br />
* {{url|1=http://www.wired.com/2015/03/github-conquered-google-microsoft-everyone-else/|2=How GitHub Conquered Google, Microsoft, and Everyone Else}}<br />
<br />
{{Navigation box}}<br />
[[Category:Google]]</div>Fennhttps://wiki.archiveteam.org/index.php?title=The_Pirate_Bay&diff=21127The Pirate Bay2014-12-21T23:29:23Z<p>Fenn: /* Backups */ another file size. this one is annoying because github doesnt tell you how big it is</p>
<hr />
<div>{{Infobox project<br />
| title = The Pirate Bay<br />
| logo = ThePirateBay.png<br />
| image = Thepiratebay_homepage_screenshot.png<br />
| description = <br />
| URL = http://www.thepiratebay.org/<br />
| project_status = {{offline}}<br />
| archiving_status = Partially {{saved}}<br />
| irc = yarharfiddlededee<br />
}}<br />
<br />
'''[[The Pirate Bay]]''' is one of the largest and most popular torrent search engines.<br />
<br />
It's still having persistent legal problems. The tracker went down in November 2012, but the site still serves torrents and magnet links. If a torrent is lost, it becomes impossible to connect to other computers distributing the shared files. Considering that there are links to TPB ''all over this wiki'', this site is pretty dang important.<br />
<br />
On December 2014, the website went offline due to an alleged raid<ref>https://torrentfreak.com/swedish-police-raid-the-pirate-bay-site-offline-141209/</ref>.<br />
<br />
==In case of Fire==<br />
<br />
To prevent damage to the Archive Team if The Pirate Bay ever goes down, we should include a Magnet Link next to every TPB link we have.<br />
<br />
=== Archival Methods ===<br />
<br />
We can simply scrape the magnet links, descriptions, and comments. The hard part would probably be keeping it all updated... (Maybe we could use a git repository, and pull as necessary?) <br />
<br />
Magnet links are provided in the Pirate Bay Magnet Archive below, and descriptions and comments are in the siterip.<br />
<br />
=== Archival Tools ===<br />
<br />
* [http://pastebin.com/8RXXthXB Magnet link Dumper]: A perl script that dumps magnet links into a single text file. It was used to make the below magnet archive.<br />
<br />
* [http://github.com/andronikov/tpb2csv tpb2csv]: scrapes the pirate bay website and strips out all the html crap, leaving only pure sweet metadata.<br />
** details.csv: Title, Type, Files, Size, IMDB, Spoken Languages, Texted Languages, Tags, Quality (+), Quality (-), Uploaded, By, User Type, Seeders, Leechers, Info Hash, Picture, Capture Date<br />
** description.txt<br />
** comments.csv: User Type, Username, Date, Text<br />
<br />
== Backups ==<br />
<br />
* [https://archive.org/details/PirateBayComplete20130219 rich.xml.7z] 662MB 7z database dump from 2013-02-19<br />
** torrent:urn:sha1:e4b6f847647211b930219492ecf1a9c7bc696d29<br />
** '''Magnet link''': ''magnet:?xt=urn:btih:e4b6f847647211b930219492ecf1a9c7bc696d29<br />
<br />
* [http://thepiratebay.se/torrent/7016365/The_whole_Pirate_Bay_magnet_archive The entire Pirate Bay Magnet Archive]: Every magnet link on the Pirate Bay, all in a tiny little text file. No comments, though.<br />
** '''Magnet link''': ''magnet:?xt=urn:btih:938802790a385c49307f34cca4d30f80b03df59c&dn=The+whole+Pirate+Bay+magnet+archive&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [https://thepiratebay.se/torrent/8156416 Updated (February 2013) Listing]<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:277e1afa0038db7299cd8274310556526599f67c&dn=Small+pirate+bay+archive+%28february+2013%29&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.istole.it%3A6969&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [http://thepiratebay.se/torrent/7046494/Pirate_bay_Magnet_Archive_viewer Magnet Archive Viewer]: Parsing text files can be a pain, so this program makes it easy to search and look at the magnet links.<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:f7a08a62a11ba6dfe39f1cd0b7e8a5a50d5379aa&dn=Pirate+bay+Magnet+Archive+viewer&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [http://thepiratebay.se/torrent/7028505/The_Pirate_Bay_full_siterip_2012 Siterip]: This 3GB archive saves all the html pages from the Piratebay, including comments. No torrent files, as usual.<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:3ab8dd096aea63ddf668a127b81ba7fb6799364d&dn=The+Pirate+Bay+full+siterip+2012&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [https://thepiratebay.se/torrent/7706886/Backup_of_The_Pirate_Bay_%28IDs__3200000_-_7700000%29 IDs 3200000-7699999]: tpb2csv 1.23GB 7z Backup as of 2012-10-06. It includes all comments, filelists, and details in csv format.<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:0dfe31d5d91058bcbe5cfbcf98646700890afea0&dn=Backup+of+The+Pirate+Bay+%28IDs%3A+3200000+-+7700000%29&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.istole.it%3A6969&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [https://thepiratebay.se/torrent/8044295/Backup_of_The_Pirate_Bay_%28IDs__7700000_-_7999999%29 IDs 7700000-7999999]: tpb2csv 69MB 7z Backup as of 2013-01-09. It includes all comments, filelists, and details in csv format.<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:9f9c8cab8b68956a25d6e8c190e5e8dc8cf7186c&dn=Backup+of+The+Pirate+Bay+%28IDs%3A+7700000+-+7999999%29&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.istole.it%3A6969&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
<br />
<br />
* '''Pirate Bay Metadata Git Repo''': https://github.com/tpb-archive 596MB zip tpb2csv scraped metadata including comments, file lists, descriptions, details. link broken, github censorship?<br />
** [https://github.com/tpb-archive/8xxxxxx IDs 8000000-8999999] tpb2csv scraped metadata not included by "Backup" torrents listed above, fetched on 2013-06-02<br />
<br />
at the end of 2014 there were 11000000 ish torrent ID's<br />
<br />
== References ==<br />
<references/><br />
<br />
{{Navigation box}}</div>Fennhttps://wiki.archiveteam.org/index.php?title=The_Pirate_Bay&diff=21126The Pirate Bay2014-12-21T23:27:53Z<p>Fenn: /* Backups */ add some torrent/file sizes</p>
<hr />
<div>{{Infobox project<br />
| title = The Pirate Bay<br />
| logo = ThePirateBay.png<br />
| image = Thepiratebay_homepage_screenshot.png<br />
| description = <br />
| URL = http://www.thepiratebay.org/<br />
| project_status = {{offline}}<br />
| archiving_status = Partially {{saved}}<br />
| irc = yarharfiddlededee<br />
}}<br />
<br />
'''[[The Pirate Bay]]''' is one of the largest and most popular torrent search engines.<br />
<br />
It's still having persistent legal problems. The tracker went down in November 2012, but the site still serves torrents and magnet links. If a torrent is lost, it becomes impossible to connect to other computers distributing the shared files. Considering that there are links to TPB ''all over this wiki'', this site is pretty dang important.<br />
<br />
On December 2014, the website went offline due to an alleged raid<ref>https://torrentfreak.com/swedish-police-raid-the-pirate-bay-site-offline-141209/</ref>.<br />
<br />
==In case of Fire==<br />
<br />
To prevent damage to the Archive Team if The Pirate Bay ever goes down, we should include a Magnet Link next to every TPB link we have.<br />
<br />
=== Archival Methods ===<br />
<br />
We can simply scrape the magnet links, descriptions, and comments. The hard part would probably be keeping it all updated... (Maybe we could use a git repository, and pull as necessary?) <br />
<br />
Magnet links are provided in the Pirate Bay Magnet Archive below, and descriptions and comments are in the siterip.<br />
<br />
=== Archival Tools ===<br />
<br />
* [http://pastebin.com/8RXXthXB Magnet link Dumper]: A perl script that dumps magnet links into a single text file. It was used to make the below magnet archive.<br />
<br />
* [http://github.com/andronikov/tpb2csv tpb2csv]: scrapes the pirate bay website and strips out all the html crap, leaving only pure sweet metadata.<br />
** details.csv: Title, Type, Files, Size, IMDB, Spoken Languages, Texted Languages, Tags, Quality (+), Quality (-), Uploaded, By, User Type, Seeders, Leechers, Info Hash, Picture, Capture Date<br />
** description.txt<br />
** comments.csv: User Type, Username, Date, Text<br />
<br />
== Backups ==<br />
<br />
* [https://archive.org/details/PirateBayComplete20130219 rich.xml.7z] 662MB 7z database dump from 2013-02-19<br />
** torrent:urn:sha1:e4b6f847647211b930219492ecf1a9c7bc696d29<br />
** '''Magnet link''': ''magnet:?xt=urn:btih:e4b6f847647211b930219492ecf1a9c7bc696d29<br />
<br />
* [http://thepiratebay.se/torrent/7016365/The_whole_Pirate_Bay_magnet_archive The entire Pirate Bay Magnet Archive]: Every magnet link on the Pirate Bay, all in a tiny little text file. No comments, though.<br />
** '''Magnet link''': ''magnet:?xt=urn:btih:938802790a385c49307f34cca4d30f80b03df59c&dn=The+whole+Pirate+Bay+magnet+archive&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [https://thepiratebay.se/torrent/8156416 Updated (February 2013) Listing]<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:277e1afa0038db7299cd8274310556526599f67c&dn=Small+pirate+bay+archive+%28february+2013%29&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.istole.it%3A6969&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [http://thepiratebay.se/torrent/7046494/Pirate_bay_Magnet_Archive_viewer Magnet Archive Viewer]: Parsing text files can be a pain, so this program makes it easy to search and look at the magnet links.<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:f7a08a62a11ba6dfe39f1cd0b7e8a5a50d5379aa&dn=Pirate+bay+Magnet+Archive+viewer&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [http://thepiratebay.se/torrent/7028505/The_Pirate_Bay_full_siterip_2012 Siterip]: This 3GB archive saves all the html pages from the Piratebay, including comments. No torrent files, as usual.<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:3ab8dd096aea63ddf668a127b81ba7fb6799364d&dn=The+Pirate+Bay+full+siterip+2012&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [https://thepiratebay.se/torrent/7706886/Backup_of_The_Pirate_Bay_%28IDs__3200000_-_7700000%29 IDs 3200000-7699999]: tpb2csv 1.23GB 7z Backup as of 2012-10-06. It includes all comments, filelists, and details in csv format.<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:0dfe31d5d91058bcbe5cfbcf98646700890afea0&dn=Backup+of+The+Pirate+Bay+%28IDs%3A+3200000+-+7700000%29&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.istole.it%3A6969&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [https://thepiratebay.se/torrent/8044295/Backup_of_The_Pirate_Bay_%28IDs__7700000_-_7999999%29 IDs 7700000-7999999]: tpb2csv 69MB 7z Backup as of 2013-01-09. It includes all comments, filelists, and details in csv format.<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:9f9c8cab8b68956a25d6e8c190e5e8dc8cf7186c&dn=Backup+of+The+Pirate+Bay+%28IDs%3A+7700000+-+7999999%29&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.istole.it%3A6969&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
<br />
<br />
* '''Pirate Bay Metadata Git Repo''': https://github.com/tpb-archive tpb2csv scraped metadata including comments, file lists, descriptions, details. link broken, github censorship?<br />
** [https://github.com/tpb-archive/8xxxxxx IDs 8000000-8999999] tpb2csv scraped metadata not included by "Backup" torrents listed above, fetched on 2013-06-02<br />
<br />
at the end of 2014 there were 11000000 ish torrent ID's<br />
<br />
== References ==<br />
<references/><br />
<br />
{{Navigation box}}</div>Fennhttps://wiki.archiveteam.org/index.php?title=The_Pirate_Bay&diff=21125The Pirate Bay2014-12-21T23:19:15Z<p>Fenn: /* Backups */ clarify format of files in torrent</p>
<hr />
<div>{{Infobox project<br />
| title = The Pirate Bay<br />
| logo = ThePirateBay.png<br />
| image = Thepiratebay_homepage_screenshot.png<br />
| description = <br />
| URL = http://www.thepiratebay.org/<br />
| project_status = {{offline}}<br />
| archiving_status = Partially {{saved}}<br />
| irc = yarharfiddlededee<br />
}}<br />
<br />
'''[[The Pirate Bay]]''' is one of the largest and most popular torrent search engines.<br />
<br />
It's still having persistent legal problems. The tracker went down in November 2012, but the site still serves torrents and magnet links. If a torrent is lost, it becomes impossible to connect to other computers distributing the shared files. Considering that there are links to TPB ''all over this wiki'', this site is pretty dang important.<br />
<br />
On December 2014, the website went offline due to an alleged raid<ref>https://torrentfreak.com/swedish-police-raid-the-pirate-bay-site-offline-141209/</ref>.<br />
<br />
==In case of Fire==<br />
<br />
To prevent damage to the Archive Team if The Pirate Bay ever goes down, we should include a Magnet Link next to every TPB link we have.<br />
<br />
=== Archival Methods ===<br />
<br />
We can simply scrape the magnet links, descriptions, and comments. The hard part would probably be keeping it all updated... (Maybe we could use a git repository, and pull as necessary?) <br />
<br />
Magnet links are provided in the Pirate Bay Magnet Archive below, and descriptions and comments are in the siterip.<br />
<br />
=== Archival Tools ===<br />
<br />
* [http://pastebin.com/8RXXthXB Magnet link Dumper]: A perl script that dumps magnet links into a single text file. It was used to make the below magnet archive.<br />
<br />
* [http://github.com/andronikov/tpb2csv tpb2csv]: scrapes the pirate bay website and strips out all the html crap, leaving only pure sweet metadata.<br />
** details.csv: Title, Type, Files, Size, IMDB, Spoken Languages, Texted Languages, Tags, Quality (+), Quality (-), Uploaded, By, User Type, Seeders, Leechers, Info Hash, Picture, Capture Date<br />
** description.txt<br />
** comments.csv: User Type, Username, Date, Text<br />
<br />
== Backups ==<br />
<br />
* [https://archive.org/details/PirateBayComplete20130219 rich.xml.7z] database dump from 2013-02-19<br />
** torrent:urn:sha1:e4b6f847647211b930219492ecf1a9c7bc696d29<br />
** '''Magnet link''': ''magnet:?xt=urn:btih:e4b6f847647211b930219492ecf1a9c7bc696d29<br />
<br />
* [http://thepiratebay.se/torrent/7016365/The_whole_Pirate_Bay_magnet_archive The entire Pirate Bay Magnet Archive]: Every magnet link on the Pirate Bay, all in a tiny little text file. No comments, though.<br />
** '''Magnet link''': ''magnet:?xt=urn:btih:938802790a385c49307f34cca4d30f80b03df59c&dn=The+whole+Pirate+Bay+magnet+archive&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [https://thepiratebay.se/torrent/8156416 Updated (February 2013) Listing]<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:277e1afa0038db7299cd8274310556526599f67c&dn=Small+pirate+bay+archive+%28february+2013%29&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.istole.it%3A6969&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [http://thepiratebay.se/torrent/7046494/Pirate_bay_Magnet_Archive_viewer Magnet Archive Viewer]: Parsing text files can be a pain, so this program makes it easy to search and look at the magnet links.<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:f7a08a62a11ba6dfe39f1cd0b7e8a5a50d5379aa&dn=Pirate+bay+Magnet+Archive+viewer&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [http://thepiratebay.se/torrent/7028505/The_Pirate_Bay_full_siterip_2012 Siterip]: This 3GB archive saves all the html pages from the Piratebay, including comments. No torrent files, as usual.<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:3ab8dd096aea63ddf668a127b81ba7fb6799364d&dn=The+Pirate+Bay+full+siterip+2012&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [https://thepiratebay.se/torrent/7706886/Backup_of_The_Pirate_Bay_%28IDs__3200000_-_7700000%29 IDs 3200000-7699999]: tpb2csvBackup as of 2012-10-06. It includes all comments, filelists, and details in csv format.<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:0dfe31d5d91058bcbe5cfbcf98646700890afea0&dn=Backup+of+The+Pirate+Bay+%28IDs%3A+3200000+-+7700000%29&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.istole.it%3A6969&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [https://thepiratebay.se/torrent/8044295/Backup_of_The_Pirate_Bay_%28IDs__7700000_-_7999999%29 IDs 7700000-7999999]: tpb2csv Backup as of 2013-01-09. It includes all comments, filelists, and details in csv format.<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:9f9c8cab8b68956a25d6e8c190e5e8dc8cf7186c&dn=Backup+of+The+Pirate+Bay+%28IDs%3A+7700000+-+7999999%29&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.istole.it%3A6969&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
<br />
<br />
* '''Pirate Bay Metadata Git Repo''': https://github.com/tpb-archive tpb2csv scraped metadata including comments, file lists, descriptions, details. link broken, github censorship?<br />
** [https://github.com/tpb-archive/8xxxxxx IDs 8000000-8999999] tpb2csv scraped metadata not included by "Backup" torrents listed above, fetched on 2013-06-02<br />
<br />
at the end of 2014 there were 11000000 ish torrent ID's<br />
<br />
== References ==<br />
<references/><br />
<br />
{{Navigation box}}</div>Fennhttps://wiki.archiveteam.org/index.php?title=The_Pirate_Bay&diff=21124The Pirate Bay2014-12-21T23:17:04Z<p>Fenn: /* Archival Tools */ list data types and structure</p>
<hr />
<div>{{Infobox project<br />
| title = The Pirate Bay<br />
| logo = ThePirateBay.png<br />
| image = Thepiratebay_homepage_screenshot.png<br />
| description = <br />
| URL = http://www.thepiratebay.org/<br />
| project_status = {{offline}}<br />
| archiving_status = Partially {{saved}}<br />
| irc = yarharfiddlededee<br />
}}<br />
<br />
'''[[The Pirate Bay]]''' is one of the largest and most popular torrent search engines.<br />
<br />
It's still having persistent legal problems. The tracker went down in November 2012, but the site still serves torrents and magnet links. If a torrent is lost, it becomes impossible to connect to other computers distributing the shared files. Considering that there are links to TPB ''all over this wiki'', this site is pretty dang important.<br />
<br />
On December 2014, the website went offline due to an alleged raid<ref>https://torrentfreak.com/swedish-police-raid-the-pirate-bay-site-offline-141209/</ref>.<br />
<br />
==In case of Fire==<br />
<br />
To prevent damage to the Archive Team if The Pirate Bay ever goes down, we should include a Magnet Link next to every TPB link we have.<br />
<br />
=== Archival Methods ===<br />
<br />
We can simply scrape the magnet links, descriptions, and comments. The hard part would probably be keeping it all updated... (Maybe we could use a git repository, and pull as necessary?) <br />
<br />
Magnet links are provided in the Pirate Bay Magnet Archive below, and descriptions and comments are in the siterip.<br />
<br />
=== Archival Tools ===<br />
<br />
* [http://pastebin.com/8RXXthXB Magnet link Dumper]: A perl script that dumps magnet links into a single text file. It was used to make the below magnet archive.<br />
<br />
* [http://github.com/andronikov/tpb2csv tpb2csv]: scrapes the pirate bay website and strips out all the html crap, leaving only pure sweet metadata.<br />
** details.csv: Title, Type, Files, Size, IMDB, Spoken Languages, Texted Languages, Tags, Quality (+), Quality (-), Uploaded, By, User Type, Seeders, Leechers, Info Hash, Picture, Capture Date<br />
** description.txt<br />
** comments.csv: User Type, Username, Date, Text<br />
<br />
== Backups ==<br />
<br />
* [https://archive.org/details/PirateBayComplete20130219 rich.xml.7z] database dump from 2013-02-19<br />
** torrent:urn:sha1:e4b6f847647211b930219492ecf1a9c7bc696d29<br />
** '''Magnet link''': ''magnet:?xt=urn:btih:e4b6f847647211b930219492ecf1a9c7bc696d29<br />
<br />
* [http://thepiratebay.se/torrent/7016365/The_whole_Pirate_Bay_magnet_archive The entire Pirate Bay Magnet Archive]: Every magnet link on the Pirate Bay, all in a tiny little text file. No comments, though.<br />
** '''Magnet link''': ''magnet:?xt=urn:btih:938802790a385c49307f34cca4d30f80b03df59c&dn=The+whole+Pirate+Bay+magnet+archive&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [https://thepiratebay.se/torrent/8156416 Updated (February 2013) Listing]<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:277e1afa0038db7299cd8274310556526599f67c&dn=Small+pirate+bay+archive+%28february+2013%29&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.istole.it%3A6969&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [http://thepiratebay.se/torrent/7046494/Pirate_bay_Magnet_Archive_viewer Magnet Archive Viewer]: Parsing text files can be a pain, so this program makes it easy to search and look at the magnet links.<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:f7a08a62a11ba6dfe39f1cd0b7e8a5a50d5379aa&dn=Pirate+bay+Magnet+Archive+viewer&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [http://thepiratebay.se/torrent/7028505/The_Pirate_Bay_full_siterip_2012 Siterip]: This 3GB archive saves all the html pages from the Piratebay, including comments. No torrent files, as usual.<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:3ab8dd096aea63ddf668a127b81ba7fb6799364d&dn=The+Pirate+Bay+full+siterip+2012&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [https://thepiratebay.se/torrent/7706886/Backup_of_The_Pirate_Bay_%28IDs__3200000_-_7700000%29 IDs 3200000-7699999]: Backup as of 2012-10-06. It includes all comments, filelists, and details in csv format.<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:0dfe31d5d91058bcbe5cfbcf98646700890afea0&dn=Backup+of+The+Pirate+Bay+%28IDs%3A+3200000+-+7700000%29&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.istole.it%3A6969&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [https://thepiratebay.se/torrent/8044295/Backup_of_The_Pirate_Bay_%28IDs__7700000_-_7999999%29 IDs 7700000-7999999]: Backup as of 2013-01-09. It includes all comments, filelists, and details in csv format.<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:9f9c8cab8b68956a25d6e8c190e5e8dc8cf7186c&dn=Backup+of+The+Pirate+Bay+%28IDs%3A+7700000+-+7999999%29&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.istole.it%3A6969&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
<br />
<br />
* '''Pirate Bay Metadata Git Repo''': https://github.com/tpb-archive tpb2csv scraped metadata including comments, file lists, descriptions, details. link broken, github censorship?<br />
** [https://github.com/tpb-archive/8xxxxxx IDs 8000000-8999999] tpb2csv scraped metadata not included by "Backup" torrents listed above, fetched on 2013-06-02<br />
<br />
at the end of 2014 there were 11000000 ish torrent ID's<br />
<br />
== References ==<br />
<references/><br />
<br />
{{Navigation box}}</div>Fennhttps://wiki.archiveteam.org/index.php?title=The_Pirate_Bay&diff=21123The Pirate Bay2014-12-21T23:08:53Z<p>Fenn: /* Backups */ more metadata sources</p>
<hr />
<div>{{Infobox project<br />
| title = The Pirate Bay<br />
| logo = ThePirateBay.png<br />
| image = Thepiratebay_homepage_screenshot.png<br />
| description = <br />
| URL = http://www.thepiratebay.org/<br />
| project_status = {{offline}}<br />
| archiving_status = Partially {{saved}}<br />
| irc = yarharfiddlededee<br />
}}<br />
<br />
'''[[The Pirate Bay]]''' is one of the largest and most popular torrent search engines.<br />
<br />
It's still having persistent legal problems. The tracker went down in November 2012, but the site still serves torrents and magnet links. If a torrent is lost, it becomes impossible to connect to other computers distributing the shared files. Considering that there are links to TPB ''all over this wiki'', this site is pretty dang important.<br />
<br />
On December 2014, the website went offline due to an alleged raid<ref>https://torrentfreak.com/swedish-police-raid-the-pirate-bay-site-offline-141209/</ref>.<br />
<br />
==In case of Fire==<br />
<br />
To prevent damage to the Archive Team if The Pirate Bay ever goes down, we should include a Magnet Link next to every TPB link we have.<br />
<br />
=== Archival Methods ===<br />
<br />
We can simply scrape the magnet links, descriptions, and comments. The hard part would probably be keeping it all updated... (Maybe we could use a git repository, and pull as necessary?) <br />
<br />
Magnet links are provided in the Pirate Bay Magnet Archive below, and descriptions and comments are in the siterip.<br />
<br />
=== Archival Tools ===<br />
<br />
* [http://pastebin.com/8RXXthXB Magnet link Dumper]: A perl script that dumps magnet links into a single text file. It was used to make the below magnet archive.<br />
<br />
* [http://github.com/andronikov/tpb2csv tpb2csv]: scrapes the pirate bay website and strips out all the html crap, leaving only pure sweet metadata.<br />
<br />
== Backups ==<br />
<br />
* [https://archive.org/details/PirateBayComplete20130219 rich.xml.7z] database dump from 2013-02-19<br />
** torrent:urn:sha1:e4b6f847647211b930219492ecf1a9c7bc696d29<br />
** '''Magnet link''': ''magnet:?xt=urn:btih:e4b6f847647211b930219492ecf1a9c7bc696d29<br />
<br />
* [http://thepiratebay.se/torrent/7016365/The_whole_Pirate_Bay_magnet_archive The entire Pirate Bay Magnet Archive]: Every magnet link on the Pirate Bay, all in a tiny little text file. No comments, though.<br />
** '''Magnet link''': ''magnet:?xt=urn:btih:938802790a385c49307f34cca4d30f80b03df59c&dn=The+whole+Pirate+Bay+magnet+archive&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [https://thepiratebay.se/torrent/8156416 Updated (February 2013) Listing]<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:277e1afa0038db7299cd8274310556526599f67c&dn=Small+pirate+bay+archive+%28february+2013%29&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.istole.it%3A6969&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [http://thepiratebay.se/torrent/7046494/Pirate_bay_Magnet_Archive_viewer Magnet Archive Viewer]: Parsing text files can be a pain, so this program makes it easy to search and look at the magnet links.<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:f7a08a62a11ba6dfe39f1cd0b7e8a5a50d5379aa&dn=Pirate+bay+Magnet+Archive+viewer&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [http://thepiratebay.se/torrent/7028505/The_Pirate_Bay_full_siterip_2012 Siterip]: This 3GB archive saves all the html pages from the Piratebay, including comments. No torrent files, as usual.<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:3ab8dd096aea63ddf668a127b81ba7fb6799364d&dn=The+Pirate+Bay+full+siterip+2012&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [https://thepiratebay.se/torrent/7706886/Backup_of_The_Pirate_Bay_%28IDs__3200000_-_7700000%29 IDs 3200000-7699999]: Backup as of 2012-10-06. It includes all comments, filelists, and details in csv format.<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:0dfe31d5d91058bcbe5cfbcf98646700890afea0&dn=Backup+of+The+Pirate+Bay+%28IDs%3A+3200000+-+7700000%29&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.istole.it%3A6969&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [https://thepiratebay.se/torrent/8044295/Backup_of_The_Pirate_Bay_%28IDs__7700000_-_7999999%29 IDs 7700000-7999999]: Backup as of 2013-01-09. It includes all comments, filelists, and details in csv format.<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:9f9c8cab8b68956a25d6e8c190e5e8dc8cf7186c&dn=Backup+of+The+Pirate+Bay+%28IDs%3A+7700000+-+7999999%29&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.istole.it%3A6969&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
<br />
<br />
* '''Pirate Bay Metadata Git Repo''': https://github.com/tpb-archive tpb2csv scraped metadata including comments, file lists, descriptions, details. link broken, github censorship?<br />
** [https://github.com/tpb-archive/8xxxxxx IDs 8000000-8999999] tpb2csv scraped metadata not included by "Backup" torrents listed above, fetched on 2013-06-02<br />
<br />
at the end of 2014 there were 11000000 ish torrent ID's<br />
<br />
== References ==<br />
<references/><br />
<br />
{{Navigation box}}</div>Fennhttps://wiki.archiveteam.org/index.php?title=The_Pirate_Bay&diff=21122The Pirate Bay2014-12-21T22:34:47Z<p>Fenn: /* Archival Tools */ mention tpb2csv</p>
<hr />
<div>{{Infobox project<br />
| title = The Pirate Bay<br />
| logo = ThePirateBay.png<br />
| image = Thepiratebay_homepage_screenshot.png<br />
| description = <br />
| URL = http://www.thepiratebay.org/<br />
| project_status = {{offline}}<br />
| archiving_status = Partially {{saved}}<br />
| irc = yarharfiddlededee<br />
}}<br />
<br />
'''[[The Pirate Bay]]''' is one of the largest and most popular torrent search engines.<br />
<br />
It's still having persistent legal problems. The tracker went down in November 2012, but the site still serves torrents and magnet links. If a torrent is lost, it becomes impossible to connect to other computers distributing the shared files. Considering that there are links to TPB ''all over this wiki'', this site is pretty dang important.<br />
<br />
On December 2014, the website went offline due to an alleged raid<ref>https://torrentfreak.com/swedish-police-raid-the-pirate-bay-site-offline-141209/</ref>.<br />
<br />
==In case of Fire==<br />
<br />
To prevent damage to the Archive Team if The Pirate Bay ever goes down, we should include a Magnet Link next to every TPB link we have.<br />
<br />
=== Archival Methods ===<br />
<br />
We can simply scrape the magnet links, descriptions, and comments. The hard part would probably be keeping it all updated... (Maybe we could use a git repository, and pull as necessary?) <br />
<br />
Magnet links are provided in the Pirate Bay Magnet Archive below, and descriptions and comments are in the siterip.<br />
<br />
=== Archival Tools ===<br />
<br />
* [http://pastebin.com/8RXXthXB Magnet link Dumper]: A perl script that dumps magnet links into a single text file. It was used to make the below magnet archive.<br />
<br />
* [http://github.com/andronikov/tpb2csv tpb2csv]: scrapes the pirate bay website and strips out all the html crap, leaving only pure sweet metadata.<br />
<br />
== Backups ==<br />
<br />
* [http://thepiratebay.se/torrent/7016365/The_whole_Pirate_Bay_magnet_archive The entire Pirate Bay Magnet Archive]: Every magnet link on the Pirate Bay, all in a tiny little text file. No comments, though.<br />
** '''Magnet link''': ''magnet:?xt=urn:btih:938802790a385c49307f34cca4d30f80b03df59c&dn=The+whole+Pirate+Bay+magnet+archive&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [https://thepiratebay.se/torrent/8156416 Updated (February 2013) Listing]<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:277e1afa0038db7299cd8274310556526599f67c&dn=Small+pirate+bay+archive+%28february+2013%29&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.istole.it%3A6969&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [http://thepiratebay.se/torrent/7046494/Pirate_bay_Magnet_Archive_viewer Magnet Archive Viewer]: Parsing text files can be a pain, so this program makes it easy to search and look at the magnet links.<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:f7a08a62a11ba6dfe39f1cd0b7e8a5a50d5379aa&dn=Pirate+bay+Magnet+Archive+viewer&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [http://thepiratebay.se/torrent/7028505/The_Pirate_Bay_full_siterip_2012 Siterip]: This 3GB archive saves all the html pages from the Piratebay, including comments. No torrent files, as usual.<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:3ab8dd096aea63ddf668a127b81ba7fb6799364d&dn=The+Pirate+Bay+full+siterip+2012&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [https://thepiratebay.se/torrent/7706886/Backup_of_The_Pirate_Bay_%28IDs__3200000_-_7700000%29 IDs 3200000-7699999]: Backup as of 2012-10-06. It includes all comments, filelists, and details in csv format.<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:0dfe31d5d91058bcbe5cfbcf98646700890afea0&dn=Backup+of+The+Pirate+Bay+%28IDs%3A+3200000+-+7700000%29&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.istole.it%3A6969&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* [https://thepiratebay.se/torrent/8044295/Backup_of_The_Pirate_Bay_%28IDs__7700000_-_7999999%29 IDs 7700000-7999999]: Backup as of 2013-01-09. It includes all comments, filelists, and details in csv format.<br />
** '''Magnet Link''': ''magnet:?xt=urn:btih:9f9c8cab8b68956a25d6e8c190e5e8dc8cf7186c&dn=Backup+of+The+Pirate+Bay+%28IDs%3A+7700000+-+7999999%29&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Ftracker.publicbt.com%3A80&tr=udp%3A%2F%2Ftracker.istole.it%3A6969&tr=udp%3A%2F%2Ftracker.ccc.de%3A80''<br />
* '''Pirate Bay Git Repo''': https://github.com/tpb-archive<br />
<br />
<br />
== References ==<br />
<references/><br />
<br />
{{Navigation box}}</div>Fenn