Facebook Logo | |
URL | https://www.facebook.com/ |
Status | Online! |
Archiving status | Not saved yet |
Archiving type | Unknown |
IRC channel | #archiveteam-bs (on hackint) |
Facebook is a social networking site whose popularity has exploded in recent years. As of January 2014, there are more than 1.2 billion active users of the site.[1] Facebook hosts untold billions of users' photos, videos, thoughts, conversations, and other content. The total size of Facebook's databases is easily in the hundreds of petabytes.[2][3]
The judicious user will have a well-designed backup plan for all that content that they retain full control over, but it is a reasonable assumption that the majority of users rely totally on Facebook to safeguard their data. This is a mistake.
It might seem completely unthinkable that a site as massive and as popular as Facebook could ever disappear, taking your data with it. The reality is that websites, even hugely popular ones, can decline in popularity over time and eventually go away, taking your data with them with little or no warning. We've seen it happen.
While Facebook may not be in any immediate danger, you should consider that the data you put on Facebook may be immensely important to you in 10 or 20 years, similar to your family's photo albums. Facebook could be long dead by then. Start planning for this eventuality right now.
Download Your Data From Facebook
Facebook has created a tool to download an incomplete archive of your Facebook account. This includes your own photos and videos, chat conversations, status updates and wall posts. Please bear in mind that the archive is INCOMPLETE. It does NOT include any content you posted on any Facebook group. It does NOT include any comments from your wall posts. It also does NOT include photos and videos belonging to other people even if you are tagged in them.
To create your archive, click the little down arrow next to your name in the upper right area of the page, click on "Settings", then click on "Your Facebook Information", then click on "Download Your Information".
The next screen will explain what's going on. Press "Create File" and you will be presented with a popup telling you that this will take some time - around one hour is not unheard of. This may indeed take several minutes. In the mean time you can continue using Facebook as usual. They will email you when the archive is ready for download.
Your email will contain a link to download your archive. Follow that link and enter your Facebook password to continue. The next page presents you with a download button and an estimate of the archive's size. Download that somewhere convenient for you. This file contains highly personal and potentially sensitive information so keep it safe!. The easiest way to browse the information is to extract the contents of the zip file, and then open the index.html file with your browser of choice. From there you can look at your profile, your wall posts, photos and videos, and private messages.
Note that as of April, 2012, this download tool seems to have some bugs -- in my tests it failed to completely back up all of my conversations, for example. It's better than nothing but for now at least I don't trust that it is perfect.
Unofficial Backup tools
- friendstogmail exports your contacts to CSV files, ostensibly for import to Gmail. Claims to work only in-browser. Retrieves your friends' birthdays, locations, bios, work history, and hometowns.
- Facedown downloads photo albums from Facebook. Official distribution of the software has stopped.
- Give Me My Data (gmmd) was a Facebook application that helped users export their data out of Facebook for reuse in visualizations, archives, or any possible method of digital storytelling. Data could be exported in common formats like CSV, XML, and JSON as well as customized network graph formats.
- Photograbber was an open-source photo album downloader, it was capable of retrieving ragged photos, albums, tags, comments and likes. It could also download the albums of your friends.
Public pages and groups
While Facebook contains personal profiles and Facebook versions of many blogs/news sites, it also contains blogs that only exist on Facebook. It could be nice to archive these pages, which are the most endangered:
- Facebook is known to have banned totally reasonable pages in the past. They usually come back with a new profile, but the old content is gone.
- Admins could delete pages and groups.
- Pages can be bought and repurposed for commercial/spammy purposes.
- Archival of pages and groups can be difficult (especially for general-purpose archivers), because Facebook is based on a (quite messy) implementation of AJAX. APIs are tightly locked and it is no longer possible to use them for interacting with/reading posts from groups.[4]
- Alternatively, one could disable JavaScript and download web pages from the mobile website, m.facebook.com. Archival of pages and groups using this method is much easier.
Proposal
We could start archiving the most popular public pages that 1) are not backed by an external website and 2) are not simply used for boring promotional purposes (eg. pages belonging to brands).
- TODO: We need to build a list of the most important pages matching the above criteria.
- Question: How do you define "important"? Are there criteria from any other archive project that could be re-used?
- Most popular pages:
- Could start with List of most-followed Facebook pages from wikipedia.
- Reportedly Largest Facebook groups
- More Large Facebook groups
Another proposal
- In software engineering terms, the above proposal is a "waterfall". It will never get started because an agreement about which pages are important will never be reached.
- Instead an "end-to-end" archive of a single Facebook page/group needs to be achieved.
- Start by archiving just one test group, including all posts, all comments, all images, etc.
- The archive should be browseable, or at least a viewer needs to exist, otherwise it will be difficult to verify its success.
- After one good quality archive has been completed, and the bugs have been ironed out, then allow the public to submit requests for pages/groups to be archived, similar to the current process for other general websites.
Portability
In 2020 it will become possible to export one's own Facebook photos to Google Photos.[5] In December 2019 the feature was made available in Ireland.[6] The move comes after the GDPR requirements for data portability and may be extended beyond EU and to further target services.
Facebook ostensibly participates in the Data transfer project by the GAFA + Twitter.
Vital Signs
Currently stable.
References
- ↑ http://thenextweb.com/facebook/2014/01/29/facebook-passes-1-23-billion-monthly-active-users-945-million-mobile-users-757-million-daily-users/
- ↑ https://www.facebook.com/notes/facebook-engineering/presto-interacting-with-petabytes-of-data-at-facebook/10151786197628920
- ↑ https://research.facebook.com/blog/1522692927972019/facebook-s-top-open-data-problems/
- ↑ http://stackoverflow.com/questions/31490435/facebook-api-2-4-user-group-permission
- ↑ https://uk.reuters.com/article/us-facebook-photos/facebook-to-allow-transfer-of-photos-videos-to-google-photos-idUKKBN1Y61NE?il=0
- ↑ https://engineering.fb.com/security/data-transfer-project/