From Archiveteam
Jump to navigation Jump to search

Maybe we should ask the US Government for that Twitter backup? --BlueMaxima 15:33, 24 January 2011 (UTC)

I don't believe they care enough. --ATrescue (talk) 01:50, 30 April 2019 (UTC)

Archiving tweet metadata as well.

Do the current methods of saving tweets (i.e. snscrape (see on GitHub)) also include metadata such as tweet source tags?[1][2]


The snscrape method puts the last 3200 (due to API limits) tweets of the archival target user into a URL list, uploaded to or and feeds it into ArchiveBot using !a < and also archives tweet replies because the URL list contains them as well.
ArchiveBot also saves it into the Wayback Machine, which is great.


Another archival method is chromebot: a or also chromebot: a (without “from:” when also including tweet replies and tweets mentioning the user in the thread).

But chromebot's infinite scroll might not reach as many tweets as snscrape, and even fewer from the targeted account when other tweets are in the search results as well.

In case of controversies, chromebot: a should be used as well.

Tweet Metadata

Because both chromebot and archivebot rely on Twitter's web interface, the amount of tweet metadata captured might be very limited.
Here are several documentations about Twitter's metadata API:

There should be a way to mass-grab this metadata as well. --ATrescue (talk) 01:49, 30 April 2019 (UTC)

snscrape grabs the entire history, not just the last 3200 tweets. But it doesn't include retweets. --JustAnotherArchivist (talk) 01:54, 10 May 2019 (UTC)