Difference between revisions of "User:OrIdow6/Sandbox"
Line 1: | Line 1: | ||
=== Yahoo Groups === | === Yahoo Groups === | ||
Forum/mailing list that shut down in a process spanning late 2019 to early 2020. | Forum/mailing list that shut down in a process spanning late 2019 to early 2020. | ||
=== 2018 API grab === | === 2018 API grab === | ||
Scrape of the Yahoo Groups API. Led or done by PurpleSym. There is one WARC file per group, and several group-WARCs per IA item. The items are located in [https://archive.org/details/archiveteam_yahoogroups the IA collection archiveteam_yahoogroups], and are distinguishable from everything else in that collection by their upload date in late 2018, and by their thumbnails being a photo of a "Yahoo!" sign (with the exception of [https://archive.org/details/yahoogroup_info_grab something else from about 2 years later]). Although they are in WARC format, they seem to use the resource record-type with synthetic URIs, meaning they will not work in the Wayback Machine. | |||
=== Doranwen's metadata upload === | |||
An IA item located [https://archive.org/details/Yahoo_Groups_Metadata here] created by Doranwen. Contains tables of group metadata, parsed from the [[#2018 API grab|2018 API grab]] | |||
Latest revision as of 05:03, 27 September 2022
Yahoo Groups
Forum/mailing list that shut down in a process spanning late 2019 to early 2020.
2018 API grab
Scrape of the Yahoo Groups API. Led or done by PurpleSym. There is one WARC file per group, and several group-WARCs per IA item. The items are located in the IA collection archiveteam_yahoogroups, and are distinguishable from everything else in that collection by their upload date in late 2018, and by their thumbnails being a photo of a "Yahoo!" sign (with the exception of something else from about 2 years later). Although they are in WARC format, they seem to use the resource record-type with synthetic URIs, meaning they will not work in the Wayback Machine.
Doranwen's metadata upload
An IA item located here created by Doranwen. Contains tables of group metadata, parsed from the 2018 API grab