Difference between revisions of "Internet Archive/Collections"

From Archiveteam
Jump to navigation Jump to search
m (JesseW moved page /Collections to Internet Archive/Collections: typo)
(rm more duplicate)
 
(2 intermediate revisions by the same user not shown)
Line 2: Line 2:


The substantive content of many of these collections is not available for direct public download, although the identifier and file names (and hashes) are.
The substantive content of many of these collections is not available for direct public download, although the identifier and file names (and hashes) are.
<pre>
 
web:
 
        wikicollections:
*{{IA id|web}}:
                wikimediadownloads: 11
**{{IA id|wikicollections}}:
                wikipediaoutlinks: 3
***{{IA id|wikimediadownloads}}: 11
                wikiteam: 2
***{{IA id|wikipediaoutlinks}}: 3
        webwidecrawl:
***{{IA id|wikiteam}}: 2
                widecrawl: 13
**{{IA id|webwidecrawl}}:
                survey_crawl: 8
***{{IA id|widecrawl}}: 13
                shallow_crawl: 5
***{{IA id|survey_crawl}}: 8
                EndofTermWebCrawls: 5
***{{IA id|shallow_crawl}}: 5
                newscrawl: 4
***{{IA id|EndofTermWebCrawls}}: 5
                crawldata: 2
***{{IA id|newscrawl}}: 4
                youtubecrawl: 1
***{{IA id|crawldata}}: 2
        focused_crawls:
***{{IA id|youtubecrawl}}: 1
                partner_crawls:
**{{IA id|focused_crawls}}:
                        ait-collection-id-4399: 42
***{{IA id|partner_crawls}}:
                        ait-collection-id-4259: 23
****{{IA id|ait-collection-id-4399}}: 42
                        ait-collection-id-3499: 21
****{{IA id|ait-collection-id-4259}}: 23
                        ait-collection-id-1193: 21
****{{IA id|ait-collection-id-3499}}: 21
                        ait-collection-id-3336: 18
****{{IA id|ait-collection-id-1193}}: 21
                        ait-collection-id-4692: 14
****{{IA id|ait-collection-id-3336}}: 18
                        ait-collection-id-1000000015: 5
****{{IA id|ait-collection-id-4692}}: 14
                        ait-collection-id-4722: 3
****{{IA id|ait-collection-id-1000000015}}: 5
                        ait-collection-id-4363: 3
****{{IA id|ait-collection-id-4722}}: 3
                        ait-collection-id-2968: 3
****{{IA id|ait-collection-id-4363}}: 3
                        ait-collection-id-4730: 2
****{{IA id|ait-collection-id-2968}}: 3
                        ait-collection-id-4725: 2
****{{IA id|ait-collection-id-4730}}: 2
                        ait-collection-id-4721: 2
****{{IA id|ait-collection-id-4725}}: 2
                        ait-collection-id-261: 2
****{{IA id|ait-collection-id-4721}}: 2
                        ait-collection-id-1986: 2
****{{IA id|ait-collection-id-261}}: 2
                        ait-collection-id-1894: 2
****{{IA id|ait-collection-id-1986}}: 2
                        ait-collection-id-4718: 1
****{{IA id|ait-collection-id-1894}}: 2
                        ait-collection-id-4708: 1
****{{IA id|ait-collection-id-4718}}: 1
                        ait-collection-id-4644: 1
****{{IA id|ait-collection-id-4708}}: 1
                        ait-collection-id-3784: 1
****{{IA id|ait-collection-id-4644}}: 1
                        ait-collection-id-3697: 1
****{{IA id|ait-collection-id-3784}}: 1
                        ait-collection-id-3373: 1
****{{IA id|ait-collection-id-3697}}: 1
                        ait-collection-id-2176: 1
****{{IA id|ait-collection-id-3373}}: 1
                        ait-collection-id-2114: 1
****{{IA id|ait-collection-id-2176}}: 1
                        ait-collection-id-1996: 1
****{{IA id|ait-collection-id-2114}}: 1
                top_domains: 308
****{{IA id|ait-collection-id-1996}}: 1
                top_news: 302
***{{IA id|top_domains}}: 308
                ait-collection-id-4399: 42
***{{IA id|top_news}}: 302
                ait-collection-id-4259: 23
***ait-collection-id-4399 (repeat, see above)
                ait-collection-id-3499: 21
*** ''repeats from partner_crawls elided''
                ait-collection-id-1193: 21
***{{IA id|personal_archives}}: 1
                ait-collection-id-3336: 18
**{{IA id|customcrawlservices}}:
                ait-collection-id-4692: 14
***{{IA id|nlaweb}}: 10
                ait-collection-id-1000000015: 5
***{{IA id|naraweb}}: 8
                ait-collection-id-4722: 3
***{{IA id|nl_spain}}: 7
                ait-collection-id-4363: 3
***{{IA id|bnf_french_domain}}: 5
                ait-collection-id-2968: 3
***{{IA id|swiss_nl}}: 3
                ait-collection-id-4730: 2
***{{IA id|olympicsweb}}: 3
                ait-collection-id-4725: 2
***{{IA id|nlnzweb}}: 3
                ait-collection-id-4721: 2
***{{IA id|nliweb}}: 2
                ait-collection-id-261: 2
***{{IA id|nl_israel}}: 2
                ait-collection-id-1986: 2
***{{IA id|nl_sweden}}: 1
                ait-collection-id-1894: 2
***{{IA id|fedsiteclosurecrawls}}: 1
                personal_archives: 1
***{{IA id|electionsweb}}: 1
                ait-collection-id-4718: 1
**{{IA id|archiveteam}}:
                ait-collection-id-4708: 1
***{{IA id|archiveteam-fire}}: 1
                ait-collection-id-4644: 1
**{{IA id|alexacrawls}}:
                ait-collection-id-3784: 1
***{{IA id|alexa_2006}}: 13
                ait-collection-id-3697: 1
***{{IA id|alexa_2007}}: 11
                ait-collection-id-3373: 1
***{{IA id|amazoncrawl}}: 5
                ait-collection-id-2176: 1
***{{IA id|alexa_2005}}: 4
                ait-collection-id-2114: 1
***{{IA id|alexa_1999}}: 2
                ait-collection-id-1996: 1
***{{IA id|alexa_2008}}: 1
        customcrawlservices:
** partner_crawls (repeat, see above):
                nlaweb: 10
*** ''repeated contents elided''
                naraweb: 8
**{{IA id|archiveitdigitalcollection}}:
                nl_spain: 7
***{{IA id|ArchiveIt-Partner-160}}: 325
                bnf_french_domain: 5
***{{IA id|ArchiveIt-Partner-449}}: 190
                swiss_nl: 3
***{{IA id|ArchiveIt-Partner-132}}: 108
                olympicsweb: 3
***{{IA id|ArchiveIt-Partner-682}}: 65
                nlnzweb: 3
***{{IA id|ArchiveIt-Partner-593}}: 62
                nliweb: 2
***{{IA id|ArchiveIt-Partner-156}}: 62
                nl_israel: 2
***{{IA id|ArchiveIt-Partner-336}}: 49
                nl_sweden: 1
***{{IA id|ArchiveIt-Partner-197}}: 49
                fedsiteclosurecrawls: 1
***{{IA id|ArchiveIt-Partner-121}}: 47
                electionsweb: 1
*
        archiveteam:
                archiveteam-fire: 1
        alexacrawls:
                alexa_2006: 13
                alexa_2007: 11
                amazoncrawl: 5
                alexa_2005: 4
                alexa_1999: 2
                alexa_2008: 1
        partner_crawls:
                ait-collection-id-4399: 42
                ait-collection-id-4259: 23
                ait-collection-id-3499: 21
                ait-collection-id-1193: 21
                ait-collection-id-3336: 18
                ait-collection-id-4692: 14
                ait-collection-id-1000000015: 5
                ait-collection-id-4722: 3
                ait-collection-id-4363: 3
                ait-collection-id-2968: 3
                ait-collection-id-4730: 2
                ait-collection-id-4725: 2
                ait-collection-id-4721: 2
                ait-collection-id-261: 2
                ait-collection-id-1986: 2
                ait-collection-id-1894: 2
                ait-collection-id-4718: 1
                ait-collection-id-4708: 1
                ait-collection-id-4644: 1
                ait-collection-id-3784: 1
                ait-collection-id-3697: 1
                ait-collection-id-3373: 1
                ait-collection-id-2176: 1
                ait-collection-id-2114: 1
                ait-collection-id-1996: 1
        archiveitdigitalcollection:
                ArchiveIt-Partner-160: 325
                ArchiveIt-Partner-449: 190
                ArchiveIt-Partner-132: 108
                ArchiveIt-Partner-682: 65
                ArchiveIt-Partner-593: 62
                ArchiveIt-Partner-156: 62
                ArchiveIt-Partner-336: 49
                ArchiveIt-Partner-197: 49
                ArchiveIt-Partner-121: 47
                ArchiveIt-Partner-153: 46
                ArchiveIt-Partner-421: 40
                ArchiveIt-Partner-518: 39
                ArchiveIt-Partner-543: 34
                ArchiveIt-Partner-62: 31
                ArchiveIt-Partner-498: 30
                ArchiveIt-Partner-66: 29
                ArchiveIt-Partner-329: 24
                ArchiveIt-Partner-567: 22
                ArchiveIt-Partner-89: 21
                ArchiveIt-Partner-152: 21
                ArchiveIt-Partner-318: 20
                ArchiveIt-Partner-316: 20
                ArchiveIt-Partner-715: 19
                ArchiveIt-Partner-135: 19
                ArchiveIt-Partner-130: 19
                AIT-AlabamaStateArchives: 19
                ArchiveIt-Partner-388: 17
                ArchiveIt-Partner-485: 16
                ArchiveIt-Partner-413: 16
                ArchiveIt-Partner-401: 16
                ArchiveIt-Partner-380: 15
                ArchiveIt-Partner-772: 14
                ArchiveIt-Partner-745: 14
                ArchiveIt-Partner-583: 14
                ArchiveIt-Partner-86: 13
                ArchiveIt-Partner-687: 13
                ArchiveIt-Partner-693: 12
                ArchiveIt-Partner-671: 12
                ArchiveIt-Partner-564: 12
                ArchiveIt-Partner-497: 12
                ArchiveIt-Partner-492: 12
                ArchiveIt-Partner-411: 12
                ArchiveIt-Partner-351: 12
                ArchiveIt-Partner-143: 12
                ArchiveIt-Partner-528: 11
                ArchiveIt-Partner-478: 11
                ArchiveIt-Partner-379: 11
                ArchiveIt-Partner-369: 11
                ArchiveIt-Partner-159: 11
                ArchiveIt-Partner-858: 10
                ArchiveIt-Partner-691: 10
                ArchiveIt-Partner-611: 10
                ArchiveIt-Partner-484: 10
                ArchiveIt-Partner-350: 10
                ArchiveIt

Latest revision as of 05:00, 11 April 2016

As of the Jan 2016 recheck of the identifiers included in the original (March 2015) IA census, these are all the collections that contain other collections. The numbers after them are the number of leaf collections in each one.

The substantive content of many of these collections is not available for direct public download, although the identifier and file names (and hashes) are.