Search results

Jump to navigation Jump to search
  • ...finalized in May 2011, underscor provided a server to upload the 40 GB of compressed data files that had been collected. The upload finished on May 31st, and un Previously all data was stored in sorted and unsorted text files, often compressed to save space. Using the name tinyarchive, soultcer created some tools to m
    12 KB (1,942 words) - 23:04, 4 December 2017
  • ...n URLs across the various domains were grabbed, resulting in 102.6 GiB of (compressed) WARCs.
    6 KB (1,017 words) - 23:41, 29 December 2023
  • ...an be exported with the "Google Takeout" interface which sends a series of compressed archives with data from the various services. It's not always reliable.<ref
    11 KB (1,629 words) - 06:01, 25 November 2023
  • ...ch record is compressed via gzip. A gzip file supports multiple "members"; compressed warcs end in .warc.gz. According to the guidelines, WARC files should top o
    18 KB (2,481 words) - 01:00, 24 March 2024
  • ...ML text, but doesn't help at all when downloading material that is already compressed, like JPEG or PNG files. To enable compression, use:
    7 KB (1,114 words) - 16:27, 17 January 2017
  • --compressed</pre> --compressed</pre>
    56 KB (7,692 words) - 20:06, 31 January 2024
  • ...| bigint(20) | NO | | NULL | | (length of the (compressed) individual record)
    13 KB (1,827 words) - 16:45, 14 November 2021
  • ...13_common_crawl_index_urls Common Crawl index] is a very big (21 gigabytes compressed) list of URLs in the Common Crawl corpus. Grepping this list may well revea
    9 KB (1,436 words) - 02:35, 18 September 2023
  • | 2,200,001 || 2,300,000 || '''Uploaded''' || 50gb compressed || Darkstar | 2,300,001 || 2,400,000 || '''Uploaded''' || 70gb compressed || Darkstar
    54 KB (6,859 words) - 16:44, 14 November 2021
  • The file is a tar archive compressed with [http://tukaani.org/xz/ `xz(1)`] from 674MB to 39MB. It contains the c
    12 KB (1,788 words) - 20:15, 14 March 2021
  • ! Archive Name !! Archive Type !! Size (Compressed) !! Size (Uncompressed) !! # of Profiles !! Volunteer
    10 KB (1,143 words) - 01:09, 15 November 2021
  • ...9/8c0e7aae4607412f82bf4a7a4486fe36/fat.jpg~tplv-banciyuan-obj.image is the compressed version of <!-- Referer ACL is enabled on img5, so don't make it a hyperlin
    20 KB (2,990 words) - 18:35, 3 May 2024
  • !Size (GB) (compressed size)
    11 KB (1,798 words) - 05:10, 1 April 2011
  • project pages and random other files wget got. Size: 400 mb compressed.
    14 KB (2,057 words) - 01:47, 11 November 2018
  • ...cly available Reddit comment for research. ~ 1.7 billion comments @ 250 GB compressed. Any interest in this?]
    18 KB (2,818 words) - 01:27, 30 April 2024
  • ...es, with the largest ones being a few tens (less than 100) megabytes (WARC compressed). Note that this is a rough estimate with a small sample. (That would mean
    22 KB (3,273 words) - 00:34, 5 December 2017
  • ...in our torrents too, just in a different format (we use pipe-delimited, xz-compressed files while 301works uses comma-delimited uncompressed files). | divided up into 3,835 files in the last old-style dump, totaling 39 GB (compressed!). Also worked on as a Warrior job, see below.
    82 KB (13,464 words) - 10:37, 1 May 2024
  • ...the original video files in (semi-)offline storage, and store transcoded (compressed) versions on the Internet Archive.
    32 KB (4,950 words) - 22:40, 30 October 2023
  • ...it. So we put ourselves up on The Pirate Bay, we have a 641GB - because it compressed well - torrent, with 7,854 files that were basically 7zs, and we put that s
    41 KB (7,606 words) - 02:37, 12 December 2017
  • ...tation archive is available at {{IA collection|youtubeannotations}}, and a compressed copy can be found at {{IA item|youtubeannotations.tar.zstd}}. 16GB of just
    53 KB (7,713 words) - 20:37, 4 May 2024

View (previous 20 | next 20) (20 | 50 | 100 | 250 | 500)