Difference between revisions of "Data compression algorithms and tools"

From Archiveteam
Jump to navigation Jump to search
(added lrzip section ~~~~)
Line 1: Line 1:
This list contains the most popular '''data compression algorithms and tools'''. Mos of them are open source, an important detail if you want to preserve data for a long time from now and to be able to decompress the data in the future.
This list contains the most popular '''data compression algorithms and tools'''. All of them are free and open source, an important detail if you want to preserve data for a long time from now and to be able to decompress the data in the future.


== 7z ==
== 7z ==
Line 9: Line 9:
== Gzip ==
== Gzip ==
* http://www.gzip.org/
* http://www.gzip.org/
== LZMA/LZMA2 ==
Aside from 7z, there's also
* [http://tukaani.org/xz/format.html xz]
** Based on [http://7-zip.org/sdk.html LZMA SDK]
** Commonly included by default in Linux distros
* [http://www.nongnu.org/lzip lzip]
** Not as widely used as xz
** Well defined file format and emphasis on file integrity
** lziprecover can correct some bit-flip errors.


== Zip ==
== Zip ==
Line 14: Line 24:


== KGB ==
== KGB ==
* http://kgbarchiver.net/
* http://sourceforge.net/projects/kgbarchiver


The best compression ratio (better than 7z), but a bit slow.
The best compression ratio (better than 7z), but a bit slow.

Revision as of 03:42, 19 September 2014

This list contains the most popular data compression algorithms and tools. All of them are free and open source, an important detail if you want to preserve data for a long time from now and to be able to decompress the data in the future.

7z

Bzip2

Gzip

LZMA/LZMA2

Aside from 7z, there's also

  • xz
    • Based on LZMA SDK
    • Commonly included by default in Linux distros
  • lzip
    • Not as widely used as xz
    • Well defined file format and emphasis on file integrity
    • lziprecover can correct some bit-flip errors.

Zip

  • Available by default in any Windows version available today, but if you need cross-platform, use 7-zip.

KGB

The best compression ratio (better than 7z), but a bit slow.

You can install it in Ubuntu with: sudo apt-get install kgb

How to:

  • kgb -m file.kgb originalfile
  • m is a number from 0 to 9 (lowest compression ratio from higher; higher use 1616 MB of RAM, a lot of CPU and time)

lrzip

"This is a compression program optimised for large files" -lrzip readme

lrzip is fantastic for archiving - the compression ratio improves as the size of the input file grows - albeit a terribly slow compressor. lrzip really shines when compressing large sets of redundant information - but distant, and otherwise unconnected. General purpose compression algorithms would never see this, given their tiny compression window.

lrzip benchmarks

External links