On the bit-complexity of Lempel-Ziv compression

被引:0
|
作者
Ferragina, Paolo [1 ]
Nitto, Igor [1 ]
Venturini, Rossano [1 ]
机构
[1] Univ Pisa, Dipartimento Informat, I-56127 Pisa, Italy
关键词
ALGORITHM;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
One of the most famous and investigated lossless data-compression schemes is the one introduced by Lempel and Ziv about 30 years ago [37]. This compression scheme is known as "dictionary-based compressor" and consists of squeezing an input string by replacing some of its substrings with (shorter) codewords which are actually pointers to a dictionary of phrases built as the string is processed. Surprisingly enough, although many fundamental results are nowadays known about the speed and effectiveness of this compression process (see e.g. [23, 281 and references therein), "we are not aware of any parsing scheme that achieves optimality when the LZ77-dictionary is in use under any constraint on the codewords other than being of equal length" [28, pag. 159]. Here optimality means to achieve the minimum number of bits in compressing each individual input string, without any assumption on its generating source. In this paper we investigate three issues pertaining to the bit-complexity of LZ-based compressors, and we design algorithms which achieve bit-optimality in the compressed output size by taking efficient/optimal time and optimal space. These theoretical results will be sustained by some experiments that will compare our novel LZ-based compressors against the most popular compression tools (like gzip, bzip2) and state-of-the-art compressors (like the booster of [14, 13]).
引用
收藏
页码:768 / 777
页数:10
相关论文
共 50 条
  • [41] Cellular liberality is measurable as Lempel-Ziv complexity of fastq files
    Ogata, Norichika
    Hosaka, Aoi
    2022 IEEE 22ND INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE 2022), 2022, : 321 - 326
  • [42] Lempel-Ziv complexity of the pNNx statistics - an application to neonatal stress
    Sapina, Matej
    Karmakar, Chandan Kumar
    Kramaric, Karolina
    Kosmider, Marcin
    Garcin, Matthieu
    Brdaric, Dario
    Milas, Kresimir
    Yearwood, John
    CHAOS SOLITONS & FRACTALS, 2021, 146
  • [43] Using Lempel-Ziv Complexity to Assess ECG Signal Quality
    Zhang, Yatao
    Wei, Shoushui
    Di Maria, Costanzo
    Liu, Chengyu
    JOURNAL OF MEDICAL AND BIOLOGICAL ENGINEERING, 2016, 36 (05) : 625 - 634
  • [44] Lempel-Ziv Factorization Revisited
    Ohlebusch, Enno
    Gog, Simon
    COMBINATORIAL PATTERN MATCHING, 22ND ANNUAL SYMPOSIUM, CPM 2011, 2011, 6661 : 15 - 26
  • [45] Lempel-Ziv complexity analysis of one dimensional cellular automata
    Estevez-Rams, E.
    Lora-Serrano, R.
    Nunes, C. A. J.
    Aragon-Fernandez, B.
    CHAOS, 2015, 25 (12)
  • [46] Arrhythmic pulses detection using Lempel-Ziv complexity analysis
    Xu, Lisheng
    Zhang, David
    Wang, Kuanquan
    Wang, Lu
    Eurasip Journal on Applied Signal Processing, 2006, 2006 : 1 - 12
  • [47] Arrhythmic Pulses Detection Using Lempel-Ziv Complexity Analysis
    Lisheng Xu
    David Zhang
    Kuanquan Wang
    Lu Wang
    EURASIP Journal on Advances in Signal Processing, 2006
  • [48] Quantifying physiological data with Lempel-Ziv complexity - Certain issues
    Nagarajan, R
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2002, 49 (11) : 1371 - 1373
  • [49] Effects of the series length on Lempel-Ziv Complexity during sleep
    Rivolta, Massimo W.
    Migliorini, Matteo
    Aktaruzzaman, Md
    Sassi, Roberto
    Bianchi, Anna M.
    2014 36TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2014, : 693 - 696
  • [50] Huffman Tree Compression and Lempel-Ziv Coding Using Java
    Nuha, Hilal H.
    Rizky Putra Sailellah, Hassan
    Abo Absa, Ahmed
    2024 International Conference on Decision Aid Sciences and Applications, DASA 2024, 2024,