Lempel-Ziv Factorization Using Less Time & Space

被引:22
|
作者
Chen, Gang [1 ]
Puglisi, Simon J. [2 ]
Smyth, W. F. [3 ]
机构
[1] McMaster Univ, Dept Comp & Software, Hamilton, ON L8S 4K1, Canada
[2] RMIT Univ, Sch Comp Sci & Informat Technol, Melbourne, Vic 3001, Australia
[3] Curtin Univ Technol, Digital Ecosyst & Business Intelligence Inst, Perth, WA 6845, Australia
关键词
Lempel-Ziv factorization; suffix array; suffix tree; LZ factorization;
D O I
10.1007/s11786-007-0024-4
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
For 30 years the Lempel-Ziv factorization LZx of a string x = x[ 1.. n] has been a fundamental data structure of string processing, especially valuable for string compression and for computing all the repetitions (runs) in x. Traditionally the standard method for computing LZx was based on Theta(n)-time (or, depending on the measure used, O(n log n)-time) processing of the suffix tree STx of x. Recently Abouelhoda et al. proposed an efficient Lempel-Ziv factorization algorithm based on an "enhanced" suffix array that is, a suffix array SAx together with supporting data structures, principally an "interval tree". In this paper we introduce a collection of fast spaceefficient algorithms for LZ factorization, also based on suffix arrays, that in theory as well as in many practical circumstances are superior to those previously proposed; one family out of this collection achieves true T(n)-time alphabet-independent processing in the worst case by avoiding tree structures altogether.
引用
收藏
页码:605 / 623
页数:19
相关论文
共 50 条
  • [31] Generalized Lempel-Ziv compression for audio
    Kirovski, Darko
    Landau, Zeph
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (02): : 509 - 518
  • [32] On the Size of Lempel-Ziv and Lyndon Factorizations
    Karkkainen, Juha
    Kempa, Dominik
    Nakashima, Yuto
    Puglisi, Simon J.
    Shur, Arseny M.
    [J]. 34TH SYMPOSIUM ON THEORETICAL ASPECTS OF COMPUTER SCIENCE (STACS 2017), 2017, 66
  • [33] On the Approximation Ratio of Lempel-Ziv Parsing
    Gagie, Travis
    Navarro, Gonzalo
    Prezza, Nicola
    [J]. LATIN 2018: THEORETICAL INFORMATICS, 2018, 10807 : 490 - 503
  • [34] Lempel-Ziv Decoding in External Memory
    Belazzougui, Djamal
    Karkkainen, Juha
    Kempa, Dominik
    Puglisi, Simon J.
    [J]. EXPERIMENTAL ALGORITHMS, SEA 2016, 2016, 9685 : 63 - 74
  • [35] Lempel-Ziv coding in reinforcement learning
    Iwata, K
    Ishii, N
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2002, 2002, 2412 : 531 - 537
  • [36] Lempel-Ziv Parsing for Sequences of Blocks
    Kosolobov, Dmitry
    Valenzuela, Daniel
    [J]. ALGORITHMS, 2021, 14 (12)
  • [37] Lempel-Ziv Parsing in External Memory
    Karkkainen, Juha
    Kempa, Dominik
    Puglisi, Simon J.
    [J]. 2014 DATA COMPRESSION CONFERENCE (DCC 2014), 2014, : 153 - 162
  • [38] Lempel-Ziv Complexity of Photonic Quasicrystals
    Monzon, Juan J.
    Felipe, Angel
    Sanchez-Soto, Luis L.
    [J]. CRYSTALS, 2017, 7 (07):
  • [39] Generalized Lempel-Ziv compression for audio
    Kirovski, D
    Landau, Z
    [J]. 2004 IEEE 6TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2004, : 127 - 130
  • [40] Faster Lightweight Lempel-Ziv Parsing
    Kosolobov, Dmitry
    [J]. MATHEMATICAL FOUNDATIONS OF COMPUTER SCIENCE 2015, PT II, 2015, 9235 : 432 - 444