Faster Compact On-Line Lempel-Ziv Factorization

被引:6
|
作者
Yamamoto, Jun'ichi [1 ]
Tomohiro, I [1 ]
Bannai, Hideo [1 ]
Inenaga, Shunsuke [1 ]
Takeda, Masayuki [1 ]
机构
[1] Kyushu Univ, Dept Informat, Nishi Ku, Fukuoka, Japan
关键词
Lempel-Ziv Factorization; String Index; ALGORITHM;
D O I
10.4230/LIPIcs.STACS.2014.675
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a new on-line algorithm for computing the Lempel-Ziv factorization of a string that runs in O(N log N) time and uses only O(N log sigma) bits of working space, where N is the length of the string and sigma is the size of the alphabet. This is a notable improvement compared to the performance of previous on-line algorithms using the same order of working space but running in either O(N log(3) N) time (Okanohara & Sadakane 2009) or O(N log(2) N) time (Starikovskaya 2012). The key to our new algorithm is in the utilization of an elegant but less popular index structure called Directed Acyclic Word Graphs, or DAWGs (Blumer et al. 1985). We also present an opportunistic variant of our algorithm, which, given the run length encoding of size m of a string of length N, computes the Lempel-Ziv factorization of the string on-line, in O (m . min {(log log m)(log log N)/log log log N, root lon m/log log m}) time and O(m log N) bits of space.
引用
收藏
页码:675 / 686
页数:12
相关论文
共 50 条
  • [21] Lempel-Ziv Factorization May Be Harder Than Computing All Runs
    Kosolobov, Dmitry
    [J]. 32ND INTERNATIONAL SYMPOSIUM ON THEORETICAL ASPECTS OF COMPUTER SCIENCE (STACS 2015), 2015, 30 : 582 - 593
  • [22] Application of Lempel-Ziv factorization to the approximation of grammar-based compression
    Rytter, W
    [J]. THEORETICAL COMPUTER SCIENCE, 2003, 302 (1-3) : 211 - 222
  • [23] On Lempel-Ziv complexity of sequences
    Doganaksoy, Ali
    Gologlu, Faruk
    [J]. SEQUENCES AND THEIR APPLICATIONS - SETA 2006, 2006, 4086 : 180 - 189
  • [24] Pushdown and Lempel-Ziv depth
    Jordon, Liam
    Moser, Philippe
    [J]. INFORMATION AND COMPUTATION, 2023, 292
  • [25] Generalized Lempel-Ziv compression for audio
    Kirovski, Darko
    Landau, Zeph
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (02): : 509 - 518
  • [26] On the Size of Lempel-Ziv and Lyndon Factorizations
    Karkkainen, Juha
    Kempa, Dominik
    Nakashima, Yuto
    Puglisi, Simon J.
    Shur, Arseny M.
    [J]. 34TH SYMPOSIUM ON THEORETICAL ASPECTS OF COMPUTER SCIENCE (STACS 2017), 2017, 66
  • [27] On the Approximation Ratio of Lempel-Ziv Parsing
    Gagie, Travis
    Navarro, Gonzalo
    Prezza, Nicola
    [J]. LATIN 2018: THEORETICAL INFORMATICS, 2018, 10807 : 490 - 503
  • [28] Lempel-Ziv Decoding in External Memory
    Belazzougui, Djamal
    Karkkainen, Juha
    Kempa, Dominik
    Puglisi, Simon J.
    [J]. EXPERIMENTAL ALGORITHMS, SEA 2016, 2016, 9685 : 63 - 74
  • [29] Lempel-Ziv coding in reinforcement learning
    Iwata, K
    Ishii, N
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2002, 2002, 2412 : 531 - 537
  • [30] Lempel-Ziv Parsing for Sequences of Blocks
    Kosolobov, Dmitry
    Valenzuela, Daniel
    [J]. ALGORITHMS, 2021, 14 (12)