Succinct parallel Lempel-Ziv factorization on a multicore computer

被引:0
|
作者
Han, Ling Bo [1 ]
Lao, Bin [2 ]
Nong, Ge [1 ]
机构
[1] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou, Peoples R China
[2] Guangdong Univ Foreign Studies, Sch Informat Sci & Technol, Guangzhou, Peoples R China
来源
JOURNAL OF SUPERCOMPUTING | 2022年 / 78卷 / 05期
基金
中国国家自然科学基金;
关键词
Lempel-Ziv factorization; Suffix array; Multicore; Pipeline; INDUCED SORTING SUFFIXES; COMPRESSION; ARRAYS;
D O I
10.1007/s11227-021-04165-w
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This article proposes a succinct parallel algorithm, called pLZone, to compute the Lempel-Ziv (LZ77) factorization of a size-n input string over a constant alphabet in O(n) time using approximately a small n-word workspace, where each word occupies inverted right perpendicularlogninverted left perpendicular. bits. pLZone is designed by dividing the computing process of the sequential factorization algorithm LZone into multiple stages that are organized as a pipeline to perform operations in parallel for acceleration, and a checking method is integrated into the pipeline to efficiently verify the output to prevent bugs during implementation. A performance evaluation experiment is conducted by running pLZone and the existing representative algorithms on a set of realistic and artificial datasets. Both the best time and space results are achieved by our proposed algorithm, which suggests that this work could provide a potential solution for efficient LZ77 computation.
引用
收藏
页码:7278 / 7303
页数:26
相关论文
共 50 条
  • [41] Engineering Practical Lempel-Ziv Tries
    Arroyuelo, DIego
    Cánovas, Rodrigo
    Fischer, Johannes
    Köppl, Dominik
    Löbel, Marvin
    Navarro, Gonzalo
    Raman, Rajeev
    [J]. ACM Journal of Experimental Algorithmics, 2021, 26
  • [42] A Comparison of Index-Based Lempel-Ziv LZ77 Factorization Algorithms
    Al-Hafeedh, Anisa
    Crochemore, Maxime
    Ilie, Lucian
    Kopylova, Evguenia
    Smyth, W. F.
    Tischler, German
    Yusufu, Munina
    [J]. ACM COMPUTING SURVEYS, 2012, 45 (01)
  • [43] Lempel-Ziv Factorization in Linear-Time O(1)-Workspace for Constant Alphabets
    Liu, Weijun
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (12): : 2145 - 2153
  • [44] Near-Optimal Quantum Algorithms for Bounded Edit Distance and Lempel-Ziv Factorization
    Gibney, Daniel
    Jin, Ce
    Kociumaka, Tomasz
    Thankachan, Sharma V.
    [J]. PROCEEDINGS OF THE 2024 ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, SODA, 2024, : 3302 - 3332
  • [45] IMPROVED REDUNDANCY OF A VERSION OF THE LEMPEL-ZIV ALGORITHM
    WYNER, AD
    WYNER, AJ
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 1995, 41 (03) : 723 - 731
  • [46] The Lempel-Ziv complexity of fixed points of morphisms
    Constantinescu, Sorin
    Ilie, Lucian
    [J]. MATHEMATICAL FOUNDATIONS OF COMPUTER SCIENCE 2006, PROCEEDINGS, 2006, 4162 : 280 - 291
  • [47] Lempel-Ziv compression of highly structured documents
    Adiego, Joaquin
    Navarro, Gonzalo
    de la Fuente, Pablo
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2007, 58 (04): : 461 - 478
  • [48] Lempel-Ziv index for q-grams
    Karkkainen, J
    Sutinen, E
    [J]. ALGORITHMICA, 1998, 21 (01) : 137 - 154
  • [49] Practical fixed length Lempel-Ziv coding
    Klein, Shmuel T.
    Shapira, Dana
    [J]. DISCRETE APPLIED MATHEMATICS, 2014, 163 : 326 - 333
  • [50] ON THE BIT-COMPLEXITY OF LEMPEL-ZIV COMPRESSION
    Ferragina, Paolo
    Nitto, Igor
    Venturini, Rossano
    [J]. SIAM JOURNAL ON COMPUTING, 2013, 42 (04) : 1521 - 1541