Parallel Lightweight Wavelet Tree, Suffix Array and FM-Index Construction

Cited by: 5
Authors
Labeit, Julian [1 ]
Shun, Julian [2 ]
Blelloch, Guy E. [3 ]
Affiliations
[1] Karlsruhe Inst Technol, D-76021 Karlsruhe, Germany
[2] Univ Calif Berkeley, Berkeley, CA 94720 USA
[3] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
DOI
10.1109/DCC.2016.117
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
We present parallel lightweight algorithms to construct wavelet trees, rank and select structures, and suffix arrays in a shared-memory setting. The work and depth of our parallel wavelet tree algorithm match those of the best existing algorithm while requiring asymptotically less memory. Our experiments show that it is both faster and more memory-efficient than existing parallel algorithms. We also present an experimental evaluation of the parallel construction of rank and select structures, which are used in wavelet trees. Next, we design the first parallel suffix array algorithm based on induced copying. The induced copying requires linear work and polylogarithmic depth for constant alphabets. When combined with a parallel prefix-doubling algorithm, it is more efficient in practice, in terms of both running time and memory usage, than existing parallel implementations. As an application, we combine our algorithms to build the FM-index in parallel.
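For readers unfamiliar with how the pieces named in the abstract fit together, the following is a minimal sequential sketch of the pipeline: suffix array, then Burrows-Wheeler transform, then an FM-index queried by backward search. It is not the authors' parallel algorithm. The suffix array is built by naive sorting rather than induced copying or prefix doubling, a plain occurrence table stands in for the wavelet tree's rank structure, and all function names and the "$" sentinel are assumptions chosen for illustration.

```python
def suffix_array(text):
    # Naive construction by sorting all suffixes; for illustration only,
    # not the induced-copying or prefix-doubling methods from the paper.
    return sorted(range(len(text)), key=lambda i: text[i:])


def bwt_from_sa(text, sa):
    # BWT[i] is the character that cyclically precedes the i-th smallest suffix.
    return "".join(text[i - 1] for i in sa)


def build_fm_index(text, sentinel="$"):
    # Assumption: the sentinel does not occur in the text and sorts before
    # every other character (true for "$" versus ASCII letters).
    assert sentinel not in text
    text += sentinel
    sa = suffix_array(text)
    bwt = bwt_from_sa(text, sa)

    # C[c] = number of characters in the text strictly smaller than c.
    counts = {}
    for ch in text:
        counts[ch] = counts.get(ch, 0) + 1
    C, total = {}, 0
    for ch in sorted(counts):
        C[ch] = total
        total += counts[ch]

    # occ[c][i] = occurrences of c in bwt[:i]. A wavelet tree with rank
    # support would answer this query in far less space.
    occ = {ch: [0] * (len(bwt) + 1) for ch in counts}
    for i, b in enumerate(bwt):
        for ch in occ:
            occ[ch][i + 1] = occ[ch][i] + (1 if ch == b else 0)
    return sa, bwt, C, occ


def count_occurrences(pattern, C, occ, n):
    # Backward search: narrow the suffix-array interval [lo, hi) one
    # pattern character at a time, from the last character to the first.
    lo, hi = 0, n
    for ch in reversed(pattern):
        if ch not in C:
            return 0
        lo = C[ch] + occ[ch][lo]
        hi = C[ch] + occ[ch][hi]
        if lo >= hi:
            return 0
    return hi - lo


sa, bwt, C, occ = build_fm_index("banana")
print(bwt)                                      # annb$aa
print(count_occurrences("ana", C, occ, len(bwt)))  # 2
```

The occurrence table here costs space proportional to the alphabet size times the text length, which is exactly the overhead that wavelet trees with rank and select support are meant to avoid.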
Pages: 33-42
Page count: 10
Related papers
44 in total
  • [1] A Qualitative Performance Comparison and Analysis of Suffix Array, FM-index and Compressed Suffix Array
    Wu, Jichuan
    Mao, Xin
    Lu, Songfeng
    [J]. 2012 INTERNATIONAL CONFERENCE ON FUTURE INFORMATION TECHNOLOGY AND MANAGEMENT SCIENCE & ENGINEERING (FITMSE 2012), 2012, 14 : 348 - 352
  • [2] Computation of the suffix array, Burrows-Wheeler transform and FM-index in V-order
    Daykin, Jacqueline W.
    Mhaskar, Neerja
    Smyth, W. F.
    [J]. THEORETICAL COMPUTER SCIENCE, 2021, 880 : 82 - 96
  • [3] Lightweight Parameterized Suffix Array Construction
    Tomohiro, I
    Deguchi, Satoshi
    Bannai, Hideo
    Inenaga, Shunsuke
    Takeda, Masayuki
    [J]. COMBINATORIAL ALGORITHMS, 2009, 5874 : 312 - +
  • [4] Lightweight Distributed Suffix Array Construction
    Fischer, Johannes
    Kurpicz, Florian
    [J]. 2019 PROCEEDINGS OF THE MEETING ON ALGORITHM ENGINEERING AND EXPERIMENTS, ALENEX, 2019, : 27 - 38
  • [5] Fast construction of FM-index for long sequence reads
    Li, Heng
    [J]. BIOINFORMATICS, 2014, 30 (22) : 3274 - 3275
  • [6] PARALLEL CONSTRUCTION OF A SUFFIX TREE WITH APPLICATIONS
    APOSTOLICO, A
    ILIOPOULOS, C
    LANDAU, GM
    SCHIEBER, B
    VISHKIN, U
    [J]. ALGORITHMICA, 1988, 3 (03) : 347 - 365
  • [7] Scalable parallel suffix array construction
    Kulla, Fabian
    Sanders, Peter
    [J]. PARALLEL COMPUTING, 2007, 33 (09) : 605 - 612
  • [8] Fast lightweight suffix array construction and checking
    Burkhardt, S
    Kärkkäinen, J
    [J]. COMBINATORIAL PATTERN MATCHING, PROCEEDINGS, 2003, 2676 : 55 - 69
  • [9] Engineering a lightweight suffix array construction algorithm
    Manzini, G
    Ferragina, P
    [J]. ALGORITHMICA, 2004, 40 (01) : 33 - 50
  • [10] Optimal parallel suffix tree construction
    Hariharan, R
    [J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1997, 55 (01) : 44 - 69