Space-Time Tradeoffs for Longest-Common-Prefix Array Computation

被引:0
|
作者
Puglisi, Simon J. [1 ]
Turpin, Andrew [1 ]
机构
[1] RMIT Univ, Sch Comp Sci & Informat Technol, Melbourne, Vic, Australia
来源
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The suffix array, a space efficient alternative to the suffix tree, is an important data structure for string processing, enabling efficient and often optimal algorithms for pattern matching, data compression, repeat finding and many problems arising in computational biology. An essential augmentation to the suffix array for many of these tasks is the Longest Common Prefix (LCP) array. In particular the LCP array allows one to simulate bottom-up and top-down traversals of the suffix tree with significantly less memory overhead (but in the same time bounds). Since 2001 the LCP array has been computable in Theta(n) time, but the algorithm (even after subsequent refinements) requires relatively large working memory. In this paper we describe a new algorithm that provides a continuous space-time tradeoff for LCP array construction, running in O(nv) time and requiring n+O(n/root v-+v) bytes of working space, where v can be chosen to suit the available memory. Furthermore, the algorithm processes the suffix array, and outputs the LCP, strictly left-to-right, making it suitable for use with external memory. We show experimentally that for many naturally occurring strings our algorithm is faster than the linear time algorithms, while using significantly less working memory.
引用
收藏
页码:124 / 135
页数:12
相关论文
共 50 条
  • [1] Permuted Longest-Common-Prefix Array
    Karkkainen, Juha
    Manzini, Giovanni
    Puglisi, Simon J.
    COMBINATORIAL PATTERN MATCHING, PROCEEDINGS, 2009, 5577 : 181 - +
  • [2] String inference from longest-common-prefix array
    Karkkainen, Juha
    Piatkowski, Marcin
    Puglisi, Simon J.
    THEORETICAL COMPUTER SCIENCE, 2023, 942 : 180 - 199
  • [3] Sampled Longest Common Prefix Array
    Siren, Jouni
    COMBINATORIAL PATTERN MATCHING, PROCEEDINGS, 2010, 6129 : 227 - 237
  • [4] Towards a real time algorithm for parameterized longest common prefix computation
    Amir, Amihood
    Kondratovsky, Eitan
    THEORETICAL COMPUTER SCIENCE, 2021, 852 : 132 - 137
  • [5] Low Space External Memory Construction of the Succinct Permuted Longest Common Prefix Array
    Tischler, German
    STRING PROCESSING AND INFORMATION RETRIEVAL, SPIRE 2016, 2016, 9954 : 178 - 190
  • [6] Exact Tandem Repeats using Suffix Array and Longest Common Prefix
    Bhukya, Raju
    Naveen, I
    Gupta, Rohan
    Anurag, K.
    Achyuth, A.
    Taruni
    HELIX, 2018, 8 (05): : 3686 - 3691
  • [7] The Colored Longest Common Prefix Array Computed via Sequential Scans
    Garofalo, Fabio
    Rosone, Giovanna
    Sciortino, Marinella
    Verzotto, Davide
    STRING PROCESSING AND INFORMATION RETRIEVAL, SPIRE 2018, 2018, 11147 : 153 - 167
  • [8] SPACE-TIME TRADEOFFS FOR LINEAR RECURSION
    SWAMY, S
    SAVAGE, JE
    MATHEMATICAL SYSTEMS THEORY, 1983, 16 (01): : 9 - 27
  • [9] Space-time tradeoffs or emptiness queries
    Erickson, J
    SIAM JOURNAL ON COMPUTING, 2000, 29 (06) : 1968 - 1996
  • [10] Space-Time Tradeoffs in Photo Sequencing
    Dekel, Tali
    Moses, Yael
    Avidan, Shai
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 977 - 984