Optimal suffix tree construction with large alphabets

被引:274
|
作者
Farach, M [1 ]
机构
[1] Rutgers State Univ, Dept Comp Sci, Piscataway, NJ 08855 USA
关键词
D O I
10.1109/SFCS.1997.646102
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The suffix tree of a string is the fundamental data structure of combinatorial pattern matching. Weiner [Wei73] who introduced the data structure, gave an O(n)-time algorithm far building the suffix: tree of an n-character string Brawn from a constant size alphabet. In the comparison model, there is a trivial Omega(n log n)-time lower bound based on sorting, and Weiner's algorithm matches this bound trivially. For integer alphabets, a substantial gap remains between the known upper and lower bounds, and closing this gap is the main open question in the construction of suffix trees. There is no super-linear lower bound, and the fastest known algorithm was the O(n log n) time comparison based algorithm. We settle this open problem by closing the gap: we build suffix trees in linear fine for integer alphabet.
引用
收藏
页码:137 / 143
页数:7
相关论文
共 50 条
  • [21] On-line suffix tree construction with reduced branching
    Senft, Martin
    Dvorak, Tomas
    JOURNAL OF DISCRETE ALGORITHMS, 2012, 12 : 48 - 60
  • [22] Engineering a fast online persistent suffix tree construction
    Bedathur, SJ
    Haritsa, JR
    20TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2004, : 720 - 731
  • [23] Parallel Construction of Succinct Representations of Suffix Tree Topologies
    Baier, Uwe
    Beller, Timo
    Ohlebusch, Enno
    STRING PROCESSING AND INFORMATION RETRIEVAL (SPIRE 2015), 2015, 9309 : 234 - 245
  • [24] SPACE-ECONOMICAL SUFFIX TREE CONSTRUCTION ALGORITHM
    MCCREIGHT, EM
    JOURNAL OF THE ACM, 1976, 23 (02) : 262 - 272
  • [25] Parallel Generalized Suffix Tree Construction for Genomic Data
    Al Aziz, Md Momin
    Thulasiraman, Parimala
    Mohammed, Noman
    ALGORITHMS FOR COMPUTATIONAL BIOLOGY (ALCOB 2020), 2020, 12099 : 3 - 15
  • [26] Towards real-time suffix tree construction
    Amir, Amihood
    Kopelowitz, Tsvi
    Lewenstein, Moshe
    Lewenstein, Noa
    STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2005, 3772 : 67 - 78
  • [27] Linear-time construction of compressed suffix arrays using o(n log n)-bit working space for large alphabets
    Na, JC
    COMBINATORIAL PATTERN MATCHING, PROCEEDINGS, 2005, 3537 : 57 - 67
  • [28] The wavelet matrix: An efficient wavelet tree for large alphabets
    Claude, Francisco
    Navarro, Gonzalo
    Ordonez, Alberto
    INFORMATION SYSTEMS, 2015, 47 : 15 - 32
  • [29] Optimal parallel construction of minimal suffix and factor automata
    Breslauer, Dany
    Hariharan, Ramesh
    Parallel processing letters, 1996, 6 (01): : 35 - 44
  • [30] A survey of practical algorithms for suffix tree construction in external memory
    Barsky, M.
    Stege, U.
    Thomo, A.
    SOFTWARE-PRACTICE & EXPERIENCE, 2010, 40 (11): : 965 - 988