Towards real-time suffix tree construction

Cited by: 0
Authors
Amir, Amihood [1]
Kopelowitz, Tsvi [1]
Lewenstein, Moshe [1]
Lewenstein, Noa [1]
Affiliations
[1] Bar Ilan Univ, Dept Comp Sci, IL-52900 Ramat Gan, Israel
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The quest for a real-time suffix tree construction algorithm is over three decades old. To date, there is no convincing, understandable solution to this problem. This paper takes a step in this direction by constructing a suffix tree online in O(log n) time for every single input symbol. Clearly, it is impossible to achieve better than O(log n) time per symbol in the comparison model; therefore, no true real-time algorithm can exist for infinite alphabets. Nevertheless, the best that can be hoped for is that the construction time for every symbol does not exceed O(log n) (as opposed to the amortized O(log n) time per symbol achieved by currently known algorithms). To our knowledge, our algorithm is the first that spends O(log n) time in the worst case on every single input symbol. We also provide a simple algorithm that constructs an indexing structure (the BIS) online in O(log n) time per input symbol, where n is the number of text symbols input thus far. This structure, together with fast LCP (Longest Common Prefix) queries on it, provides the backbone for the suffix tree construction. Together, our two data structures provide a searching algorithm for a pattern of length m whose time is O(min(m log |Σ|, m + log n) + tocc), where tocc is the number of occurrences of the pattern.
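To make the query side of the abstract concrete, the following is a minimal sketch of pattern search over a lexicographically sorted list of suffixes. It is not the paper's BIS or its suffix tree: the index is built statically and naively, with no online or worst-case construction guarantee. The search itself does O(log n) prefix comparisons of length at most m, so it runs in O(m log n + tocc), which is weaker than the bound stated in the abstract. The function names `build_suffix_index` and `find_occurrences` are hypothetical and chosen only for illustration.

```python
def build_suffix_index(text):
    """Return the start positions of all suffixes of `text`, sorted
    lexicographically. Naive static construction (assumption: this is
    only a stand-in for an index, not the paper's online BIS)."""
    return sorted(range(len(text)), key=lambda i: text[i:])


def _bound(text, index, pattern, upper):
    """Binary search over the sorted suffixes, comparing only the first
    len(pattern) symbols of each suffix. Returns the first rank whose
    prefix is >= pattern (upper=False) or > pattern (upper=True)."""
    m = len(pattern)
    lo, hi = 0, len(index)
    while lo < hi:
        mid = (lo + hi) // 2
        prefix = text[index[mid]:index[mid] + m]
        if prefix < pattern or (upper and prefix == pattern):
            lo = mid + 1
        else:
            hi = mid
    return lo


def find_occurrences(text, index, pattern):
    """Return all start positions of `pattern` in `text`, in increasing
    order. All matching suffixes are contiguous in the sorted index."""
    first = _bound(text, index, pattern, upper=False)
    last = _bound(text, index, pattern, upper=True)
    return sorted(index[first:last])


if __name__ == "__main__":
    text = "banana"
    index = build_suffix_index(text)
    print(find_occurrences(text, index, "ana"))
```

All suffixes that start with the pattern form one contiguous block in the sorted order, so two binary searches delimit the block and the occurrences are read off directly; that read-off is the tocc term in the bound.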
Pages: 67-78
Page count: 12