Suffix trays and suffix trists: Structures for faster text indexing

Cited by: 0
Authors
Cole, Richard
Kopelowitz, Tsvi
Lewenstein, Moshe
Affiliations
[1] NYU, New York, NY 10012 USA
[2] Bar Ilan Univ, IL-52100 Ramat Gan, Israel
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods]
Subject classification code
081202
Abstract
Suffix trees and suffix arrays are two of the most widely used data structures for text indexing. Each uses linear space and can be constructed in linear time [3,5,6,7]. However, when answering queries, the former takes O(m log |Σ|) time, where m is the query size and |Σ| is the alphabet size, while the latter takes O(m + log n) time, where n is the text size. We propose a novel way of combining the two into what we call a suffix tray. The space and construction time remain linear, and the query time improves to O(m + log |Σ|). We also consider the online version of indexing, where the indexing structure continues to update the text online and queries are answered in tandem. Here we suggest a suffix trist, a cross between a suffix tree and a suffix list. It supports queries in O(m + log |Σ|) time. The space and text update time of a suffix trist are the same as for the suffix tree or the suffix list.
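For readers unfamiliar with the suffix array baseline mentioned in the abstract, the following is a minimal illustrative sketch of pattern lookup by binary search over a suffix array. It is not the suffix tray or suffix trist construction from the paper. The function names `build_suffix_array` and `find_occurrences` are hypothetical, the construction is a naive sort rather than a linear-time algorithm, and the search shown is the simple O(m log n) variant; the O(m + log n) bound quoted above needs additional longest-common-prefix information that this sketch omits.

```python
def build_suffix_array(text: str) -> list[int]:
    """Return the start positions of all suffixes of text in lexicographic order.

    Naive construction by sorting suffix strings; for illustration only,
    not one of the linear-time algorithms the abstract refers to.
    """
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_occurrences(text: str, sa: list[int], pattern: str) -> list[int]:
    """Return all start positions of pattern in text, in increasing order.

    Uses two binary searches over the suffix array. Each comparison
    inspects at most m = len(pattern) characters, so a query costs
    O(m log n) character comparisons.
    """
    m = len(pattern)

    # Lower bound: first suffix whose length-m prefix is >= pattern.
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo

    # Upper bound: first suffix whose length-m prefix is > pattern.
    hi = len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid

    # Every suffix in sa[start:lo] begins with pattern.
    return sorted(sa[start:lo])


if __name__ == "__main__":
    text = "banana"
    sa = build_suffix_array(text)
    print(sa)                                # [5, 3, 1, 0, 4, 2]
    print(find_occurrences(text, sa, "ana"))  # [1, 3]
```

The binary search is correct because truncating lexicographically sorted suffixes to their first m characters leaves them in non-decreasing order, so all suffixes that start with the pattern form one contiguous block of the array.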
Pages: 358-369
Page count: 12