Contracted Suffix Trees: A Simple and Dynamic Text Indexing Data Structure

Cited by: 0
Authors
Ehrenfeucht, Andrzej [1]
McConnell, Ross M. [2]
Woo, Sung-Whan [2]
Affiliations
[1] Univ Colorado, Dept Comp Sci, 430 UCB, Boulder, CO 80309 USA
[2] Colorado State Univ, Dept Comp Sci, Ft Collins, CO 80523 USA
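Keywords
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Subject classification code
0812;
Abstract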
We address the problem of finding the locations of all instances of a string P in a text T, where preprocessing of T is allowed to facilitate the queries. Previous data structures for this problem include the suffix tree, the suffix array, and the compact DAWG. We modify a data structure called a sequence tree, which was proposed by Coffman and Eve for hashing, and adapt it to the new problem. We can then produce a list of k occurrences of any string P in T in O(|P| + k) time. Because of properties shared by suffixes of a text that are not shared by arbitrary hash keys, we can build the structure in O(|T|) time, which is much faster than Coffman and Eve's algorithm. These bounds are as good as those for the suffix tree, the suffix array, and the compact DAWG. The advantages are the elementary nature of some of the algorithms for constructing and using the data structure, and the asymptotic bounds we can give for updating the data structure when the text is edited.
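To make the indexing problem in the abstract concrete, the following is a minimal Python sketch that preprocesses T once and then reports all occurrences of P. It uses a naively built suffix array, one of the earlier structures the abstract names, and is not the contracted suffix tree itself. The function names `build_suffix_array` and `find_occurrences` are hypothetical, and the sketch does not achieve the O(|T|) construction or O(|P| + k) query bounds stated above.

```python
def build_suffix_array(text):
    """Preprocess T: return suffix start positions in lexicographic order.

    Naive construction by sorting whole suffixes; this is far slower
    than the linear-time bound discussed in the abstract.
    """
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_occurrences(text, sa, pattern):
    """Return the sorted start positions of every occurrence of pattern."""
    m = len(pattern)
    if m == 0:
        return []

    # Lower bound: first suffix whose length-m prefix is >= pattern.
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo

    # Upper bound: first suffix whose length-m prefix is > pattern.
    hi = len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid

    # Suffixes in sa[start:lo] all begin with pattern.
    return sorted(sa[start:lo])


if __name__ == "__main__":
    T = "abracadabra"
    index = build_suffix_array(T)
    print(find_occurrences(T, index, "abra"))
```

The index is built once per text and reused across queries. Each query here costs two binary searches with string comparisons plus a sort of the k hits, which is the overhead that the structure described in the abstract avoids.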
Pages: 41 / +
Page count: 2