Faster Compressed Suffix Trees for Repetitive Text Collections

被引:0
|
作者
Navarro, Gonzalo [1 ]
Ordonez, Alberto [2 ]
机构
[1] Univ Chile, Dept Comp Sci, Santiago, Chile
[2] Univ A Coruna, Lab Bases Datos, Coruna, Spain
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent compressed suffix trees targeted to highly repetitive text collections reach excellent compression performance, but operation times in the order of milliseconds. We design a new suffix tree representation for this scenario that still achieves very low space usage, only slightly larger than the best previous one, but supports the operations within microseconds. This puts the data structure in the same performance level of compressed suffix trees designed for standard text collections, which on repetitive collections use many times more space than our new structure.
引用
收藏
页码:424 / 435
页数:12
相关论文
共 50 条
  • [1] Compressed Suffix Trees for Repetitive Texts
    Abeliuk, Andres
    Navarro, Gonzalo
    STRING PROCESSING AND INFORMATION RETRIEVAL: 19TH INTERNATIONAL SYMPOSIUM, SPIRE 2012, 2012, 7608 : 30 - 41
  • [2] Faster entropy-bounded compressed suffix trees
    Fischer, Johannes
    Makinen, Veli
    Navarro, Gonzalo
    THEORETICAL COMPUTER SCIENCE, 2009, 410 (51) : 5354 - 5364
  • [3] Faster repetition-aware compressed suffix trees based on Block Trees
    Caceres, Manuel
    Navarro, Gonzalo
    INFORMATION AND COMPUTATION, 2022, 285
  • [4] Compressed suffix arrays and suffix trees with applications to text indexing and string matching
    Grossi, R
    Vitter, JS
    SIAM JOURNAL ON COMPUTING, 2005, 35 (02) : 378 - 407
  • [5] A compressed dynamic self -index for highly repetitive text collections
    Nishimoto, Takaaki
    Takabatake, Yoshimasa
    Tabei, Yasuo
    INFORMATION AND COMPUTATION, 2020, 273
  • [6] Fully Compressed Suffix Trees
    Russo, Luis M. S.
    Navarro, Gonzalo
    Oliveira, Arlindo L.
    ACM TRANSACTIONS ON ALGORITHMS, 2011, 7 (04)
  • [7] Compressed Property Suffix Trees
    Hon, Wing-Kai
    Patil, Manish
    Shah, Rahul
    Thankachan, Sharma V.
    2011 DATA COMPRESSION CONFERENCE (DCC), 2011, : 123 - 132
  • [8] PFP Compressed Suffix Trees
    Boucher, Christina
    Cvacho, Onclfej
    Gagie, Travis
    Holub, Jan
    Manzini, Giovanni
    Navarro, Gonzalo
    Rossi, Massimiliano
    2021 PROCEEDINGS OF THE SYMPOSIUM ON ALGORITHM ENGINEERING AND EXPERIMENTS, ALENEX, 2021, : 60 - 72
  • [9] Compressed property suffix trees
    Hon, Wing-Kai
    Patil, Manish
    Shah, Rahul
    Thankachan, Sharma V.
    INFORMATION AND COMPUTATION, 2013, 232 : 10 - 18
  • [10] Practical Compressed Suffix Trees
    Canovas, Rodrigo
    Navarro, Gonzalo
    EXPERIMENTAL ALGORITHMS, PROCEEDINGS, 2010, 6049 : 94 - 105