Improved Compressed Indexes for Full-Text Document Retrieval

Cited by: 0
Authors
Belazzougui, Djamal [1, 2]
Navarro, Gonzalo [2]
Affiliations
[1] Univ Paris 07, LIAFA, F-75221 Paris 05, France
[2] Univ Chile, Dept Comp Sci, Santiago, Chile
Keywords
EFFICIENT ALGORITHMS; QUERIES
DOI
Not available
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Subject classification code
0812
Abstract
We give new space/time tradeoffs for compressed indexes that answer document retrieval queries on general sequences. On a collection of D documents of total length n, current approaches require at least |CSA| + O(n lg D / lg lg D) or 2|CSA| + o(n) bits of space, where CSA is a full-text index. Using monotone minimum perfect hash functions, we give new algorithms for document listing with frequencies and top-k document retrieval using just |CSA| + O(n lg lg lg D) bits. We also improve current solutions that use 2|CSA| + o(n) bits, and consider other problems such as colored range listing, top-k most important documents, and computing arbitrary frequencies.
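To make the query types named in the abstract concrete, the following is a minimal, uncompressed baseline sketch of document listing with frequencies and top-k retrieval. It is not the paper's method: it uses a plain suffix array and an explicit document array instead of a CSA and monotone minimal perfect hash functions, and it makes no attempt to meet the stated space bounds. The function names (`build_index`, `sa_range`, `list_with_frequencies`, `top_k`) and the `"\x00"` separator are assumptions made for illustration only.

```python
from collections import Counter

SEP = "\x00"  # assumed separator; must not occur in documents or patterns


def build_index(docs):
    """Build a plain suffix array and document array (illustration only)."""
    text = "".join(d + SEP for d in docs)
    # doc_of[i] = identifier of the document that contains text position i
    doc_of = []
    for j, d in enumerate(docs):
        doc_of.extend([j] * (len(d) + 1))
    # Naive construction by sorting suffixes; fine for small inputs only.
    sa = sorted(range(len(text)), key=lambda i: text[i:])
    return text, sa, doc_of


def sa_range(text, sa, pattern):
    """Return [start, end) of suffix array entries prefixed by pattern."""
    m = len(pattern)
    lo, hi = 0, len(sa)
    while lo < hi:  # first suffix whose m-char prefix is >= pattern
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo
    hi = len(sa)
    while lo < hi:  # first suffix whose m-char prefix is > pattern
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid
    return start, lo


def list_with_frequencies(index, pattern):
    """Document listing with frequencies: {doc id: occurrence count}."""
    text, sa, doc_of = index
    start, end = sa_range(text, sa, pattern)
    return Counter(doc_of[sa[i]] for i in range(start, end))


def top_k(index, pattern, k):
    """Top-k retrieval: the k documents with the most occurrences."""
    return list_with_frequencies(index, pattern).most_common(k)


index = build_index(["abracadabra", "banana", "cab"])
print(dict(list_with_frequencies(index, "a")))
print(top_k(index, "a", 2))
```

This baseline scans every occurrence in the suffix array range, so its query time grows with the total number of occurrences rather than with the number of distinct documents reported, and it stores the document array explicitly. The compressed indexes described in the abstract are aimed at avoiding exactly those costs.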
Pages: 386 / +
Page count: 3