Improved Compressed Indexes for Full-Text Document Retrieval

被引:0
|
作者
Belazzougui, Djamal [1 ,2 ]
Navarro, Gonzalo [2 ]
机构
[1] Univ Paris 07, LIAFA, F-75221 Paris 05, France
[2] Univ Chile, Dept Comp Sci, Santiago, Chile
关键词
EFFICIENT ALGORITHMS; QUERIES;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We give new space/time tradeoffs for compressed indexes that answer document retrieval queries on general sequences. On a collection of D documents of total length 72, current approaches require at least vertical bar CSA vertical bar + O(n lg D/1g lg D) or 2 vertical bar CSA vertical bar + o(n) bits of space, where CSA is a full-text index. Using monotone mininum perfect hash functions, we give new algorithms for document listing with frequencies and top-k document retrieval using just vertical bar CSA vertical bar + O(n lg lg lg D) bits. We also improve current solutions that use 2 vertical bar CSA vertical bar + o(n) bits, and consider other problems such as colored range listing, top-k, most important documents, and computing arbitrary frequencies.
引用
收藏
页码:386 / +
页数:3
相关论文
共 50 条
  • [31] The Performance Study of Database Full-Text Retrieval
    Wu, Daiwen
    [J]. MODERN INDUSTRIAL IOT, BIG DATA AND SUPPLY CHAIN, IIOTBDSC 2020, 2021, 218 : 239 - 247
  • [32] A public library based on full-text retrieval
    Witten, IH
    Nevill-Manning, C
    McNab, R
    Cunningham, SJ
    [J]. COMMUNICATIONS OF THE ACM, 1998, 41 (04) : 71 - 75
  • [33] FULL-TEXT DATABASE RETRIEVAL USING PARAGRAPHS - IN THE CASE OF JAPANESE TECHNICAL DOCUMENT DATABASE
    NOZUE, M
    [J]. LIBRARY AND INFORMATION SCIENCE, 1993, (31): : 79 - 131
  • [34] APPLICATION OF FULL-TEXT RETRIEVAL TO LITIGATION SUPPORT
    RUBIN, JS
    [J]. FORUM-AMERICAN BAR ASSOCIATION, 1976, 11 (04): : 1136 - 1141
  • [35] FULL-TEXT COMPUTER RETRIEVAL OF MEDICAL LITERATURE
    LLAURADO, JG
    [J]. INTERNATIONAL JOURNAL OF BIO-MEDICAL COMPUTING, 1986, 18 (3-4): : 161 - 163
  • [36] OPTOELECTRONIC FULL-TEXT RETRIEVAL-SYSTEM
    KIM, YW
    BERRA, PB
    [J]. OPTICAL ENGINEERING, 1992, 31 (05) : 906 - 914
  • [37] Full-text Retrieval System for Humanities Researches
    Murakawa, Takehiko
    Watagami, Yukiharu
    Utsunomiya, Keigo
    Nakagawa, Masaru
    [J]. KNOWLEDGE-BASED SOFTWARE ENGINEERING, 2012, 240 : 118 - +
  • [38] A Method of Full-text Retrieval Based on Lucene
    Chen, Xiangrong
    Sun, Yong
    Ge, Xiaopei
    Wang, Congwei
    [J]. PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON INFORMATION, ELECTRONIC AND COMPUTER SCIENCE, VOLS I AND II, 2009, : 217 - 220
  • [39] Automated indexing for full-text information retrieval
    Berrios, DC
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2000, : 71 - 75
  • [40] A novel full-text indexing model for Chinese text retrieval
    Zhou, SG
    Hu, YF
    Hu, JT
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, 2001, 2113 : 370 - 379