Top-k document retrieval in optimal space

Cited by: 13
Authors
Tsur, Dekel [1]
Affiliation
[1] Ben Gurion Univ Negev, Dept Comp Sci, IL-84105 Beer Sheva, Israel
Keywords
Data structures; Document retrieval; Text indexing; EFFICIENT ALGORITHMS; STRING RETRIEVAL;
DOI
10.1016/j.ipl.2013.03.012
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
We present an index for top-k most frequent document retrieval whose space is |CSA| + o(n) + D log(n/D) + O(D) bits, and its query time is O(log k log^{2+ε} n) per reported document, where D is the number of documents, n is the sum of lengths of the documents, and |CSA| is the space of the compressed suffix array for the documents. This improves over previous results for this problem, whose space complexities are |CSA| + ω(n) or 2|CSA| + ω(1). (C) 2013 Elsevier B.V. All rights reserved.
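To make the query in the abstract concrete, here is a minimal brute-force sketch of the problem: given a pattern and k, report the k documents in which the pattern occurs most often. It is not the index from the paper. It scans every document and uses no compressed suffix array. The function names `count_occurrences` and `top_k_documents`, the overlapping-occurrence convention, and the tie-breaking by document id are assumptions made for illustration only.

```python
from typing import List, Tuple


def count_occurrences(text: str, pattern: str) -> int:
    """Count occurrences of pattern in text, allowing overlaps (assumed convention)."""
    if not pattern:
        return 0
    count = 0
    start = text.find(pattern)
    while start != -1:
        count += 1
        # Advance by one position so overlapping matches are also counted.
        start = text.find(pattern, start + 1)
    return count


def top_k_documents(documents: List[str], pattern: str, k: int) -> List[Tuple[int, int]]:
    """Return up to k (document_id, frequency) pairs, most frequent first.

    Naive baseline: scans all documents on every query.
    Ties are broken by smaller document id (an arbitrary choice).
    """
    scored = []
    for doc_id, doc in enumerate(documents):
        freq = count_occurrences(doc, pattern)
        if freq > 0:  # only documents that contain the pattern are candidates
            scored.append((doc_id, freq))
    scored.sort(key=lambda pair: (-pair[1], pair[0]))
    return scored[:k]


if __name__ == "__main__":
    docs = ["abracadabra", "banana", "cabana", "xyz"]
    print(top_k_documents(docs, "a", 2))
    print(top_k_documents(docs, "ana", 5))
```

The baseline stores the documents verbatim and spends time proportional to the total text length on each query. The index described in the abstract instead pays only the stated number of bits on top of the compressed suffix array and answers in O(log k log^{2+ε} n) time per reported document.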
Pages: 440-443
Page count: 4
Related papers
  • [1] Faster Top-k Document Retrieval in Optimal Space
    Navarro, Gonzalo
    Thankachan, Sharma V.
    [J]. STRING PROCESSING AND INFORMATION RETRIEVAL (SPIRE 2013), 2013, 8214 : 255 - 262
  • [2] Top-k Document Retrieval in Compact Space and Near-Optimal Time
    Navarro, Gonzalo
    Thankachan, Sharma V.
    [J]. ALGORITHMS AND COMPUTATION, 2013, 8283 : 394 - 404
  • [3] TIME-OPTIMAL TOP-k DOCUMENT RETRIEVAL
    Navarro, Gonzalo
    Nekrich, Yakov
    [J]. SIAM JOURNAL ON COMPUTING, 2017, 46 (01) : 80 - 113
  • [4] New space/time tradeoffs for top-k document retrieval on sequences
    Navarro, Gonzalo
    Thankachan, Sharma V.
    [J]. THEORETICAL COMPUTER SCIENCE, 2014, 542 : 83 - 97
  • [5] Faster Compact Top-k Document Retrieval
    Konow, Roberto
    Navarro, Gonzalo
    [J]. 2013 DATA COMPRESSION CONFERENCE (DCC), 2013 : 351 - 360
  • [6] Top-k Document Retrieval in External Memory
    Shah, Rahul
    Sheng, Cheng
    Thankachan, Sharma V.
    Vitter, Jeffrey Scott
    [J]. ALGORITHMS - ESA 2013, 2013, 8125 : 803 - 814
  • [7] Faster Compressed Top-k Document Retrieval
    Hon, Wing-Kai
    Shah, Rahul
    Thankachan, Sharma V.
    Vitter, Jeffrey Scott
    [J]. 2013 DATA COMPRESSION CONFERENCE (DCC), 2013 : 341 - 350
  • [8] Top-K Color Queries for Document Retrieval
    Karpinski, Marek
    Nekrich, Yakov
    [J]. PROCEEDINGS OF THE TWENTY-SECOND ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2011 : 401 - 411
  • [9] Efficient In-Memory Top-k Document Retrieval
    Culpepper, J. Shane
    Petri, Matthias
    Scholer, Falk
    [J]. SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012 : 225 - 234
  • [10] Approximating Document Frequency for Self-Index based Top-k Document Retrieval
    Suzuki, Tokinori
    Fujii, Atsushi
    [J]. 2015 IEEE 29TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS WAINA 2015, 2015 : 541 - 546