Top-k document retrieval in optimal space

被引:13
|
作者
Tsur, Dekel [1 ]
机构
[1] Ben Gurion Univ Negev, Dept Comp Sci, IL-84105 Beer Sheva, Israel
关键词
Data structures; Document retrieval; Text indexing; EFFICIENT ALGORITHMS; STRING RETRIEVAL;
D O I
10.1016/j.ipl.2013.03.012
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present an index for top-k most frequent document retrieval whose space is vertical bar CSA vertical bar + o(n) + Dlog n/D + O(D) bits, and its query time is O (logk log(2+epsilon) n) per reported document, where D is the number of documents, n is the sum of lengths of the documents, and vertical bar CSA vertical bar is the space of the compressed suffix array for the documents. This improves over previous results for this problem, whose space complexities are vertical bar CSA vertical bar + omega(n) or 2 vertical bar CSA vertical bar + omega(1). (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:440 / 443
页数:4
相关论文
共 50 条
  • [41] Towards Efficient Retrieval of Top-k Entities in Systems of Engagement
    Mondal, Anirban
    Padhariya, Nilesh
    Mohania, Mukesh
    [J]. WEB INFORMATION SYSTEMS ENGINEERING, WISE 2020, PT II, 2020, 12343 : 52 - 67
  • [42] Top-k retrieval for ontology mediated access to relational databases
    Straccia, Umberto
    [J]. INFORMATION SCIENCES, 2012, 198 : 1 - 23
  • [43] Optimal Enumeration: Efficient Top-k Tree Matching
    Chang, Lijun
    Lin, Xuemin
    Zhang, Wenjie
    Yu, Jeffrey Xu
    Zhang, Ying
    Qin, Lu
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2015, 8 (05): : 533 - 544
  • [44] Top-K Entity Units Retrieval Over Big Data
    Zhang, Da
    Kabuka, Mansur R.
    [J]. WWW'17 COMPANION: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2017, : 1269 - 1272
  • [45] Range query estimation with data skewness for top-k retrieval
    Ayanso, Anteneh
    Goes, Paulo B.
    Mehta, Kumar
    [J]. DECISION SUPPORT SYSTEMS, 2014, 57 : 258 - 273
  • [46] Semantic-Based Top-k Retrieval for Competence Management
    Straccia, Umberto
    Tinelli, Eufemia
    Colucci, Simona
    Di Noia, Tommaso
    Di Sciascio, Eugenio
    [J]. FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2009, 5722 : 473 - +
  • [47] Optimizing top-k retrieval: submodularity analysis and search strategies
    Chaofeng Sha
    Keqiang Wang
    Dell Zhang
    Xiaoling Wang
    Aoying Zhou
    [J]. Frontiers of Computer Science, 2016, 10 : 477 - 487
  • [48] Top-k Term-Proximity in Succinct Space
    J. Ian Munro
    Gonzalo Navarro
    Jesper Sindahl Nielsen
    Rahul Shah
    Sharma V. Thankachan
    [J]. Algorithmica, 2017, 78 : 379 - 393
  • [49] Top-k Term-Proximity in Succinct Space
    Munro, J. Ian
    Navarro, Gonzalo
    Nielsen, Jesper Sindahl
    Shah, Rahul
    Thankachan, Sharma V.
    [J]. ALGORITHMS AND COMPUTATION, ISAAC 2014, 2014, 8889 : 169 - 180
  • [50] Top-k Term-Proximity in Succinct Space
    Munro, J. Ian
    Navarro, Gonzalo
    Nielsen, Jesper Sindahl
    Shah, Rahul
    Thankachan, Sharma V.
    [J]. ALGORITHMICA, 2017, 78 (02) : 379 - 393