Top-k document retrieval in optimal space

被引:13
|
作者
Tsur, Dekel [1 ]
机构
[1] Ben Gurion Univ Negev, Dept Comp Sci, IL-84105 Beer Sheva, Israel
关键词
Data structures; Document retrieval; Text indexing; EFFICIENT ALGORITHMS; STRING RETRIEVAL;
D O I
10.1016/j.ipl.2013.03.012
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present an index for top-k most frequent document retrieval whose space is vertical bar CSA vertical bar + o(n) + Dlog n/D + O(D) bits, and its query time is O (logk log(2+epsilon) n) per reported document, where D is the number of documents, n is the sum of lengths of the documents, and vertical bar CSA vertical bar is the space of the compressed suffix array for the documents. This improves over previous results for this problem, whose space complexities are vertical bar CSA vertical bar + omega(n) or 2 vertical bar CSA vertical bar + omega(1). (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:440 / 443
页数:4
相关论文
共 50 条
  • [31] Evaluating continuous top-k queries over document streams
    Weixiong Rao
    Lei Chen
    Shudong Chen
    Sasu Tarkoma
    [J]. World Wide Web, 2014, 17 : 59 - 83
  • [32] K* Search over Orbit Space for Top-k Planning
    Katz, Michael
    Lee, Junkyu
    [J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 5368 - 5376
  • [33] Continuous Top-k Monitoring on Document Streams (Extended Abstract)
    Hou, Leong U.
    Zhang, Junjie
    Mouratidis, Kyriakos
    Li, Ye
    [J]. 2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 1803 - 1804
  • [34] Evaluating continuous top-k queries over document streams
    Rao, Weixiong
    Chen, Lei
    Chen, Shudong
    Tarkoma, Sasu
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2014, 17 (01): : 59 - 83
  • [35] Top-k Ranked Document Search in General Text Databases
    Culpepper, J. Shane
    Navarro, Gonzalo
    Puglisi, Simon J.
    Turpin, Andrew
    [J]. ALGORITHMS-ESA 2010, PT II, 2010, 6347 : 194 - +
  • [36] Optimizing Top-k Retrieval: Submodularity Analysis and Search Strategies
    Sha, Chaofeng
    Wang, Keqiang
    Zhang, Dell
    Wang, Xiaoling
    Zhou, Aoying
    [J]. WEB-AGE INFORMATION MANAGEMENT, WAIM 2014, 2014, 8485 : 18 - 29
  • [37] Optimizing top-k retrieval:submodularity analysis and search strategies
    Chaofeng SHA
    Keqiang WANG
    Dell ZHANG
    Xiaoling WANG
    Aoying ZHOU
    [J]. Frontiers of Computer Science, 2016, 10 (03) : 477 - 487
  • [38] A Top-K Retrieval algorithm based on a decomposition of ranking functions
    Madrid, Nicolas
    Rusnok, Pavel
    [J]. INFORMATION SCIENCES, 2019, 474 : 136 - 153
  • [39] Optimizing top-k retrieval: submodularity analysis and search strategies
    Sha, Chaofeng
    Wang, Keqiang
    Zhang, Dell
    Wang, Xiaoling
    Zhou, Aoying
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2016, 10 (03) : 477 - 487
  • [40] Efficient Compressed Indexing for Approximate Top-k String Retrieval
    Ferrada, Hector
    Navarro, Gonzalo
    [J]. STRING PROCESSING AND INFORMATION RETRIEVAL, SPIRE 2014, 2014, 8799 : 18 - 30