Dominant Graph: An efficient indexing structure to answer top-k queries

Cited by: 29
Authors
Zou, Lei [1]
Chen, Lei [2]
Affiliations
[1] Huazhong Univ Sci & Technol, 1037 Luoyu Rd, Wuhan 430074, Peoples R China
[2] Hong Kong Univ Sci & Technol, Kowloon, Hong Kong, Peoples R China
DOI
10.1109/ICDE.2008.4497462
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Given a record set D and a query score function F, a top-k query returns the k records from D whose values of F over their attributes are the highest. In this paper, we investigate the intrinsic connection between top-k queries and the dominant relationship between records. Based on this connection, we propose an efficient layer-based indexing structure, the Dominant Graph (DG), to improve query efficiency. Specifically, DG is built offline to express the dominant relationship between records, and a top-k query is implemented as a graph traversal problem, i.e., the Traveler algorithm. We prove theoretically that the size of the search space (that is, the number of records retrieved from the record set to answer a top-k query) in our basic algorithm is directly related to the cardinality of the skyline points in the record set (see Theorem 3.2). Based on the cost analysis, we propose an optimization technique, the pseudo record, to improve search efficiency. To handle top-k queries on high-dimensional record sets, we also propose the N-Way Traveler algorithm. Finally, extensive experiments demonstrate that our proposed methods achieve significant improvements over their counterparts, including both classical and state-of-the-art top-k algorithms. For example, the search space in our algorithm is less than 1/5 of that in AppRI [1], one of the state-of-the-art top-k algorithms. Furthermore, our method can support any aggregate monotone query function.
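The definitions in the abstract (top-k query, dominance between records, skyline points, layers) can be illustrated with a minimal Python sketch. This is not the paper's Dominant Graph or Traveler algorithm, whose details the abstract does not give. It is a plain layer-peeling illustration under stated assumptions: records are numeric tuples, larger values are better, and the score function is monotone. All function names (`dominates`, `skyline_layers`, `top_k_layered`, `top_k_brute_force`) and the sample data are hypothetical.

```python
import heapq
from typing import Callable, List, Sequence, Tuple

Record = Tuple[float, ...]


def dominates(a: Record, b: Record) -> bool:
    """a dominates b if a >= b on every attribute and a > b on at least one."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))


def skyline_layers(records: Sequence[Record]) -> List[List[Record]]:
    """Repeatedly peel off the skyline (records dominated by no remaining record)."""
    remaining = list(records)
    layers: List[List[Record]] = []
    while remaining:
        layer = [r for r in remaining
                 if not any(dominates(o, r) for o in remaining)]
        layers.append(layer)
        remaining = [r for r in remaining if r not in layer]
    return layers


def top_k_brute_force(records: Sequence[Record],
                      score: Callable[[Record], float], k: int) -> List[Record]:
    """Baseline: score every record and keep the k highest."""
    return heapq.nlargest(k, records, key=score)


def top_k_layered(records: Sequence[Record],
                  score: Callable[[Record], float], k: int) -> List[Record]:
    """Score only records in the first k layers.

    With a monotone score, a record in layer j has at least j - 1 dominators
    (one per earlier layer) that score at least as high, so the top-k answer
    can be found within the first k layers.
    """
    layers = skyline_layers(records)  # in an index, this would be built offline
    candidates = [r for layer in layers[:k] for r in layer]
    return heapq.nlargest(k, candidates, key=score)


# Hypothetical sample data and a monotone (weighted-sum) score function.
records = [(1, 9), (9, 1), (5, 5), (4, 4), (2, 2), (8, 0.5)]
score = lambda r: 0.7 * r[0] + 0.3 * r[1]

print(skyline_layers(records))
print(top_k_layered(records, score, 2))
```

In this sketch the layered query scores fewer records than the brute-force scan whenever the first k layers are small, which is the intuition behind relating the search space to the number of skyline points.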
Pages: 536 / +
Page count: 2
Related papers
50 in total
  • [1] Pareto-Based Dominant Graph: An Efficient Indexing Structure to Answer Top-K Queries
    Zou, Lei
    Chen, Lei
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (05) : 727 - 741
  • [2] Gridded dominant graph: A more efficient index structure for top-k queries based on reverse dominant point set
    Gan, Liang
    Jin, Xin
    Jia, Yan
    Li, Aiping
    Pan, Yangke
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2010, 47 (10) : 1771 - 1784
  • [3] Toward Scalable Indexing for Top-k Queries
    Lee, Jongwuk
    Cho, Hyunsouk
    Lee, Sunyou
    Hwang, Seung-Won
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (12) : 3103 - 3116
  • [4] Efficient Dual-Resolution Layer Indexing for Top-k Queries
    Lee, Jongwuk
    Cho, Hyunsouk
    Hwang, Seung-won
    [J]. 2012 IEEE 28TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2012, : 1084 - 1095
  • [5] Top-k Differential Queries in Graph Databases
    Vasilyeva, Elena
    Thiele, Maik
    Bornhoevd, Christof
    Lehner, Wolfgang
    [J]. ADVANCES IN DATABASES AND INFORMATION SYSTEMS (ADBIS 2014), 2014, 8716 : 112 - 125
  • [6] Top-k differential queries in graph databases
    Vasilyeva, Elena
    Thiele, Maik
    Bornhövd, Christof
    Lehner, Wolfgang
    [J]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8716 : 112 - 115
  • [7] Efficiently answer top-k queries on typed intervals
    Xu, Jianqiu
    Lu, Hua
    [J]. INFORMATION SYSTEMS, 2017, 71 : 164 - 181
  • [8] Answering Top-k Graph Similarity Queries in Graph Databases
    Zhu, Yuanyuan
    Qin, Lu
    Yu, Jeffrey Xu
    Cheng, Hong
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (08) : 1459 - 1474
  • [9] Efficient processing of distributed top-k queries
    Yu, HL
    Li, HG
    Wu, P
    Agrawal, D
    El Abbadi, A
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2005, 3588 : 65 - 74
  • [10] Efficient Top-k Queries for Orthogonal Ranges
    Rahul, Saladi
    Gupta, Prosenjit
    Janardan, Ravi
    Rajan, K. S.
    [J]. WALCOM: ALGORITHMS AND COMPUTATION, 2011, 6552 : 110 - +