Distributed top-k query processing by exploiting skyline summaries

被引:14
|
作者
Vlachou, Akrivi [1 ]
Doulkeridis, Christos [1 ]
Norvag, Kjetil [1 ]
机构
[1] NTNU, Dept Comp Sci, Trondheim, Norway
关键词
Top-k queries; Skyline operator; Distributed databases; SELECTION QUERIES;
D O I
10.1007/s10619-012-7094-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, a trend has been observed towards supporting rank-aware query operators, such as top-k, that enable users to retrieve only a limited set of the most interesting data objects. As data nowadays is commonly stored distributed over multiple servers, a challenging problem is to support rank-aware queries in distributed environments. In this paper, we propose a novel approach, called DiTo, for efficient top-k processing over multiple servers, where each server stores autonomously a fraction of the data. Towards this goal, we exploit the inherent relationship of top-k and skyline objects, and we employ the skyline objects of servers as a data summarization mechanism for efficiently identifying the servers that store top-k results. Relying on a thresholding scheme, DiTo retrieves the top-k result set progressively, while the number of queried servers and transferred data is minimized. Furthermore, we extend DiTo to support data summarizations of bounded size, thus restricting the cost of summary distribution and maintenance. To this end, we study the challenging problem of finding an abstraction of the skyline set of fixed size that influences the performance of DiTo only slightly. Our experimental evaluation shows that DiTo performs efficiently and provides a viable solution when a high degree of distribution is required.
引用
收藏
页码:239 / 271
页数:33
相关论文
共 50 条
  • [1] Distributed top-k query processing by exploiting skyline summaries
    Akrivi Vlachou
    Christos Doulkeridis
    Kjetil Nørvåg
    [J]. Distributed and Parallel Databases, 2012, 30 : 239 - 271
  • [2] Uncertain top-k query processing in distributed environments
    Wang, Xite
    Shen, Derong
    Yu, Ge
    [J]. DISTRIBUTED AND PARALLEL DATABASES, 2016, 34 (04) : 567 - 589
  • [3] Efficient Distributed Top-k Query Processing with Caching
    Ryeng, Norvald H.
    Vlachou, Akrivi
    Doulkeridis, Christos
    Norvag, Kjetil
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT II, 2011, 6588 : 280 - 295
  • [4] Uncertain top-k query processing in distributed environments
    Xite Wang
    Derong Shen
    Ge Yu
    [J]. Distributed and Parallel Databases, 2016, 34 : 567 - 589
  • [5] Skyline-based peer-to-peer top-k query processing
    Vlachou, Akrivi
    Doulkeridis, Christos
    Norvag, Kjetil
    Vazirgiannis, Michalis
    [J]. 2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 1421 - +
  • [6] The Top-k Skyline Query in Pervasive Computing Environments
    Pan, Peng
    Sun, YuQing
    Li, Qingzhong
    Chen, ZhiYong
    Bian, Ji
    [J]. JCPC: 2009 JOINT CONFERENCE ON PERVASIVE COMPUTING, 2009, : 335 - 338
  • [7] Imperfect Top-k Skyline Query with Confidence Level
    Elmi, Sayda
    Hadjali, Allel
    Tobji, Mohamed Anis Bach
    Ben Yaghlane, Boutheina
    [J]. 2016 IEEE/ACS 13TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2016,
  • [8] Probabilistic Top-k Query Processing in Distributed Sensor Networks
    Ye, Mao
    Liu, Xingjie
    Lee, Wang-Chien
    Lee, Dik Lun
    [J]. 26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING ICDE 2010, 2010, : 585 - 588
  • [9] Optimized Query Algorithms for Top-K Group Skyline
    Liu, Jia
    Chen, Wei
    Chen, Ziyang
    Liu, Lin
    Wu, Yuhong
    Liu, Kaiyu
    Jain, Amar
    Elawady, Yasser H.
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [10] A fast top-k group skyline query method based on skyline layer
    Yang, Yuntian
    Lu, Wenbo
    Tang, Cong
    [J]. 2020 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND HUMAN-COMPUTER INTERACTION (ICHCI 2020), 2020, : 146 - 151