Distributed top-k query processing by exploiting skyline summaries

被引:14
|
作者
Vlachou, Akrivi [1 ]
Doulkeridis, Christos [1 ]
Norvag, Kjetil [1 ]
机构
[1] NTNU, Dept Comp Sci, Trondheim, Norway
关键词
Top-k queries; Skyline operator; Distributed databases; SELECTION QUERIES;
D O I
10.1007/s10619-012-7094-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, a trend has been observed towards supporting rank-aware query operators, such as top-k, that enable users to retrieve only a limited set of the most interesting data objects. As data nowadays is commonly stored distributed over multiple servers, a challenging problem is to support rank-aware queries in distributed environments. In this paper, we propose a novel approach, called DiTo, for efficient top-k processing over multiple servers, where each server stores autonomously a fraction of the data. Towards this goal, we exploit the inherent relationship of top-k and skyline objects, and we employ the skyline objects of servers as a data summarization mechanism for efficiently identifying the servers that store top-k results. Relying on a thresholding scheme, DiTo retrieves the top-k result set progressively, while the number of queried servers and transferred data is minimized. Furthermore, we extend DiTo to support data summarizations of bounded size, thus restricting the cost of summary distribution and maintenance. To this end, we study the challenging problem of finding an abstraction of the skyline set of fixed size that influences the performance of DiTo only slightly. Our experimental evaluation shows that DiTo performs efficiently and provides a viable solution when a high degree of distribution is required.
引用
收藏
页码:239 / 271
页数:33
相关论文
共 50 条
  • [31] Top-k skyline: A unified approach
    Goncalves, M
    Vidal, ME
    [J]. ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS 2005: OTM 2005 WORKSHOPS, PROCEEDINGS, 2005, 3762 : 790 - 799
  • [32] Distributed top-k similarity query on big trajectory streams
    Zhang, Zhigang
    Qi, Xiaodong
    Wang, Yilin
    Jin, Cheqing
    Mao, Jiali
    Zhou, Aoying
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2019, 13 (03) : 647 - 664
  • [33] Distributed top-k similarity query on big trajectory streams
    Zhigang Zhang
    Xiaodong Qi
    Yilin Wang
    Cheqing Jin
    Jiali Mao
    Aoying Zhou
    [J]. Frontiers of Computer Science, 2019, 13 : 647 - 664
  • [34] Scalable Top-K Query Processing Using Graphics Processing Unit
    Zhang, Yulin
    Fang, Hui
    Li, Xiaoming
    [J]. LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, LCPC 2017, 2019, 11403 : 240 - 261
  • [35] An experimental survey of regret minimization query and variants: bridging the best worlds between top-k query and skyline query
    Min Xie
    Raymond Chi-Wing Wong
    Ashwin Lall
    [J]. The VLDB Journal, 2020, 29 : 147 - 175
  • [36] Threshold-Based Distributed Continuous Top-k Query Processing for Minimizing Communication Overhead
    Udomlamlert, Kamalas
    Hara, Takahiro
    Nishio, Shojiro
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (02): : 383 - 396
  • [37] An experimental survey of regret minimization query and variants: bridging the best worlds between top-k query and skyline query
    Xie, Min
    Wong, Raymond Chi-Wing
    Lall, Ashwin
    [J]. VLDB JOURNAL, 2020, 29 (01): : 147 - 175
  • [38] Best position algorithms for efficient top-k query processing
    Akbarinia, Reza
    Pacitti, Esther
    Valduriez, Patrick
    [J]. INFORMATION SYSTEMS, 2011, 36 (06) : 973 - 989
  • [39] Crowdsourcing for Top-K Query Processing over Uncertain Data
    Ciceri, Eleonora
    Fraternali, Piero
    Martinenghi, Davide
    Tagliasacchi, Marco
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (01) : 41 - 53
  • [40] Efficient Group Top-k Spatial Keyword Query Processing
    Yao, Kai
    Li, Jianjun
    Li, Guohui
    Luo, Changyin
    [J]. WEB TECHNOLOGIES AND APPLICATIONS, PT I, 2016, 9931 : 153 - 165