Distributed top-k query processing by exploiting skyline summaries

Cited by: 14
Authors
Vlachou, Akrivi [1]
Doulkeridis, Christos [1]
Norvag, Kjetil [1]
Affiliations
[1] NTNU, Dept Comp Sci, Trondheim, Norway
Keywords
Top-k queries; Skyline operator; Distributed databases; Selection queries
DOI
10.1007/s10619-012-7094-2
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Recently, a trend has been observed towards supporting rank-aware query operators, such as top-k, that enable users to retrieve only a limited set of the most interesting data objects. Because data is now commonly distributed over multiple servers, supporting rank-aware queries in distributed environments is a challenging problem. In this paper, we propose a novel approach, called DiTo, for efficient top-k processing over multiple servers, where each server autonomously stores a fraction of the data. Towards this goal, we exploit the inherent relationship between top-k and skyline objects, and we employ the skyline objects of each server as a data summarization mechanism for efficiently identifying the servers that store top-k results. Relying on a thresholding scheme, DiTo retrieves the top-k result set progressively, while minimizing the number of queried servers and the amount of transferred data. Furthermore, we extend DiTo to support data summarizations of bounded size, thus restricting the cost of summary distribution and maintenance. To this end, we study the challenging problem of finding a fixed-size abstraction of the skyline set that influences the performance of DiTo only slightly. Our experimental evaluation shows that DiTo performs efficiently and provides a viable solution when a high degree of distribution is required.
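The idea of using per-server skylines as summaries together with a threshold can be illustrated with a small sketch. This is not the DiTo algorithm from the paper, whose details are not given in the abstract; it is a simplified illustration under stated assumptions: lower attribute values are better, the scoring function is a weighted sum with non-negative weights, every server holds at least one object, and the names `skyline`, `score`, and `top_k` are hypothetical. Because the best-scoring object of any such monotone function lies on the skyline, the minimum score over a server's skyline is a lower bound on every score that server can return.

```python
import heapq

def dominates(p, q):
    """True if p is no worse than q everywhere and strictly better somewhere
    (assumption: lower values are better)."""
    return all(a <= b for a, b in zip(p, q)) and any(a < b for a, b in zip(p, q))

def skyline(points):
    """Objects not dominated by any other object (quadratic, for illustration)."""
    return [p for p in points if not any(dominates(q, p) for q in points)]

def score(weights, p):
    """Monotone linear scoring function; weights are assumed non-negative."""
    return sum(w * x for w, x in zip(weights, p))

def top_k(servers, weights, k):
    """Return (top-k list of (score, object), names of servers actually queried).

    servers: dict mapping a server name to its non-empty list of objects.
    """
    # Each server's summary is its skyline; the best skyline score bounds
    # every score on that server from below.
    bounds = sorted(
        (min(score(weights, p) for p in skyline(pts)), name)
        for name, pts in servers.items()
    )
    result, queried = [], []
    for bound, name in bounds:
        # Threshold test: once k results are held and the next server cannot
        # beat the current k-th score, no remaining server can either.
        if len(result) == k and bound >= result[-1][0]:
            break
        queried.append(name)
        local = heapq.nsmallest(k, ((score(weights, p), p) for p in servers[name]))
        result = heapq.nsmallest(k, result + local)
    return result, queried

servers = {
    "A": [(1, 9), (2, 2), (9, 1)],
    "B": [(3, 4), (6, 7)],
    "C": [(8, 8), (9, 9)],
}
result, queried = top_k(servers, weights=(0.5, 0.5), k=2)
print(result, queried)
```

In this toy run the third server is never contacted, because its best skyline score cannot improve on the two results already collected. A real system would also have to ship and maintain the summaries, which is the cost that the bounded-size summaries mentioned in the abstract are meant to restrict.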
Pages: 239-271
Number of pages: 33
Related papers
50 in total
  • [21] Processing Spatial Keyword Query as a Top-k Aggregation Query
    Zhang, Dongxiang
    Chan, Chee-Yong
    Tan, Kian-Lee
    [J]. SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 355 - 364
  • [22] Algebraic query optimization for distributed top-k queries
    Neumann, Thomas
    Michel, Sebastian
    [J]. COMPUTER SCIENCE-RESEARCH AND DEVELOPMENT, 2007, 21 (3-4): : 197 - 211
  • [23] An Approximate Top-k Query Algorithm in Distributed Network
    Li, Wenhua
    Yu, Wenting
    Xiao, Feng
    [J]. PROCEEDINGS OF THE 2009 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND NATURAL COMPUTING, VOL II, 2009, : 417 - 420
  • [24] Joint Top-K Spatial Keyword Query Processing
    Wu, Dingming
    Yiu, Man Lung
    Cong, Gao
    Jensen, Christian S.
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (10) : 1889 - 1903
  • [25] Distributed multi-dimensional probabilistic Top-k query processing in sensor networks
    Zhu, Jinghua
    Guan, Xuemin
    [J]. Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2012, 40 (SUPPL.1): : 389 - 393
  • [26] Sliding window top-k dominating query processing over distributed data streams
    Daichi Amagata
    Takahiro Hara
    Shojiro Nishio
    [J]. Distributed and Parallel Databases, 2016, 34 : 535 - 566
  • [27] Sliding window top-k dominating query processing over distributed data streams
    Amagata, Daichi
    Hara, Takahiro
    Nishio, Shojiro
    [J]. DISTRIBUTED AND PARALLEL DATABASES, 2016, 34 (04) : 535 - 566
  • [28] Examining the Additivity of Top-k Query Processing Innovations
    Mackenzie, Joel
    Moffat, Alistair
    [J]. CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 1085 - 1094
  • [29] Efficient processing of distributed top-k queries
    Yu, HL
    Li, HG
    Wu, P
    Agrawal, D
    El Abbadi, A
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2005, 3588 : 65 - 74
  • [30] Top-k Combinatorial Skyline Queries
    Su, I-Fang
    Chung, Yu-Chi
    Lee, Chiang
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT II, PROCEEDINGS, 2010, 5982 : 79 - +