Efficient distributed skyline computation using dependency-based data partitioning

被引:16
|
作者
Yin, Bo [1 ,2 ]
Zhou, Siwang [1 ]
Lin, Yaping [1 ]
Liu, Yonghe [3 ]
Hu, Yupeng [1 ]
机构
[1] Hunan Univ, Coll Informat Sci & Engn, Changsha 410082, Hunan, Peoples R China
[2] ChangSha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha 410114, Hunan, Peoples R China
[3] Univ Texas Arlington, Dept Comp Sci & Engn, Arlington, TX 76019 USA
基金
中国国家自然科学基金;
关键词
Skyline query; Distributed systems; Data partitioning;
D O I
10.1016/j.jss.2014.03.021
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Skyline queries, together with other advanced query operators, are essential in order to help identify sets of interesting data points buried within huge amount of data readily available these days. A skyline query retrieves sets of non-dominated data points in a multi-dimensional dataset As computing infrastructures become increasingly pervasive, connected by readily available network services, data storage and management have become inevitably more distributed. Under these distributed environments, designing efficient skyline querying with desirable quick response time and progressive returning of answers faces new challenges. To address this, in this paper, we propose a novel skyline query scheme termed MpSky. MpSky is based on a novel space partitioning scheme, employing the dependency relationships among data points on different servers. By grouping points of each server using dependencies, we are able to qualify a skyline point by only comparing it with data on dependent servers, and parallelize the skyline computation among non-dependent partitions that are from different servers or individual servers. By controlling the query propagation among partitions, we are able to generate skyline results progressively and prune partitions and points efficiently. Analytical and extensive simulation results show the effectiveness of the proposed scheme. (C) 2014 Elsevier Inc. All rights reserved.
引用
收藏
页码:69 / 83
页数:15
相关论文
共 50 条
  • [1] Supporting efficient distributed skyline computation using skyline views
    Lee, Jongwuk
    Kim, Jinhan
    Hwang, Seung-won
    [J]. INFORMATION SCIENCES, 2012, 194 : 24 - 37
  • [2] Efficient skyline computation over distributed interval data
    Li, Xiaoyong
    Ren, Kaijun
    Li, Xiaoling
    Yu, Jie
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2017, 29 (10):
  • [3] VMPSP: Efficient Skyline Computation Using VMP-Based Space Partitioning
    Zhang, Kaiqi
    Yang, Donghua
    Gao, Hong
    Li, Jianzhong
    Wang, Hongzhi
    Cai, Zhipeng
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2016, 2016, 9645 : 179 - 193
  • [4] A two-phase data space partitioning for efficient skyline computation
    Aziz Nasridinov
    Jong-Hyeok Choi
    Young-Ho Park
    [J]. Cluster Computing, 2017, 20 : 3617 - 3628
  • [5] Skyline View: Efficient Distributed Subspace Skyline Computation
    Kim, Jinhan
    Lee, Jongwuk
    Hwang, Seung-won
    [J]. DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2009, 5691 : 312 - 324
  • [6] SKYPEER: Efficient subspace skyline computation over distributed data
    Vlachou, Akrivi
    Doulkeridis, Christos
    Kotidis, Yannis
    Vazirgiannis, Michalis
    [J]. 2007 IEEE 23RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2007, : 391 - +
  • [7] A two-phase data space partitioning for efficient skyline computation
    Nasridinov, Aziz
    Choi, Jong-Hyeok
    Park, Young-Ho
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2017, 20 (04): : 3617 - 3628
  • [8] Efficient and Adaptive Distributed Skyline Computation
    Valkanas, George
    Papadopoulos, Apostolos N.
    [J]. SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, 2010, 6187 : 24 - +
  • [9] ProbSky: Efficient Computation of Probabilistic Skyline Queries Over Distributed Data
    Kuo, Ai-Te
    Chen, Haiquan
    Tang, Liang
    Ku, Wei-Shinn
    Qin, Xiao
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (05) : 5173 - 5186
  • [10] Communication-Efficient Distributed Skyline Computation
    Zhang, Haoyu
    Zhang, Qin
    [J]. CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 437 - 446