A two-phase data space partitioning for efficient skyline computation

被引:5
|
作者
Nasridinov, Aziz [1 ]
Choi, Jong-Hyeok [1 ]
Park, Young-Ho [2 ]
机构
[1] Chungbuk Natl Univ, Dept Comp Sci, Data Analyt Lab, Cheongju, South Korea
[2] Soomyung Womens Univ, Dept IT Engn, Engn Sch, Seoul, South Korea
关键词
Data space partitioning; Skyline; Database; LAYER-BASED INDEX; TOP-K QUERIES; CONVEX SKYLINE; GPU;
D O I
10.1007/s10586-017-1070-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The skyline has attracted a lot of attention due to its wide application in various fields. However, the skyline computation is a challenging issue as there is a high probability that today's applications deal with large and high-dimensional data. As skyline computation for such huge amount of data consumes much time, parallel and distributed skyline computations are considered. State-of-the-art methods for parallel and distributed skyline computations use various data space partitioning techniques. However, these methods are not efficient, as in certain cases, these methods perform unnecessary skyline computations in a partitioned space, where local-skyline tuples do not contribute to the global-skyline. This may impose additional processing overload and enlarge the overall skyline computation time. In this paper, we propose a novel data space partitioning method for parallel and distributed skyline computation that consists of two-phases: diagonal and entropy score curve based partitioning. The proposed method produces a small set of local-skyline tuples and leads to a more sophisticated merging step. The experiment results demonstrate that the proposed method reduces the number of comparisons and processing time of skyline computation in large amount of data when compared with the existing state-of-the-art methods.
引用
收藏
页码:3617 / 3628
页数:12
相关论文
共 50 条
  • [1] A two-phase data space partitioning for efficient skyline computation
    Aziz Nasridinov
    Jong-Hyeok Choi
    Young-Ho Park
    [J]. Cluster Computing, 2017, 20 : 3617 - 3628
  • [2] Skyline Diagram: Efficient Space Partitioning for Skyline Queries
    Liu, Jinfei
    Yang, Juncheng
    Xiong, Li
    Pei, Jian
    Luo, Jun
    Guo, Yuzhang
    Ma, Shuaicheng
    Fan, Chenglin
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (01) : 271 - 286
  • [3] VMPSP: Efficient Skyline Computation Using VMP-Based Space Partitioning
    Zhang, Kaiqi
    Yang, Donghua
    Gao, Hong
    Li, Jianzhong
    Wang, Hongzhi
    Cai, Zhipeng
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2016, 2016, 9645 : 179 - 193
  • [4] Efficient distributed skyline computation using dependency-based data partitioning
    Yin, Bo
    Zhou, Siwang
    Lin, Yaping
    Liu, Yonghe
    Hu, Yupeng
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2014, 93 : 69 - 83
  • [5] Efficient Skyline Computation on Big Data
    Han, Xixian
    Li, Jianzhong
    Yang, Donghua
    Wang, Jinbao
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (11) : 2521 - 2535
  • [6] Efficient Computation of Reverse Skyline on Data Stream
    Zhu, Ling
    Li, Cuiping
    Chen, Hong
    [J]. INTERNATIONAL JOINT CONFERENCE ON COMPUTATIONAL SCIENCES AND OPTIMIZATION, VOL 1, PROCEEDINGS, 2009, : 735 - 739
  • [7] Efficient Skyline Computation on Massive Incomplete Data
    He, Jingxuan
    Han, Xixian
    [J]. DATA SCIENCE AND ENGINEERING, 2022, 7 (02) : 102 - 119
  • [8] Efficient Skyline Computation on Massive Incomplete Data
    Jingxuan He
    Xixian Han
    [J]. Data Science and Engineering, 2022, 7 : 102 - 119
  • [9] ISSA: Efficient Skyline Computation for Incomplete Data
    Zhang, Kaiqi
    Gao, Hong
    Wang, Hongzhi
    Li, Jianzhong
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2016, 2016, 9645 : 321 - 328
  • [10] Scalable Skyline Computation Using Object-based Space Partitioning
    Zhang, Shiming
    Mamoulis, Nikos
    Cheung, David W.
    [J]. ACM SIGMOD/PODS 2009 CONFERENCE, 2009, : 483 - 494