Processing of Probabilistic Skyline Queries Using MapReduce

Authors
Park, Yoonjae [1]
Min, Jun-Ki [2]
Shim, Kyuseok [1]
Affiliations
[1] Seoul Natl Univ, Seoul, South Korea
[2] Korea Univ Tech & Edu, Cheonan, South Korea
Source
PROCEEDINGS OF THE VLDB ENDOWMENT | 2015, Vol. 8, No. 12
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
A growing number of applications naturally generate large volumes of uncertain data. With the advent of such applications, support for advanced analysis queries, such as the skyline and its variant operators, over big uncertain data has become important. In this paper, we propose effective parallel algorithms using MapReduce to process probabilistic skyline queries for uncertain data modeled by both discrete and continuous models. We present three filtering methods that identify probabilistic non-skyline objects in advance. We then develop a single-phase MapReduce algorithm, PS-QP-MR, which uses space partitioning based on a variant of quadtrees to distribute the instances of objects effectively, and an enhanced algorithm, PS-QPF-MR, which additionally applies the three filtering methods. We also propose a workload balancing technique that balances the workload of the reduce functions based on the number of machines available. Finally, we present the brute-force algorithms PS-BR-MR and PS-BRF-MR, which partition the data randomly; PS-BRF-MR also applies the filtering methods. In our experiments, we demonstrate the efficiency and scalability of PS-QPF-MR compared with the other algorithms.
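The abstract does not spell out what a probabilistic skyline query computes, so the following is a minimal single-machine sketch of the commonly used definition under the discrete model. It is not PS-QP-MR or any other algorithm from the paper. The assumptions are ours: each object is a list of (point, probability) instances, smaller values are preferred in every dimension, objects are independent, and an object is reported when its skyline probability reaches a user-given threshold. The names `dominates`, `skyline_probability`, `probabilistic_skyline`, and the sample data are hypothetical.

```python
from math import prod

def dominates(a, b):
    """True if point a dominates b: no worse in every dimension, better in one.
    Assumption: smaller values are preferred."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))

def skyline_probability(objects, name):
    """Probability that object `name` belongs to the skyline.

    `objects` maps an object name to its list of (point, probability) instances.
    Objects are assumed to be independent of each other.
    """
    total = 0.0
    for point, p in objects[name]:
        # For every other object, the chance that none of its instances
        # dominating `point` is the one that actually occurs.
        not_dominated = prod(
            1.0 - sum(q for other_point, q in instances if dominates(other_point, point))
            for other, instances in objects.items()
            if other != name
        )
        total += p * not_dominated
    return total

def probabilistic_skyline(objects, threshold):
    """Names of objects whose skyline probability is at least `threshold`."""
    return {n for n in objects if skyline_probability(objects, n) >= threshold}

# Hypothetical sample data: three objects in two dimensions.
objects = {
    "A": [((1, 1), 0.5), ((4, 4), 0.5)],
    "B": [((2, 2), 1.0)],
    "C": [((3, 3), 1.0)],
}
result = probabilistic_skyline(objects, 0.5)
```

This brute-force version compares every instance against every instance of every other object, which is the quadratic cost that partitioning and early filtering of non-skyline objects, as described in the abstract, are meant to reduce.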
Pages: 1406-1417
Page count: 12