Top k probabilistic skyline queries on uncertain data

被引:4
|
作者
Yang, Zhibang [1 ]
Li, Kenli [2 ]
Zhou, Xu [2 ]
Mei, Jing [3 ]
Gao, Yunjun [4 ]
机构
[1] Changsha Univ, Coll Comp Engn & Appl Math, Changsha 410003, Hunan, Peoples R China
[2] Hunan Univ, Coll Informat Sci & Engn, Changsha 410082, Hunan, Peoples R China
[3] Hunan Normal Univ, Coll Math & Comp Sci, Changsha 410082, Hunan, Peoples R China
[4] Zhejiang Univ, Coll Comp Sci, Hangzhou 310027, Zhejiang, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Skyline query; Query processing; Uncertain data; EFFICIENT; RETRIEVAL; ALGORITHMS;
D O I
10.1016/j.neucom.2018.03.052
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Uncertainty of data is inherent in many applications, and query processing over uncertain data has gained widespread attention. The probabilistic skyline query is a powerful tool for managing uncertain data. However, the famous probabilistic skyline query, called p-skyline query, is likely to return unattractive objects which have no advantage in either their attributes or skyline probabilities with comparing to other query results. Moreover, it may return too many objects to offer any meaningful insight for customers. In this paper, we first propose a modified p-skyline (PS) query based on a strong dominance operator to identify truly attractive results. Then we formulate a top k MPS (TkMPS) query on the basis of a new ranking criterion. We present effective approaches for processing the MPS query, and extend these approaches to process the TkMPS query. To improve the query performance, the reuse technique is adopted. Extensive experiments verify that the proposed algorithms for the MPS and TkMPS queries are efficient and effective, our MPS query can filter out 34.44% unattractive objects from the p-skyline query results at most, and although in some cases the results of the MPS and the p-skyline queries are just the same, our MPS query needs much less CPU, I/O, and memory costs. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 14
页数:14
相关论文
共 50 条
  • [1] Efficiently answering probabilistic threshold top-k queries on uncertain data
    Hua, Ming
    Pei, Jian
    Zhang, Wenjie
    Lin, Xuemin
    [J]. 2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 1403 - +
  • [2] Probabilistic skyline queries on uncertain time series
    He, Guoliang
    Chen, Lu
    Zeng, Chen
    Zheng, Qiaoxian
    Zhou, Guofu
    [J]. NEUROCOMPUTING, 2016, 191 : 224 - 237
  • [3] An efficient scheme for probabilistic skyline queries over distributed uncertain data
    Xiaoyong Li
    Yijie Wang
    Jie Yu
    [J]. Telecommunication Systems, 2015, 60 : 225 - 237
  • [4] An efficient scheme for probabilistic skyline queries over distributed uncertain data
    Li, Xiaoyong
    Wang, Yijie
    Yu, Jie
    [J]. TELECOMMUNICATION SYSTEMS, 2015, 60 (02) : 225 - 237
  • [5] Ranking uncertain sky: The probabilistic top-k skyline operator
    Zhang, Ying
    Zhang, Wenjie
    Lin, Xuemin
    Jiang, Bin
    Pei, Jian
    [J]. INFORMATION SYSTEMS, 2011, 36 (05) : 898 - 915
  • [6] Probabilistic top-k dominating queries in uncertain databases
    Lian, Xiang
    Chen, Lei
    [J]. INFORMATION SCIENCES, 2013, 226 : 23 - 46
  • [7] Continuous Probabilistic Skyline Queries for Uncertain Moving Objects
    Fu, Shichang
    Dong, Yihong
    He, Maoshun
    [J]. 2010 2ND INTERNATIONAL ASIA CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS (CAR 2010), VOL 1, 2010, : 396 - 399
  • [8] PROBABILISTIC SKYLINE QUERIES OVER UNCERTAIN MOVING OBJECTS
    Ding, Xiaofeng
    Jin, Hai
    Xu, Hui
    Song, Wei
    [J]. COMPUTING AND INFORMATICS, 2013, 32 (05) : 987 - 1012
  • [9] Cleaning Uncertain Data for Top-k Queries
    Mo, Luyi
    Cheng, Reynold
    Li, Xiang
    Cheung, David W.
    Yang, Xuan S.
    [J]. 2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2013, : 134 - 145
  • [10] Preference-Based Top-k Representative Skyline Queries on Uncertain Databases
    Ha Thanh Huynh Nguyen
    Cao, Jinli
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PART II, 2015, 9078 : 280 - 292