Efficient computation for probabilistic skyline over uncertain preferences

被引:14
|
作者
Pujari, Arun K. [1 ]
Kagita, Venkateswara Rao [1 ]
Garg, Anubhuti [2 ]
Padmanabhan, Vineet [1 ]
机构
[1] Univ Hyderabad, Sch Comp & Informat Sci, Artificial Intelligence Lab, Hyderbad 500046, Andhra Pradesh, India
[2] LNM Inst Informat Technol, Jaipur, Rajasthan, India
关键词
Skyline query; Skyline computation; Uncertain preferences;
D O I
10.1016/j.ins.2015.06.041
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Efficient computation of skyline probability over uncertain preferences has not received much attention in the database community as compared to skyline probability computation over uncertain data. All known algorithms for probabilistic skyline computation over uncertain preferences attempt to find inexact value of skyline probability by resorting to sampling or to approximation scheme. Exact computation of skyline probability for database with uncertain preferences of moderate size is not possible with any of the existing algorithms. In this paper, we propose an efficient algorithm that can compute skyline probability exactly for reasonably large database. The inclusion exclusion principle is used to express skyline probability in terms of joint probabilities of all possible combination. In this regard we introduce the concept of zero-contributing set which has zero effect in the signed aggregate of joint probabilities. Our algorithm employs a prefix-based k-level absorption to identify zero-contributing sets. It is shown empirically that only a very small portion of exponential search space remains after level wise application of prefix-based absorption. Thus it becomes possible to compute skyline probability with respect to large datasets. Detailed experimental analysis for real and synthetic datasets are reported to corroborate this claim. We also propose an incremental algorithm to compute skyline probability in dynamic scenarios wherein objects are added incrementally. Moreover, the theoretical concepts developed in this paper help to devise an efficient technique to compute skyline probability of all objects in the database. We show that the exponential search space is pruned once and then for each individual object skyline probability can be derived by inspecting a portion of the pruned lattice. We also use a concept of revival of absorbed pairs. We believe that this process is more efficient than computing the skyline probability individually. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:146 / 162
页数:17
相关论文
共 50 条
  • [1] Parallelization of skyline probability computation over uncertain preferences
    Zhu, Haoyang
    Zhu, Peidong
    Li, Xiaoyong
    Liu, Qiang
    Xun, Peng
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2017, 29 (18):
  • [2] Efficient Skyline Computation on Uncertain Dimensions
    Saad, Nurul Husna Mohd
    Ibrahim, Hamidah
    Sidi, Fatimah
    Yaakob, Razali
    Alwan, Ali A.
    [J]. IEEE ACCESS, 2021, 9 : 96975 - 96994
  • [3] Probabilistic n-of-N skyline computation over uncertain data streams
    Zhang, Wenjie
    Li, Aiping
    Cheema, Muhammad Aamir
    Zhang, Ying
    Chang, Lijun
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2015, 18 (05): : 1331 - 1350
  • [4] Probabilistic n-of-N Skyline Computation over Uncertain Data Streams
    Zhang, Wenjie
    Li, Aiping
    Cheema, Muhammad Aamir
    Zhang, Ying
    Chang, Lijun
    [J]. WEB INFORMATION SYSTEMS ENGINEERING - WISE 2013, PT II, 2013, 8181 : 439 - 457
  • [5] Probabilistic n-of-N skyline computation over uncertain data streams
    Wenjie Zhang
    Aiping Li
    Muhammad Aamir Cheema
    Ying Zhang
    Lijun Chang
    [J]. World Wide Web, 2015, 18 : 1331 - 1350
  • [6] An efficient scheme for probabilistic skyline queries over distributed uncertain data
    Xiaoyong Li
    Yijie Wang
    Jie Yu
    [J]. Telecommunication Systems, 2015, 60 : 225 - 237
  • [7] An efficient scheme for probabilistic skyline queries over distributed uncertain data
    Li, Xiaoyong
    Wang, Yijie
    Yu, Jie
    [J]. TELECOMMUNICATION SYSTEMS, 2015, 60 (02) : 225 - 237
  • [8] ProbSky: Efficient Computation of Probabilistic Skyline Queries Over Distributed Data
    Kuo, Ai-Te
    Chen, Haiquan
    Tang, Liang
    Ku, Wei-Shinn
    Qin, Xiao
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (05) : 5173 - 5186
  • [9] Probabilistic Skyline Computation on Vertically Distributed Uncertain Data
    Zhang, Kaiqi
    Wang, Jinbao
    Wang, Muxian
    Han, Xixian
    [J]. 2019 39TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2019), 2019, : 154 - 163
  • [10] Efficient Computation of Top-K Skyline Objects in Data Set With Uncertain Preferences
    Sukhwani, Nitesh
    Kagita, Venkateswara Rao
    Kumar, Vikas
    Panda, Sanjaya Kumar
    [J]. INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2021, 17 (03) : 68 - 80