Efficient Approximation of Certain and Possible Answers for Ranking and Window Queries over Uncertain Data

被引:0
|
作者
Feng, Su [1 ]
Glavic, Boris [1 ]
Kennedy, Oliver [2 ]
机构
[1] Illinois Inst Technol, Chicago, IL 60616 USA
[2] SUNY Buffalo, Buffalo, NY USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2023年 / 16卷 / 06期
关键词
DATABASES; AGGREGATION; INFORMATION;
D O I
10.14778/3583140.3583151
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Uncertainty arises naturally in many application domains due to, e.g., data entry errors and ambiguity in data cleaning. Prior work in incomplete and probabilistic databases has investigated the semantics and efficient evaluation of ranking and top-k queries over uncertain data. However, most approaches deal with top-k and ranking in isolation and do represent uncertain input data and query results using separate, incompatible data models. We present an efficient approach for under- and over-approximating results of ranking, top-k, and window queries over uncertain data. Our approach integrates well with existing techniques for querying uncertain data, is efficient, and is to the best of our knowledge the first to support windowed aggregation. We design algorithms for physical operators for uncertain sorting and windowed aggregation, and implement them in PostgreSQL. We evaluated our approach on synthetic and real world datasets, demonstrating that it outperforms all competitors, and often produces more accurate results.
引用
收藏
页码:1346 / 1358
页数:13
相关论文
共 50 条
  • [11] GDPS: An Efficient Approach for Skyline Queries over Distributed Uncertain Data
    Li, Xiaoyong
    Wang, Yijie
    Li, Xiaoling
    Wang, Xiaowei
    yu, Jie
    [J]. BIG DATA RESEARCH, 2014, 1 (01) : 23 - 36
  • [12] Efficient and Progressive Algorithms for Distributed Skyline Queries over Uncertain Data
    Ding, Xiaofeng
    Jin, Hai
    [J]. 2010 INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS ICDCS 2010, 2010,
  • [13] An efficient scheme for probabilistic skyline queries over distributed uncertain data
    Li, Xiaoyong
    Wang, Yijie
    Yu, Jie
    [J]. TELECOMMUNICATION SYSTEMS, 2015, 60 (02) : 225 - 237
  • [14] Efficient and Progressive Algorithms for Distributed Skyline Queries over Uncertain Data
    Ding, Xiaofeng
    Jin, Hai
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (08) : 1448 - 1462
  • [15] Approximation algorithms for aggregate queries on uncertain data
    Chen D.
    Chen L.
    Wang J.
    Wu Y.
    Wang J.
    [J]. Qinghua Daxue Xuebao/Journal of Tsinghua University, 2018, 58 (03): : 231 - 236
  • [16] A survey of queries over uncertain data
    Yijie Wang
    Xiaoyong Li
    Xiaoling Li
    Yuan Wang
    [J]. Knowledge and Information Systems, 2013, 37 : 485 - 530
  • [17] A survey of queries over uncertain data
    Wang, Yijie
    Li, Xiaoyong
    Li, Xiaoling
    Wang, Yuan
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2013, 37 (03) : 485 - 530
  • [18] Efficient Range Queries over Uncertain Strings
    Dai, Dongbo
    Xie, Jiang
    Zhang, Huiran
    Dong, Jiaqi
    [J]. SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, SSDBM 2012, 2012, 7338 : 75 - 95
  • [19] Efficient processing of probabilistic reverse nearest neighbor queries over uncertain data
    Lian, Xiang
    Chen, Lei
    [J]. VLDB JOURNAL, 2009, 18 (03): : 787 - 808
  • [20] Efficient processing of probabilistic reverse nearest neighbor queries over uncertain data
    Xiang Lian
    Lei Chen
    [J]. The VLDB Journal, 2009, 18 : 787 - 808