Efficient Approximation of Certain and Possible Answers for Ranking and Window Queries over Uncertain Data

被引:0
|
作者
Feng, Su [1 ]
Glavic, Boris [1 ]
Kennedy, Oliver [2 ]
机构
[1] Illinois Inst Technol, Chicago, IL 60616 USA
[2] SUNY Buffalo, Buffalo, NY USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2023年 / 16卷 / 06期
关键词
DATABASES; AGGREGATION; INFORMATION;
D O I
10.14778/3583140.3583151
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Uncertainty arises naturally in many application domains due to, e.g., data entry errors and ambiguity in data cleaning. Prior work in incomplete and probabilistic databases has investigated the semantics and efficient evaluation of ranking and top-k queries over uncertain data. However, most approaches deal with top-k and ranking in isolation and do represent uncertain input data and query results using separate, incompatible data models. We present an efficient approach for under- and over-approximating results of ranking, top-k, and window queries over uncertain data. Our approach integrates well with existing techniques for querying uncertain data, is efficient, and is to the best of our knowledge the first to support windowed aggregation. We design algorithms for physical operators for uncertain sorting and windowed aggregation, and implement them in PostgreSQL. We evaluated our approach on synthetic and real world datasets, demonstrating that it outperforms all competitors, and often produces more accurate results.
引用
收藏
页码:1346 / 1358
页数:13
相关论文
共 50 条
  • [41] Visible k nearest neighbor queries over uncertain data
    Wang, Yan-Qiu
    Xu, Chuan-Fei
    Yu, Ge
    Gu, Yu
    Chen, Mo
    [J]. Jisuanji Xuebao/Chinese Journal of Computers, 2010, 33 (10): : 1943 - 1952
  • [42] Efficient processing of keyword queries over graph databases for finding effective answers
    Park, Chang-Sup
    Lim, Sungchae
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2015, 51 (01) : 42 - 57
  • [43] Certain Answers and Rewritings for Local Regular Path Queries on Graph-Structured Data
    Shoaran, Maryam
    Thomo, Alex
    [J]. PROCEEDINGS OF THE FOURTEENTH INTERNATIONAL DATABASE ENGINEERING & APPLICATIONS SYMPOSIUM (IDEAS '10), 2010, : 186 - 192
  • [44] Efficient pruning for top-K ranking queries on attribute-wise uncertain datasets
    Jianwen Chen
    Ling Feng
    [J]. Journal of Intelligent Information Systems, 2017, 48 : 215 - 242
  • [45] Efficient pruning for top-K ranking queries on attribute-wise uncertain datasets
    Chen, Jianwen
    Feng, Ling
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2017, 48 (01) : 215 - 242
  • [46] Incremental evaluation of sliding-window queries over data streams
    Ghanem, Thanaa M.
    Hammad, Moustafa A.
    Mokbel, Mohamed F.
    Aref, Walid G.
    Elmagarmid, Ahmed K.
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2007, 19 (01) : 57 - 72
  • [47] Semantics and Implementation of Continuous Sliding Window Queries over Data Streams
    Kraemer, Juergen
    Seeger, Bernhard
    [J]. ACM TRANSACTIONS ON DATABASE SYSTEMS, 2009, 34 (01):
  • [48] An efficient algorithm for top-k queries on uncertain data streams
    Dai, Caiyan
    Chen, Ling
    Chen, Yixin
    Tang, Keming
    [J]. 2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 1, 2012, : 294 - 299
  • [49] An Efficient Optimization Approach for Top-k Queries on Uncertain Data
    Zhang, Zhiqiang
    Wei, Xiaoyan
    Xie, Xiaoqin
    Pan, Haiwei
    Miao, Yu
    [J]. INTERNATIONAL JOURNAL OF COOPERATIVE INFORMATION SYSTEMS, 2018, 27 (01)
  • [50] SUBSTITUTION: An Efficient Algorithm for Probability Skyline Queries on Discrete Uncertain Data
    Ma, Zhixin
    Zhang, Qiang
    Qi, Wei
    [J]. PROCEEDINGS OF 2012 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2012), 2012, : 1927 - 1933