Representing uncertain data: models, properties, and algorithms

被引:0
|
作者
Anish Das Sarma
Omar Benjelloun
Alon Halevy
Shubha Nabar
Jennifer Widom
机构
[1] Stanford University,
[2] Google Inc.,undefined
[3] Microsoft Corp,undefined
来源
The VLDB Journal | 2009年 / 18卷
关键词
Uncertain data; Data modeling; Uncertainty;
D O I
暂无
中图分类号
学科分类号
摘要
In general terms, an uncertain relation encodes a set of possible certain relations. There are many ways to represent uncertainty, ranging from alternative values for attributes to rich constraint languages. Among the possible models for uncertain data, there is a tension between simple and intuitive models, which tend to be incomplete, and complete models, which tend to be nonintuitive and more complex than necessary for many applications. We present a space of models for representing uncertain data based on a variety of uncertainty constructs and tuple-existence constraints. We explore a number of properties and results for these models. We study completeness of the models, as well as closure under relational operations, and we give results relating closure and completeness. We then examine whether different models guarantee unique representations of uncertain data, and for those models that do not, we provide complexity results and algorithms for testing equivalence of representations. The next problem we consider is that of minimizing the size of representation of models, showing that minimizing the number of tuples also minimizes the size of constraints. We show that minimization is intractable in general and study the more restricted problem of maintaining minimality incrementally when performing operations. Finally, we present several results on the problem of approximating uncertain data in an insufficiently expressive model.
引用
收藏
页码:989 / 1019
页数:30
相关论文
共 50 条
  • [1] Representing uncertain data: models, properties, and algorithms
    Das Sarma, Anish
    Benjelloun, Omar
    Halevy, Alon
    Nabar, Shubha
    Widom, Jennifer
    VLDB JOURNAL, 2009, 18 (05): : 989 - 1019
  • [2] REPRESENTING AND MANIPULATING UNCERTAIN DATA
    MORRISSEY, JM
    INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1992, 36 (02): : 183 - 189
  • [3] Cost Models and Efficient Algorithms on Existentially Uncertain Spatial Data
    Frentzos, Elias
    Pelekis, Nikos
    Theodoridis, Yannis
    PCI 2008: 12TH PAN-HELLENIC CONFERENCE ON INFORMATICS, PROCEEDINGS, 2008, : 26 - 30
  • [4] Asymptotically Exact Data Augmentation: Models, Properties, and Algorithms
    Vono, Maxime
    Dobigeon, Nicolas
    Chainais, Pierre
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2021, 30 (02) : 335 - 348
  • [5] A Survey of Uncertain Data Algorithms and Applications
    Aggarwal, Charu C.
    Yu, Philip S.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (05) : 609 - 623
  • [6] Models and algorithms for distribution problems with uncertain demands
    Cheung, RKM
    Powell, WB
    TRANSPORTATION SCIENCE, 1996, 30 (01) : 43 - 59
  • [7] Data reconciliation with uncertain models
    Maquin, D
    Adrot, O
    Ragot, J
    ISA TRANSACTIONS, 2000, 39 (01) : 35 - 45
  • [8] Approximation algorithms for aggregate queries on uncertain data
    Chen D.
    Chen L.
    Wang J.
    Wu Y.
    Wang J.
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2018, 58 (03): : 231 - 236
  • [9] A Review of Uncertain Data Stream Clustering Algorithms
    Yang, Yue
    Liu, Zhuo
    Xing, Zhidan
    2015 EIGHTH INTERNATIONAL CONFERENCE ON INTERNET COMPUTING FOR SCIENCE AND ENGINEERING (ICICSE), 2015, : 111 - 116
  • [10] Representing and processing lineages over uncertain data based on the Bayesian network
    Yue, Kun
    Wu, Hao
    Liu, Weiyi
    Zhu, Yunlei
    APPLIED SOFT COMPUTING, 2015, 37 : 345 - 362