Representing uncertain data: models, properties, and algorithms

被引:0
|
作者
Anish Das Sarma
Omar Benjelloun
Alon Halevy
Shubha Nabar
Jennifer Widom
机构
[1] Stanford University,
[2] Google Inc.,undefined
[3] Microsoft Corp,undefined
来源
The VLDB Journal | 2009年 / 18卷
关键词
Uncertain data; Data modeling; Uncertainty;
D O I
暂无
中图分类号
学科分类号
摘要
In general terms, an uncertain relation encodes a set of possible certain relations. There are many ways to represent uncertainty, ranging from alternative values for attributes to rich constraint languages. Among the possible models for uncertain data, there is a tension between simple and intuitive models, which tend to be incomplete, and complete models, which tend to be nonintuitive and more complex than necessary for many applications. We present a space of models for representing uncertain data based on a variety of uncertainty constructs and tuple-existence constraints. We explore a number of properties and results for these models. We study completeness of the models, as well as closure under relational operations, and we give results relating closure and completeness. We then examine whether different models guarantee unique representations of uncertain data, and for those models that do not, we provide complexity results and algorithms for testing equivalence of representations. The next problem we consider is that of minimizing the size of representation of models, showing that minimizing the number of tuples also minimizes the size of constraints. We show that minimization is intractable in general and study the more restricted problem of maintaining minimality incrementally when performing operations. Finally, we present several results on the problem of approximating uncertain data in an insufficiently expressive model.
引用
收藏
页码:989 / 1019
页数:30
相关论文
共 50 条
  • [41] Representing uncertainty in limited-area data assimilating ocean models
    Sandery, Paul A.
    Jones, Emlyn
    Griffin, David
    OCEAN MODELLING, 2024, 187
  • [42] Representing, storing and accessing molecular interaction data:: a review of models and tools
    Stromback, Lena
    Jakoniene, Vaida
    Tan, He
    Lambrix, Patrick
    BRIEFINGS IN BIOINFORMATICS, 2006, 7 (04) : 331 - 338
  • [43] MFAML: a standard data structure for representing and exchanging metabolic flux models
    Yun, H
    Lee, DY
    Jeong, J
    Lee, S
    Lee, SY
    BIOINFORMATICS, 2005, 21 (15) : 3329 - 3330
  • [44] New Data for Representing Irrigated Agriculture in Economy-Wide Models
    Ledvina, Kirby
    Winchester, Niven
    Strzepek, Kenneth
    Reilly, John M.
    JOURNAL OF GLOBAL ECONOMIC ANALYSIS, 2018, 3 (01): : 122 - 155
  • [45] Integer Programming Models to Manage Consensus for Uncertain MCGDM Based on PSO Algorithms
    Wu, Zhibin
    Ma, Ning
    Zeng, Ziqiang
    Xu, Jiuping
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2019, 27 (05) : 888 - 902
  • [46] Models and algorithms for multi-fidelity data
    Forbes, Alistair B.
    ADVANCED MATHEMATICAL AND COMPUTATIONAL TOOLS IN METROLOGY AND TESTING XI, 2019, 89 : 178 - 185
  • [47] Developing mathematical models and intelligent sustainable supply chains by uncertain parameters and algorithms
    Nazari, Massoumeh
    Nayeri, Mahmoud Dehghan
    Hafshjani, Kiamars Fathi
    AIMS MATHEMATICS, 2024, 9 (03): : 5204 - 5233
  • [48] Developing mathematical models and intelligent sustainable supply chains by uncertain parameters and algorithms
    Nazari, Massoumeh
    Nayeri, Mahmoud Dehghan
    Hafshjani, Kiamars Fathi
    AIMS MATHEMATICS, 2024, 9 (09): : 25223 - 25231
  • [49] Algorithms and Data Structures for New Models of Computation
    Black, Paul E.
    Flater, David
    Bojanova, Irena
    IT PROFESSIONAL, 2021, 23 (01) : 9 - 15
  • [50] Auxiliary Variable-Based Identification Algorithms for Uncertain-Input Models
    Chen, Jing
    Zhu, Quanmin
    Chandra, Budi
    Pu, Yan
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2020, 39 (07) : 3389 - 3404