On a theory of probabilistic deductive databases

被引:29
|
作者
Lakshmanan, LVS [1 ]
Sadri, F
机构
[1] Concordia Univ, Dept Comp Sci, Montreal, PQ H3G 1M8, Canada
[2] Indian Inst Technol, KR Sch Informat Technol, Bombay, Maharashtra, India
[3] Univ N Carolina, Dept Math Sci, Greensboro, NC 27412 USA
关键词
probabalistic databases; uncertain databases; deductive databases;
D O I
10.1017/S1471068400001058
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We propose a framework for modeling uncertainty where both belief and doubt can be given independent, first-class status. We adopt probability theory as the mathematical formalism for manipulating uncertainty. An agent can express the uncertainty in her knowledge about a piece of information in the form of a confidence level, consisting of a pair of intervals of probability, one for each of her belief and doubt. The space of confidence levels naturally leads to the notion of a trilattice, similar in spirit to Fitting's bilattices. Intuitively, the points in such a trilattice can be ordered according to truth, information, or precision. We develop a framework for probabilistic deductive databases by associating confidence levels with the facts and rules of a classical deductive database. While the trilattice structure offers a variety of choices for defining the semantics of probabilistic deductive databases, our choice of semantics is based on the truth-ordering, which we find to be closest to the classical framework for deductive databases. In addition to proposing a declarative semantics based on valuations and an equivalent semantics based on fixpoint theory, we also propose a proof procedure and prove it sound and complete. We show that while classical Datalog query programs have a polynomial time data complexity, certain query programs in the probabilistic deductive database framework do not even terminate on some input databases. We identify a large natural class of query programs of practical interest in our framework, and show that programs in this class possess polynomial time data complexity, i.e. not only do they terminate on every input database, they are guaranteed to do so in a number of steps polynomial in the input database size.
引用
收藏
页码:5 / 42
页数:38
相关论文
共 50 条