SlimShot: In-Database Probabilistic Inference for Knowledge Bases

被引:17
|
作者
Gribkoff, Eric [1 ]
Suciu, Dan [1 ]
机构
[1] Univ Washington, Seattle, WA 98195 USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2016年 / 9卷 / 07期
关键词
D O I
10.14778/2904483.2904487
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Increasingly large Knowledge Bases are being created, by crawling the Web or other corpora of documents, and by extracting facts and relations using machine learning techniques. To manage the uncertainty in the data, these KBs rely on probabilistic engines based on Markov Logic Networks (MLN), for which probabilistic inference remains a major challenge. Today's state of the art systems use variants of MCMC, which have no theoretical error guarantees, and, as we show, suffer from poor performance in practice. In this paper we describe SlimShot (Scalable Lifted Inference and Monte Carlo Sampling Hybrid Optimization Technique), a probabilistic inference engine for knowledge bases. SlimShot converts the MLN to a tuple-independent probabilistic database, then uses a simple Monte Carlo-based inference, with three key enhancements: (1) it combines sampling with safe query evaluation, (2) it estimates a conditional probability by jointly computing the numerator and denominator, and (3) it adjusts the proposal distribution based on the sample cardinality. In combination, these three techniques allow us to give formal error guarantees, and we demonstrate empirically that SlimShot outperforms today's state of the art probabilistic inference engines used in knowledge bases.
引用
下载
收藏
页码:552 / 563
页数:12
相关论文
共 50 条
  • [1] A Comparative Study of in-Database Inference Approaches
    Lin, Qiuru
    Wu, Sai
    Zhao, Junbo
    Dai, Jian
    Li, Feifei
    Chen, Gang
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 1794 - 1807
  • [2] Learning and Inference in Tractable Probabilistic Knowledge Bases
    Niepert, Mathias
    Domingos, Pedro
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2015, : 632 - 641
  • [3] InferDB: In-Database Machine Learning Inference Using Indexes
    Salazar-Diaz, Ricardo
    Glavic, Boris
    Rabl, Tilmann
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2024, 17 (08): : 1830 - 1842
  • [4] In-database batch and query-time inference over probabilistic graphical models using UDA–GIST
    Kun Li
    Xiaofeng Zhou
    Daisy Zhe Wang
    Christan Grant
    Alin Dobra
    Christopher Dudley
    The VLDB Journal, 2017, 26 : 177 - 201
  • [5] In-database batch and query-time inference over probabilistic graphical models using UDA-GIST
    Li, Kun
    Zhou, Xiaofeng
    Wang, Daisy Zhe
    Grant, Christan
    Dobra, Alin
    Dudley, Christopher
    VLDB JOURNAL, 2017, 26 (02): : 177 - 201
  • [6] PROBABILISTIC KNOWLEDGE BASES
    WUTHRICH, B
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1995, 7 (05) : 691 - 698
  • [7] TOWARDS PROBABILISTIC KNOWLEDGE BASES
    WUTHRICH, B
    LECTURE NOTES IN ARTIFICIAL INTELLIGENCE, 1992, 624 : 66 - 77
  • [8] In-Database Analytics with ibmdbpy
    Fouche, Edouard
    Eckert, Alexander
    Boehm, Klemens
    30TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT (SSDBM 2018), 2018,
  • [9] Knowledge Expansion over Probabilistic Knowledge Bases
    Chen, Yang
    Wang, Daisy Zhe
    SIGMOD'14: PROCEEDINGS OF THE 2014 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2014, : 649 - 660
  • [10] Special issue on in-database analytics
    Olteanu, Dan
    Rusu, Florin
    DISTRIBUTED AND PARALLEL DATABASES, 2017, 35 (3-4) : 333 - 334