SlimShot: In-Database Probabilistic Inference for Knowledge Bases

被引:17
|
作者
Gribkoff, Eric [1 ]
Suciu, Dan [1 ]
机构
[1] Univ Washington, Seattle, WA 98195 USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2016年 / 9卷 / 07期
关键词
D O I
10.14778/2904483.2904487
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Increasingly large Knowledge Bases are being created, by crawling the Web or other corpora of documents, and by extracting facts and relations using machine learning techniques. To manage the uncertainty in the data, these KBs rely on probabilistic engines based on Markov Logic Networks (MLN), for which probabilistic inference remains a major challenge. Today's state of the art systems use variants of MCMC, which have no theoretical error guarantees, and, as we show, suffer from poor performance in practice. In this paper we describe SlimShot (Scalable Lifted Inference and Monte Carlo Sampling Hybrid Optimization Technique), a probabilistic inference engine for knowledge bases. SlimShot converts the MLN to a tuple-independent probabilistic database, then uses a simple Monte Carlo-based inference, with three key enhancements: (1) it combines sampling with safe query evaluation, (2) it estimates a conditional probability by jointly computing the numerator and denominator, and (3) it adjusts the proposal distribution based on the sample cardinality. In combination, these three techniques allow us to give formal error guarantees, and we demonstrate empirically that SlimShot outperforms today's state of the art probabilistic inference engines used in knowledge bases.
引用
下载
收藏
页码:552 / 563
页数:12
相关论文
共 50 条
  • [21] In-database connected component analysis
    Bogeholz, Harald
    Brand, Michael
    Todor, Radu-Alexandru
    2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, : 1525 - 1536
  • [22] INDREX: In-database relation extraction
    Kilias, Torsten
    Loeser, Alexander
    Andritsos, Periklis
    INFORMATION SYSTEMS, 2015, 53 : 124 - 144
  • [23] Towards Lifted Inference Under Maximum Entropy for Probabilistic Relational FO-PCL Knowledge Bases
    Beierle, Christoph
    Potyka, Nico
    Baudisch, Josef
    Finthammer, Marc
    SYMBOLIC AND QUANTITATIVE APPROACHES TO REASONING WITH UNCERTAINTY, ECSQARU 2015, 2015, 9161 : 506 - 516
  • [24] In-Database Learning with Sparse Tensors
    Khamis, Mahmoud Abo
    Ngo, Hung Q.
    Nguyen, XuanLong
    Olteanu, Dan
    Schleich, Maximilian
    PODS'18: PROCEEDINGS OF THE 37TH ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2018, : 325 - 340
  • [25] LAYERED KNOWLEDGE CHUNKS FOR DATABASE INFERENCE
    HINKE, TH
    DELUGACH, HS
    CHANDRASEKHAR, A
    DATABASE SECURITY, VII - STATUS AND PROSPECTS, 1994, 47 : 275 - 295
  • [26] In-Database Graph Analytics with Recursive SPARQL
    Hogan, Aidan
    Reutter, Juan L.
    Soto, Adrian
    SEMANTIC WEB - ISWC 2020, PT I, 2020, 12506 : 511 - 528
  • [27] A recency inference engine for connectionist knowledge bases
    Ghalwash, AZ
    APPLIED INTELLIGENCE, 1998, 9 (03) : 201 - 215
  • [28] A Recency Inference Engine for Connectionist Knowledge Bases
    Atef Z. Ghalwash
    Applied Intelligence, 1998, 9 : 201 - 215
  • [29] Structural Inference from Conditional Knowledge Bases
    Kern-Isberner, Gabriele
    Eichhorn, Christian
    STUDIA LOGICA, 2014, 102 (04) : 751 - 769
  • [30] Structural Inference from Conditional Knowledge Bases
    Gabriele Kern-Isberner
    Christian Eichhorn
    Studia Logica, 2014, 102 : 751 - 769