SlimShot: In-Database Probabilistic Inference for Knowledge Bases

被引:17
|
作者
Gribkoff, Eric [1 ]
Suciu, Dan [1 ]
机构
[1] Univ Washington, Seattle, WA 98195 USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2016年 / 9卷 / 07期
关键词
D O I
10.14778/2904483.2904487
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Increasingly large Knowledge Bases are being created, by crawling the Web or other corpora of documents, and by extracting facts and relations using machine learning techniques. To manage the uncertainty in the data, these KBs rely on probabilistic engines based on Markov Logic Networks (MLN), for which probabilistic inference remains a major challenge. Today's state of the art systems use variants of MCMC, which have no theoretical error guarantees, and, as we show, suffer from poor performance in practice. In this paper we describe SlimShot (Scalable Lifted Inference and Monte Carlo Sampling Hybrid Optimization Technique), a probabilistic inference engine for knowledge bases. SlimShot converts the MLN to a tuple-independent probabilistic database, then uses a simple Monte Carlo-based inference, with three key enhancements: (1) it combines sampling with safe query evaluation, (2) it estimates a conditional probability by jointly computing the numerator and denominator, and (3) it adjusts the proposal distribution based on the sample cardinality. In combination, these three techniques allow us to give formal error guarantees, and we demonstrate empirically that SlimShot outperforms today's state of the art probabilistic inference engines used in knowledge bases.
引用
下载
收藏
页码:552 / 563
页数:12
相关论文
共 50 条
  • [41] In-Database Machine Learning with SQL on GPUs
    Schuele, Maximilian
    Lang, Harald
    Springer, Maximilian
    Kemper, Alfons
    Neumann, Thomas
    Guennemann, Stephan
    33RD INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT (SSDBM 2021), 2020, : 25 - 36
  • [42] AC/DC: In-Database Learning Thunderstruck
    Khamis, Mahmoud Abo
    Ngo, Hung Q.
    Nguyen, XuanLong
    Olteanu, Dan
    Schleich, Maximilian
    PROCEEDINGS OF THE SECOND WORKSHOP ON DATA MANAGEMENT FOR END-TO-END MACHINE LEARNING, 2018,
  • [43] IDEL: In-Database Neural Entity Linking
    Kilias, Torsten
    Loeser, Alexander
    Gers, Felix
    Zhang, Ying
    Koopmanschap, Richard
    Kersten, Martin
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2019, : 56 - 63
  • [44] In-Database Regression in Input Sparsity Time
    Jayaram, Rajesh
    Samadian, Alireza
    Woodruff, David P.
    Ye, Peng
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [45] Inference Algorithm for Knowledge Bases with Rule Cluster Structure
    Nowak-Brzezinska, Agnieszka
    Gaibei, Igor
    COMPUTATIONAL SCIENCE, ICCS 2024, PT III, 2024, 14834 : 71 - 78
  • [46] Probabilistic Inference for Cold Start Knowledge Base Population with PriorWorld Knowledge
    Min, Bonan
    Freedman, Marjorie
    Meltzer, Talya
    15TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2017), VOL 1: LONG PAPERS, 2017, : 601 - 612
  • [47] Probabilistic Hybrid Knowledge Bases Under the Distribution Semantics
    Alberti, Marco
    Lamma, Evelina
    Riguzzi, Fabrizio
    Zese, Riccardo
    AI*IA 2016: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2016, 10037 : 364 - 376
  • [48] Consolidating Probabilistic Knowledge Bases via Belief Contraction
    De Bona, Glauber
    Finger, Marcelo
    Ribeiro, Marcio M.
    Santos, Yuri D.
    Wassermann, Renata
    FIFTEENTH INTERNATIONAL CONFERENCE ON THE PRINCIPLES OF KNOWLEDGE REPRESENTATION AND REASONING, 2016, : 125 - 134
  • [49] Possibilistic and standard probabilistic semantics of conditional knowledge bases
    Benferhat, S
    Dubois, D
    Prade, H
    JOURNAL OF LOGIC AND COMPUTATION, 1999, 9 (06) : 873 - 895
  • [50] ArchimedesOne: Query Processing over Probabilistic Knowledge Bases
    Zhou, Xiaofeng
    Chen, Yang
    Wang, Daisy Zhe
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2016, 9 (13): : 1461 - 1464