SlimShot: In-Database Probabilistic Inference for Knowledge Bases

被引:17
|
作者
Gribkoff, Eric [1 ]
Suciu, Dan [1 ]
机构
[1] Univ Washington, Seattle, WA 98195 USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2016年 / 9卷 / 07期
关键词
D O I
10.14778/2904483.2904487
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Increasingly large Knowledge Bases are being created, by crawling the Web or other corpora of documents, and by extracting facts and relations using machine learning techniques. To manage the uncertainty in the data, these KBs rely on probabilistic engines based on Markov Logic Networks (MLN), for which probabilistic inference remains a major challenge. Today's state of the art systems use variants of MCMC, which have no theoretical error guarantees, and, as we show, suffer from poor performance in practice. In this paper we describe SlimShot (Scalable Lifted Inference and Monte Carlo Sampling Hybrid Optimization Technique), a probabilistic inference engine for knowledge bases. SlimShot converts the MLN to a tuple-independent probabilistic database, then uses a simple Monte Carlo-based inference, with three key enhancements: (1) it combines sampling with safe query evaluation, (2) it estimates a conditional probability by jointly computing the numerator and denominator, and (3) it adjusts the proposal distribution based on the sample cardinality. In combination, these three techniques allow us to give formal error guarantees, and we demonstrate empirically that SlimShot outperforms today's state of the art probabilistic inference engines used in knowledge bases.
引用
下载
收藏
页码:552 / 563
页数:12
相关论文
共 50 条
  • [31] Extensible Database Simulator for Fast Prototyping In-Database Algorithms
    Wang, Yifan
    Wang, Daisy Zhe
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 5029 - 5033
  • [32] Method for Restoring Consistency in Probabilistic Knowledge Bases
    Van Tham Nguyen
    Ngoc Thanh Nguyen
    Trong Hieu Tran
    Do Kieu Loan Nguyen
    CYBERNETICS AND SYSTEMS, 2018, 49 (5-6) : 317 - 338
  • [33] Measuring and repairing inconsistency in probabilistic knowledge bases
    Muino, David Picado
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2011, 52 (06) : 828 - 840
  • [34] Consolidation of Probabilistic Knowledge Bases by Inconsistency Minimization
    Potyka, Nico
    Thimm, Matthias
    21ST EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2014), 2014, 263 : 729 - +
  • [35] Knowledge Representation in Probabilistic Spatio-Temporal Knowledge Bases
    Parisi, Francesco
    Grant, John
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2016, 55 : 743 - 798
  • [36] Recent Advances in Querying Probabilistic Knowledge Bases
    Borgwardt, Stefan
    Ceylan, Ismail Ilkan
    Lukasiewicz, Thomas
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 5420 - 5426
  • [37] Reasoning about hybrid probabilistic knowledge bases
    Mu, Kedian
    Lin, Zuoquan
    Jin, Zhi
    Lu, Ruqian
    PRICAI 2006: TRENDS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4099 : 130 - 139
  • [38] Uncovering Probabilistic Implications in Typological Knowledge Bases
    Bjerva, Johannes
    Kementchedjhieva, Yova
    Cotterell, Ryan
    Augenstein, Isabelle
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3924 - 3930
  • [39] DB4Trans: In-Database Model Training Engine for Knowledge Graph Embedding
    Liu P.-K.
    Wang X.
    Liu B.-Z.
    Cai S.-T.
    Li S.-Z.
    Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (09): : 1969 - 1982
  • [40] AIDA - Abstraction for Advanced In-Database Analytics
    D'silva, Joseph Vinish
    De Moor, Florestan
    Kemme, Bettina
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2018, 11 (11): : 1400 - 1413