The Sum-Product Theorem: A Foundation for Learning Tractable Models

被引:0
|
作者
Friesen, Abram L. [1 ]
Domingos, Pedro [1 ]
机构
[1] Univ Washington, Dept Comp Sci & Engn, Seattle, WA 98195 USA
关键词
CONSTRAINT-SATISFACTION; INFERENCE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Inference in expressive probabilistic models is generally intractable, which makes them difficult to learn and limits their applicability. Sum-product networks are a class of deep models where, surprisingly, inference remains tractable even when an arbitrary number of hidden layers are present. In this paper, we generalize this result to a much broader set of learning problems: all those where inference consists of summing a function over a semiring. This includes satisfiability, constraint satisfaction, optimization, integration, and others. In any semiring, for summation to be tractable it suffices that the factors of every product have disjoint scopes. This unifies and extends many previous results in the literature. Enforcing this condition at learning time thus ensures that the learned models are tractable. We illustrate the power and generality of this approach by applying it to a new type of structured prediction problem: learning a nonconvex function that can be globally optimized in polynomial time. We show empirically that this greatly outperforms the standard approach of learning without regard to the cost of optimization.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Sum-Product Networks: Deep Models with Tractable Inference
    Domingos, Pedro
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2015, PT I, 2015, 9284
  • [2] Interventional Sum-Product Networks: Causal Inference with Tractable Probabilistic Models
    Zecevic, Matej
    Dhami, Devendra Singh
    Karanam, Athresh
    Natarajan, Sriraam
    Kersting, Kristian
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [3] Explaining Deep Tractable Probabilistic Models: The sum-product network case
    Karanam, Athresh
    Mathur, Saurabh
    Radivojac, Predrag
    Haas, David M.
    Kersting, Kristian
    Natarajan, Sriraam
    INTERNATIONAL CONFERENCE ON PROBABILISTIC GRAPHICAL MODELS, VOL 186, 2022, 186
  • [4] A Sum-Product Theorem in Function Fields
    Bloom, Thomas F.
    Jones, Timothy G. F.
    INTERNATIONAL MATHEMATICS RESEARCH NOTICES, 2014, 2014 (19) : 5249 - 5263
  • [5] Sum-product graphical models
    Desana, Mattia
    Schnoerr, Christoph
    MACHINE LEARNING, 2020, 109 (01) : 135 - 173
  • [6] THE SUM-PRODUCT THEOREM IN Zq WITH q ARBITRARY
    Bourgain, Jean
    JOURNAL D ANALYSE MATHEMATIQUE, 2008, 106 (1): : 1 - 93
  • [7] The sum-product theorem in ℤq with q arbitrary
    Jean Bourgain
    Journal d'Analyse Mathématique, 2008, 106 : 1 - 93
  • [8] Applications of the sum-product theorem in finite fields
    Wigderson, Avi
    CCC 2006: TWENTY-FIRST ANNUAL IEEE CONFERENCE ON COMPUTATIONAL COMPLEXITY, PROCEEDINGS, 2006, : 111 - 111
  • [9] On Learning the Structure of Sum-Product Networks
    Butz, Cory J.
    Oliveira, Jhonatan S.
    dos Santos, Andre E.
    2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 2997 - 3004
  • [10] Bayesian Learning of Sum-Product Networks
    Trapp, Martin
    Peharz, Robert
    Ge, Hong
    Pernkopf, Franz
    Ghahramani, Zoubin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32