The Sum-Product Theorem: A Foundation for Learning Tractable Models

被引：0

作者：

Friesen, Abram L. ^{[1
]}

Domingos, Pedro ^{[1
]}

机构：

[1] Univ Washington, Dept Comp Sci & Engn, Seattle, WA 98195 USA

来源：

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48 | 2016年 / 48卷

关键词：

CONSTRAINT-SATISFACTION; INFERENCE;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Inference in expressive probabilistic models is generally intractable, which makes them difficult to learn and limits their applicability. Sum-product networks are a class of deep models where, surprisingly, inference remains tractable even when an arbitrary number of hidden layers are present. In this paper, we generalize this result to a much broader set of learning problems: all those where inference consists of summing a function over a semiring. This includes satisfiability, constraint satisfaction, optimization, integration, and others. In any semiring, for summation to be tractable it suffices that the factors of every product have disjoint scopes. This unifies and extends many previous results in the literature. Enforcing this condition at learning time thus ensures that the learned models are tractable. We illustrate the power and generality of this approach by applying it to a new type of structured prediction problem: learning a nonconvex function that can be globally optimized in polynomial time. We show empirically that this greatly outperforms the standard approach of learning without regard to the cost of optimization.

引用

页数：10

共 50 条

[1] Sum-Product Networks: Deep Models with Tractable Inference
Domingos, Pedro
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2015, PT I, 2015, 9284
[2] Interventional Sum-Product Networks: Causal Inference with Tractable Probabilistic Models
Zecevic, Matej
Dhami, Devendra Singh
Karanam, Athresh
Natarajan, Sriraam
Kersting, Kristian
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[3] Explaining Deep Tractable Probabilistic Models: The sum-product network case
Karanam, Athresh
Mathur, Saurabh
Radivojac, Predrag
Haas, David M.
Kersting, Kristian
Natarajan, Sriraam
INTERNATIONAL CONFERENCE ON PROBABILISTIC GRAPHICAL MODELS, VOL 186, 2022, 186
[4] A Sum-Product Theorem in Function Fields
Bloom, Thomas F.
Jones, Timothy G. F.
INTERNATIONAL MATHEMATICS RESEARCH NOTICES, 2014, 2014 (19) : 5249 - 5263
[5] Sum-product graphical models
Desana, Mattia
Schnoerr, Christoph
MACHINE LEARNING, 2020, 109 (01) : 135 - 173
[6] THE SUM-PRODUCT THEOREM IN Zq WITH q ARBITRARY
Bourgain, Jean
JOURNAL D ANALYSE MATHEMATIQUE, 2008, 106 (1): : 1 - 93
[7] The sum-product theorem in ℤq with q arbitrary
Jean Bourgain
Journal d'Analyse Mathématique, 2008, 106 : 1 - 93
[8] Applications of the sum-product theorem in finite fields
Wigderson, Avi
CCC 2006: TWENTY-FIRST ANNUAL IEEE CONFERENCE ON COMPUTATIONAL COMPLEXITY, PROCEEDINGS, 2006, : 111 - 111
[9] On Learning the Structure of Sum-Product Networks
Butz, Cory J.
Oliveira, Jhonatan S.
dos Santos, Andre E.
2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 2997 - 3004
[10] Bayesian Learning of Sum-Product Networks
Trapp, Martin
Peharz, Robert
Ge, Hong
Pernkopf, Franz
Ghahramani, Zoubin
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32

← 1 2 3 4 5 →