Discrete Graphical Models - An Optimization Perspective

被引:9
|
作者
Savchynskyy, Bogdan [1 ]
机构
[1] Heidelberg Univ, Heidelberg, Germany
关键词
COORDINATE DESCENT METHOD; MARKOV RANDOM-FIELDS; LAGRANGEAN DECOMPOSITION; ENERGY MINIMIZATION; MAP ESTIMATION; CONVERGENCE; PERSISTENCY; ALGORITHMS; INFERENCE;
D O I
10.1561/0600000084
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This monograph is about combinatorial optimization. More precisely, about a special class of combinatorial problems known as energy minimization or maximum a posteriori (MAP) inference in graphical models, closely related to weighted and valued constraint satisfaction problems and having tight connections to Markov random fields and quadratic pseudo-boolean optimization. What distinguishes this monograph from a number of other monographs on graphical models is its focus: It considers graphical models, or, more precisely, MAP-inference for graphical models, purely as a combinatorial optimization problem. Modeling, applications, probabilistic interpretations and many other aspects are either ignored here or find their place in examples and remarks only. Combinatorial optimization as a field is largely based on five fundamental topics: (i) integer linear programming and polyhedral optimization; (ii) totally unimodular matrices and the class of min-cost-flow problems; (iii) Lagrange decompositions and relaxations; (iv) dynamic programming and (v) submodularity, matroids and greedy algorithms Each of these topics found its place in this monograph, although to a variable extent. The covering of each respective topic reflects its importance for the considered MAP-inference problem. Since optimization is the primary topic of this monograph, we mostly stick to the terminology widely used in optimization and where it was possible we tried to avoid the graphical models community-specific technical terms. The latter differ from sub-community to sub-community and, in our view, significantly complicate the information exchange between them. The same holds also for the presentation of material in this monograph. If there is a choice when introducing mathematical constructs or proving statements, we prefer more general mathematical tools applicable in the whole field of operations research rather than to stick to graphical model-specific constructions. We additionally provide the graphical model-specific constructions if it turns out to be easier than the more general one. This way of presentation has two advantages. A reader familiar with a more general technique can grasp the new material faster. On the other hand, the monograph may serve as an introduction to combinatorial optimization for readers unfamiliar with this subject. To make the monograph even more suitable for both categories of readers we explicitly separate the mathematical optimization background chapters from those specific to graphical models. We believe, therefore, that the monograph can be useful for undergraduate and graduate students studying optimization or graphical models, as well as for experts in optimization who want to have a look into graphical models. Moreover, we believe that even experts in graphical models can find new views on the known facts as well as a novel presentation of less known results in the monograph. These are for instance (i) a simple and general proof of equivalence of different acyclic Lagrange decompositions of a graphical model; (ii) a general scheme for analyzing convergence of dual block-coordinate descent methods; (iii) a short and self-contained analysis of a linear programming relaxation for binary graphical models, its persistency properties and its relation to quadratic pseudo-boolean optimization. The present monograph is based on lectures given to under-graduate students of Technical University of Dresden and Heidelberg University. The selection of material is done in a way that it may serve as a basis for a semester course.
引用
收藏
页码:160 / 429
页数:270
相关论文
共 50 条
  • [21] Bi-objective Discrete Graphical Model Optimization
    Buchet, Samuel
    Allouche, David
    de Givry, Simon
    Schiex, Thomas
    INTEGRATION OF CONSTRAINT PROGRAMMING, ARTIFICIAL INTELLIGENCE, AND OPERATIONS RESEARCH, PT I, CPAIOR 2024, 2024, 14742 : 136 - 152
  • [22] Probabilistic Circuits for Variational Inference in Discrete Graphical Models
    Shih, Andy
    Ermon, Stefano
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [23] Schemata, Distributions and Graphical Models in Evolutionary Optimization
    Heinz Mühlenbein
    Thilo Mahnig
    Alberto Ochoa Rodriguez
    Journal of Heuristics, 1999, 5 : 215 - 247
  • [24] Learning Graphical Factor Models with Riemannian Optimization
    Hippert-Ferrer, Alexandre
    Bouchard, Florent
    Mian, Ammar
    Vayer, Titouan
    Breloy, Arnaud
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT IV, 2023, 14172 : 349 - 366
  • [25] Schemata, distributions and graphical models in evolutionary optimization
    Mühlenbein, H
    Mahnig, T
    Rodriguez, AO
    JOURNAL OF HEURISTICS, 1999, 5 (02) : 215 - 247
  • [26] Distributed fusion in sensor networks -: A graphical models perspective
    Cetin, Mujdat
    Chen, Lei
    Fisher, John W., III
    Ihler, Alexander T.
    Moses, Randolph L.
    Wainwright, Martin J.
    Willsky, Alan S.
    IEEE SIGNAL PROCESSING MAGAZINE, 2006, 23 (04) : 42 - 55
  • [27] Context-specific graphical models for discrete longitudinal data
    Edwards, David
    Ankinakatte, Smitha
    STATISTICAL MODELLING, 2015, 15 (04) : 301 - 325
  • [28] Mode Estimation for High Dimensional Discrete Tree Graphical Models
    Chen, Chao
    Liu, Han
    Metaxas, Dimitris N.
    Zhao, Tianqi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [29] A Dynamical Systems Perspective on Discrete Optimization
    Guanchun, Tong
    Muehlebach, Michael
    LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211
  • [30] Memory intensive AND/OR search for combinatorial optimization in graphical models
    Marinescu, Radu
    Dechter, Rina
    ARTIFICIAL INTELLIGENCE, 2009, 173 (16-17) : 1492 - 1524