Optimal control as a graphical model inference problem

Cited by: 171
Authors
Kappen, Hilbert J. [1]
Gomez, Vicenc [1]
Opper, Manfred [2]
Affiliations
[1] Radboud Univ Nijmegen, Donders Inst Brain Cognit & Behav, NL-6525 EZ Nijmegen, Netherlands
[2] TU Berlin, Dept Comp Sci, D-10587 Berlin, Germany
Keywords
Optimal control; Uncontrolled dynamics; Kullback-Leibler divergence; Graphical model; Approximate inference; Cluster variation method; Belief propagation;
DOI
10.1007/s10994-012-5278-7
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We reformulate a class of non-linear stochastic optimal control problems introduced by Todorov (in Advances in Neural Information Processing Systems, vol. 19, pp. 1369-1376, 2007) as a Kullback-Leibler (KL) minimization problem. As a result, the optimal control computation reduces to an inference computation and approximate inference methods can be applied to efficiently compute approximate optimal controls. We show how this KL control theory contains the path integral control method as a special case. We provide an example of a block stacking task and a multi-agent cooperative game where we demonstrate how approximate inference can be successfully applied to instances that are too complex for exact computation. We discuss the relation of the KL control approach to other inference approaches to control.
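The KL formulation described in the abstract can be illustrated in its simplest single-step form: choose a distribution p that minimizes KL(p || q) + E_p[R], where q plays the role of the uncontrolled dynamics and R is a state cost. The minimizer is p*(x) = q(x) exp(-R(x)) / Z, with optimal cost -log Z. The following is a minimal sketch under those assumptions, not the paper's graphical-model implementation; the function names and the three-outcome toy problem are hypothetical.

```python
import math

def kl_control_cost(p, q, R):
    """Return KL(p || q) + E_p[R] for distributions over a finite set."""
    return sum(pi * (math.log(pi / qi) + ri)
               for pi, qi, ri in zip(p, q, R) if pi > 0)

def optimal_distribution(q, R):
    """Return the minimizer p*(x) = q(x) exp(-R(x)) / Z and the cost -log Z."""
    weights = [qi * math.exp(-ri) for qi, ri in zip(q, R)]
    Z = sum(weights)
    return [w / Z for w in weights], -math.log(Z)

# Hypothetical toy problem: three outcomes, uniform uncontrolled dynamics q,
# and a state cost R that grows with the outcome index.
q = [1 / 3, 1 / 3, 1 / 3]
R = [0.0, 1.0, 2.0]
p_star, optimal_cost = optimal_distribution(q, R)
```

In this sketch the control computation is just a normalization, which is the sense in which control reduces to inference; for multi-step problems the normalizer Z becomes a sum over trajectories, which is where approximate inference methods would come in.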
Pages: 159-182
Page count: 24
Related papers
50 in total
  • [11] On an optimal control problem of the Leray-α model
    Hacat, Guelnur
    Cibik, Aytekin
    Yilmaz, Fikriye
    Kaya, Songuel
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2024, 436
  • [12] A Model of Incentive Wages as an Optimal Control Problem
    Aleksandrova, E. A.
    Anikin, S. A.
    BULLETIN OF THE SOUTH URAL STATE UNIVERSITY SERIES-MATHEMATICAL MODELLING PROGRAMMING & COMPUTER SOFTWARE, 2014, 7 (04): : 22 - 35
  • [13] An Optimal Control Problem Related to the RSS Model
    Zaslavski, Alexander J.
    MATHEMATICS, 2023, 11 (17)
  • [14] Optimal Control Problem for an Electoral Behavior Model
    Omar Balatif
    Mohamed El Hia
    Mostafa Rachik
    Differential Equations and Dynamical Systems, 2023, 31 : 233 - 250
  • [15] Study of an Optimal Control Problem Related to the Solow Control Model
    Nikol'skii, M. S.
    PROCEEDINGS OF THE STEKLOV INSTITUTE OF MATHEMATICS, 2016, 292 : S231 - S237
  • [16] Study of an optimal control problem related to the Solow control model
    Nikolskii, M. S.
    TRUDY INSTITUTA MATEMATIKI I MEKHANIKI URO RAN, 2014, 20 (04): : 231 - 237
  • [17] Study of an optimal control problem related to the Solow control model
    M. S. Nikol’skii
    Proceedings of the Steklov Institute of Mathematics, 2016, 292 : 231 - 237
  • [18] LARGE MULTIPLE GRAPHICAL MODEL INFERENCE VIA BOOTSTRAP
    Zhang, Yongli
    Shen, Xiaotong
    Wang, Shaoli
    STATISTICA SINICA, 2020, 30 (02) : 695 - 717
  • [19] Learning Graphical Model Parameters with Approximate Marginal Inference
    Domke, Justin
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (10) : 2454 - 2467
  • [20] Inference With Aggregate Data in Probabilistic Graphical Models: An Optimal Transport Approach
    Singh, Rahul
    Haasler, Isabel
    Zhang, Qinsheng
    Karlsson, Johan
    Chen, Yongxin
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, 67 (09) : 4483 - 4497