Optimal control as a graphical model inference problem

被引:171
|
作者
Kappen, Hilbert J. [1 ]
Gomez, Vicenc [1 ]
Opper, Manfred [2 ]
机构
[1] Radboud Univ Nijmegen, Donders Inst Brain Cognit & Behav, NL-6525 EZ Nijmegen, Netherlands
[2] TU Berlin, Dept Comp Sci, D-10587 Berlin, Germany
关键词
Optimal control; Uncontrolled dynamics; Kullback-Leibler divergence; Graphical model; Approximate inference; Cluster variation method; Belief propagation;
D O I
10.1007/s10994-012-5278-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We reformulate a class of non-linear stochastic optimal control problems introduced by Todorov (in Advances in Neural Information Processing Systems, vol. 19, pp. 1369-1376, 2007) as a Kullback-Leibler (KL) minimization problem. As a result, the optimal control computation reduces to an inference computation and approximate inference methods can be applied to efficiently compute approximate optimal controls. We show how this KL control theory contains the path integral control method as a special case. We provide an example of a block stacking task and a multi-agent cooperative game where we demonstrate how approximate inference can be successfully applied to instances that are too complex for exact computation. We discuss the relation of the KL control approach to other inference approaches to control.
引用
收藏
页码:159 / 182
页数:24
相关论文
共 50 条
  • [21] An optimal control model for the Lyapunov system of stability problem
    Xiong, Xiaolin
    Lao, Zhi
    Feng, Zhiguo
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 3768 - 3771
  • [22] Stochastic Optimal Control Problem in Advertising Model with Delay
    CHEN Li
    WU Zhen
    Journal of Systems Science & Complexity, 2020, 33 (04) : 968 - 987
  • [23] OPTIMAL CONTROL PROBLEM OF A TUBERCULOSIS MODEL WITH SPATIAL DYNAMICS
    Ben Rhila, Soukaina
    Rachik, Mostafa
    COMMUNICATIONS IN MATHEMATICAL BIOLOGY AND NEUROSCIENCE, 2020, : 1 - 12
  • [24] Stochastic Optimal Control Problem in Advertising Model with Delay
    Chen Li
    Wu Zhen
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2020, 33 (04) : 968 - 987
  • [25] On the optimal control problem for two regions' macroeconomic model
    Surkov, Platon G.
    ARCHIVES OF CONTROL SCIENCES, 2015, 25 (04): : 417 - 427
  • [26] Stochastic Optimal Control Problem in Advertising Model with Delay
    Li Chen
    Zhen Wu
    Journal of Systems Science and Complexity, 2020, 33 : 968 - 987
  • [27] Smooth and nonsmooth optimal Lipschitz control - a model problem
    Goebel, M
    VARIATIONAL CALCULUS, OPTIMAL CONTROL AND APPLICATIONS, 1998, 124 : 53 - 60
  • [28] An optimal control problem for diffusion-precipitation model
    Kundu, A.
    Mahato, H. S.
    ASYMPTOTIC ANALYSIS, 2024, 139 (3-4) : 183 - 215
  • [29] Graphical Inference for Infovis
    Wickham, Hadley
    Cook, Dianne
    Hofmann, Heike
    Buja, Andreas
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2010, 16 (06) : 973 - 979
  • [30] Incomplete graphical model inference via latent tree aggregation
    Robin, Genevieve
    Ambroise, Christophe
    Robin, Stephane
    STATISTICAL MODELLING, 2019, 19 (05) : 545 - 568