Maximum Entropy Optimal Control of Continuous-Time Dynamical Systems

被引:6
|
作者
Kim, Jeongho [1 ,2 ]
Yang, Insoon [3 ,4 ]
机构
[1] Seoul Natl Univ, Seoul 08826, South Korea
[2] Korea Inst Adv Study, Seoul 02455, South Korea
[3] Seoul Natl Univ, Dept Elect & Comp Engn, Seoul 08826, South Korea
[4] Seoul Natl Univ, Automat & Syst Res Inst, Seoul 08826, South Korea
基金
新加坡国家研究基金会;
关键词
Dynamic programming (DP); entropy; Hamilton-Jacobi-Bellman (HJB) equations; optimal control; viscosity solution; VISCOSITY SOLUTIONS; RELAXED CONTROLS; EQUATIONS; DIMENSIONALITY; ALGORITHM; CURSE;
D O I
10.1109/TAC.2022.3168168
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Maximum entropy reinforcement learning methods have been successfully applied to a range of challenging sequential decision-making and control tasks. However, most of the existing techniques are designed for discrete-time systems although there has been a growing interest to handle physical processes evolving in continuous time. As a first step toward their extension to continuous-time systems, this article aims to study the theory of maximum entropy optimal control in continuous time. Applying the dynamic programming principle, we derive a novel class of Hamilton-Jacobi-Bellman (HJB) equations and prove that the optimal value function of the maximum entropy control problem corresponds to the unique viscosity solution of the HJB equation. We further show that the optimal control is uniquely characterized as Gaussian in the case of control-affine systems and that, for linear-quadratic problems, the HJB equation is reduced to a Riccati equation, which can be used to obtain an explicit expression of the optimal control. The results of our numerical experiments demonstrate the performance of our maximum entropy method in continuous-time optimal control and reinforcement learning problems.
引用
收藏
页码:2018 / 2033
页数:16
相关论文
共 50 条
  • [1] Relatively optimal control for continuous-time systems
    Blanchini, Franco
    Fujisaki, Yasumasa
    PROCEEDINGS OF THE 45TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-14, 2006, : 5649 - +
  • [2] Anti-control of continuous-time dynamical systems
    Yu, Simin
    Chen, Guanrong
    COMMUNICATIONS IN NONLINEAR SCIENCE AND NUMERICAL SIMULATION, 2012, 17 (06) : 2617 - 2627
  • [3] Finite-Time Control of Continuous-Time Networked Dynamical Systems
    Liu, Huabo
    Yu, Haisheng
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (11): : 4623 - 4632
  • [4] Optimal semistable control for continuous-time linear systems
    Hui, Qing
    SYSTEMS & CONTROL LETTERS, 2011, 60 (04) : 278 - 284
  • [5] Optimal Semistable Control for Continuous-Time Coupled Systems
    Hui, Qing
    2010 AMERICAN CONTROL CONFERENCE, 2010, : 6403 - 6408
  • [6] Sparse optimal feedback control for continuous-time systems
    Ikeda, Takuya
    Kashima, Kenji
    2019 18TH EUROPEAN CONTROL CONFERENCE (ECC), 2019, : 3728 - 3733
  • [7] Optimal control for continuous-time Markov jump systems
    Engineering College, Air Force Engineering University, Xi'an 710038, China
    Kongzhi yu Juece Control Decis, 2013, 3 (396-401):
  • [8] Optimal Control of Affine Nonlinear Continuous-time Systems
    Dierks, T.
    Jagannathan, S.
    2010 AMERICAN CONTROL CONFERENCE, 2010, : 1568 - 1573
  • [9] L∞ optimal control of SISO continuous-time systems
    The Pennsylvania State Univ, University Park, United States
    Automatica, 1 (85-90):
  • [10] Optimal control of continuous-time switched affine systems
    Seatzu, Carla
    Corona, Daniele
    Giua, Alessandro
    Bemporad, Alberto
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2006, 51 (05) : 726 - 741