Deep reinforcement learning based finite-horizon optimal control for a discrete-time affine nonlinear system

Authors
Kim, Jong Woo [1]
Park, Byung Jun [1]
Yoo, Haeun [2]
Lee, Jay H. [2]
Lee, Jong Min [1]
Affiliations
[1] Seoul Natl Univ, Sch Chem & Biol Engn, Inst Chem Proc, 1 Gwanak Ro, Seoul 08826, South Korea
[2] Korea Adv Inst Sci & Technol, Chem & Biomol Engn Dept, Daejeon 34141, South Korea
Keywords
Reinforcement learning; Approximate dynamic programming; Deep learning; Actor-critic method; Finite horizon optimal control; DESIGN
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Approximate dynamic programming (ADP) aims to obtain an approximate numerical solution to the discrete-time Hamilton-Jacobi-Bellman (HJB) equation. Heuristic dynamic programming (HDP) is a two-stage iterative ADP scheme that separates the HJB equation into two equations, one for the value function and one for the policy function, which are referred to as the critic and the actor, respectively. Previous ADP implementations have been limited by the choice of function approximator, which requires either significant prior domain knowledge or a large number of parameters to be fitted. However, recent advances in deep learning from the computer science community enable the use of deep neural networks (DNNs) to approximate high-dimensional nonlinear functions without prior domain knowledge. Motivated by this, we examine the potential of DNNs as function approximators for the critic and the actor. In contrast to the infinite-horizon optimal control problem, the critic and the actor of the finite-horizon optimal control (FHOC) problem are time-varying functions and have to satisfy a boundary condition. A DNN structure and a training algorithm suitable for FHOC are presented. Illustrative examples are provided to demonstrate the validity of the proposed method.
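The abstract's point that the finite-horizon critic and actor are time-varying and tied to a boundary condition can be seen in a minimal sketch. This is not the paper's DNN-based method: it assumes a scalar linear system x[k+1] = a*x[k] + b*u[k] (a special case of an affine system) with quadratic stage cost q*x^2 + r*u^2 and terminal cost q_f*x^2, for which the Bellman recursion has a closed form. All names and numerical values below are illustrative assumptions.

```python
def finite_horizon_lq(a, b, q, r, q_f, horizon):
    """Backward Bellman recursion for x[k+1] = a*x[k] + b*u[k].

    Critic: V_k(x) = p[k] * x**2.  Actor: u_k(x) = -gain[k] * x.
    """
    p = [0.0] * (horizon + 1)
    gain = [0.0] * horizon
    p[horizon] = q_f  # boundary condition: V_N(x) = q_f * x**2
    for k in range(horizon - 1, -1, -1):
        # Actor step: minimizer of q*x^2 + r*u^2 + p[k+1]*(a*x + b*u)^2 over u.
        gain[k] = a * b * p[k + 1] / (r + b * b * p[k + 1])
        # Critic step: value obtained by substituting the minimizer back in.
        p[k] = q + a * a * p[k + 1] - a * b * p[k + 1] * gain[k]
    return p, gain


def rollout_cost(a, b, q, r, q_f, gain, x0):
    """Simulate the time-varying policy and accumulate the total cost."""
    x, cost = x0, 0.0
    for k_gain in gain:
        u = -k_gain * x
        cost += q * x * x + r * u * u
        x = a * x + b * u
    return cost + q_f * x * x  # terminal cost at step N


# Hypothetical parameters chosen only for illustration.
params = dict(a=1.1, b=0.5, q=1.0, r=0.1, q_f=5.0)
p, gain = finite_horizon_lq(horizon=10, **params)
total = rollout_cost(gain=gain, x0=1.0, **params)
```

In this sketch the gain at the last step differs from the gain at the first step, which is the time dependence the abstract refers to, and the simulated cost from x0 = 1 matches the critic value p[0]. A DNN-based approach would replace the closed-form p[k] and gain[k] with learned functions of both state and time.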
Pages: 567-572
Number of pages: 6