Value iteration with deep neural networks for optimal control of input-affine nonlinear systems

被引:1
|
作者
Beppu H. [1 ,2 ]
Maruta I. [1 ]
Fujimoto K. [1 ]
机构
[1] Department of Aeronautics and Astronautics, Graduate School of Engineering, Kyoto University, Kyoto
[2] Japan Society for the Promotion of Science, Tokyo
关键词
convergence analysis; deep neural networks; input-affine nonlinear systems; optimal control; Value iteration;
D O I
10.1080/18824889.2021.1936817
中图分类号
学科分类号
摘要
This paper proposes a new algorithm with deep neural networks to solve optimal control problems for continuous-time input nonlinear systems based on a value iteration algorithm. The proposed algorithm applies the networks to approximating the value functions and control inputs in the iterations. Consequently, the partial differential equations of the original algorithm reduce to the optimization problems for the parameters of the networks. Although the conventional algorithm can obtain the optimal control with iterative computations, each of the computations needs to be completed precisely, and it is hard to achieve sufficient precision in practice. Instead, the proposed method provides a practical method using deep neural networks and overcomes the difficulty based on a property of the networks, under which our convergence analysis shows that the proposed algorithm can achieve the minimum of the value function and the corresponding optimal controller. The effectiveness of the proposed method even with reasonable computational resources is demonstrated in two numerical simulations. © 2021 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group.
引用
收藏
页码:140 / 149
页数:9
相关论文
共 50 条
  • [1] Fixed-final-time optimal tracking control of input-affine nonlinear systems
    Heydari, Ali
    Balakrishnan, S. N.
    [J]. NEUROCOMPUTING, 2014, 129 : 528 - 539
  • [2] HYBRID APPROACH FOR CONTROL OF A CLASS OF INPUT-AFFINE NONLINEAR SYSTEMS
    Atam, Ercan
    Mathelin, Lionel
    Cordier, Laurent
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2014, 10 (03): : 1207 - 1228
  • [3] Control of input-affine nonlinear systems via linear programming
    Merrikh-Bayat, Farshad
    Afshar, Mehdi
    [J]. INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2021, 31 (18) : 9358 - 9371
  • [5] Hierarchical optimal control for input-affine nonlinear systems through the formulation of Stackelberg game
    Mu, Chaoxu
    Wang, Ke
    Zhang, Qichao
    Zhao, Dongbin
    [J]. INFORMATION SCIENCES, 2020, 517 : 1 - 17
  • [6] General interpolation for input-affine nonlinear systems
    Bacic, M
    Cannon, M
    Kouvaritakis, B
    [J]. PROCEEDINGS OF THE 2004 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2004, : 2010 - 2014
  • [7] Time-energy suboptimal control of nonlinear input-affine systems
    Nanavati, Rohit V.
    Kumar, Shashi Ranjan
    Maity, Arnab
    [J]. International Journal of Control, 2023, 96 (12): : 3058 - 3071
  • [8] On the stability of input-affine nonlinear systems with sampled-data control
    Omran, Hassan
    Hetel, Laurentiu
    Richard, Jean-Pierre
    Lamnabhi-Lagarrigue, Francoise
    [J]. 2013 EUROPEAN CONTROL CONFERENCE (ECC), 2013, : 2585 - 2590
  • [9] Time-energy suboptimal control of nonlinear input-affine systems
    Nanavati, Rohit, V
    Kumar, Shashi Ranjan
    Maity, Arnab
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 2023, 96 (12) : 3058 - 3071
  • [10] Adaptive Passification of Unknown Input-Affine Nonlinear Systems
    Miyano, Tatsuya
    Shima, Ryotaro
    Ito, Yuji
    [J]. IEEE Control Systems Letters, 2024, 8 : 2979 - 2984