Value iteration with deep neural networks for optimal control of input-affine nonlinear systems

Cited by: 1
Authors
Beppu H. [1 ,2 ]
Maruta I. [1 ]
Fujimoto K. [1 ]
Affiliations
[1] Department of Aeronautics and Astronautics, Graduate School of Engineering, Kyoto University, Kyoto
[2] Japan Society for the Promotion of Science, Tokyo
Keywords
convergence analysis; deep neural networks; input-affine nonlinear systems; optimal control; value iteration
DOI
10.1080/18824889.2021.1936817
Abstract
This paper proposes a new algorithm with deep neural networks to solve optimal control problems for continuous-time input-affine nonlinear systems based on a value iteration algorithm. The proposed algorithm uses the networks to approximate the value functions and control inputs in the iterations. Consequently, the partial differential equations of the original algorithm reduce to optimization problems for the parameters of the networks. Although the conventional algorithm can obtain the optimal control with iterative computations, each of the computations needs to be completed precisely, and it is hard to achieve sufficient precision in practice. Instead, the proposed method provides a practical approach using deep neural networks and overcomes the difficulty based on a property of the networks, under which our convergence analysis shows that the proposed algorithm can achieve the minimum of the value function and the corresponding optimal controller. The effectiveness of the proposed method, even with reasonable computational resources, is demonstrated in two numerical simulations. © 2021 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group.
Pages: 140 - 149
Page count: 9
Related papers
50 in total
  • [21] Event-triggered control of input-affine nonlinear interconnected systems using multiplayer game
    Narayanan, Vignesh
    Modares, Hamidreza
    Jagannathan, Sarangapani
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2021, 31 (03) : 950 - 970
  • [22] Tikhonov Regularization Based Control Allocation for Underactuated Input-Affine Systems
    de Morais, Junio Eduardo
    Cardoso, Daniel N.
    Raffo, Guilherme V.
    2023 LATIN AMERICAN ROBOTICS SYMPOSIUM, LARS, 2023 BRAZILIAN SYMPOSIUM ON ROBOTICS, SBR, AND 2023 WORKSHOP ON ROBOTICS IN EDUCATION, WRE, 2023, : 379 - 384
  • [23] Loewner Functions and Model Order Reduction for Nonlinear Input-Affine Descriptor Systems
    Simard, Joel D.
    Astolfi, Alessandro
    2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 6887 - 6894
  • [24] Homogeneous stabilization for input-affine homogeneous systems
    Nakamura, Nami
    Nakamura, Hisakazu
    Yamashita, Yah
    PROCEEDINGS OF THE 46TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-14, 2007, : 3770 - +
  • [25] An analytical fuzzy-based approach to L2-gain optimal control of input-affine nonlinear systems using Newton-type algorithm
    Milic, Vladimir
    Kasac, Josip
    Novakovic, Branko
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2015, 46 (13) : 2448 - 2460
  • [26] Self-Learning Optimal Guaranteed Cost Control of Input-Affine Continuous-Time Nonlinear Systems Under Uncertain Environment
    Wang, Ding
    He, Haibo
    Liu, Derong
    Li, Chao
    Wang, Huidong
    PROCEEDINGS OF THE 2016 12TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2016, : 857 - 862
  • [27] Further results on the structure of normal forms of input-affine nonlinear MIMO systems
    Isidori, Alberto
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 638 - 643
  • [28] Value-iteration-based affine nonlinear optimal control involving admissibility discussion
    Wang, Ding
    Ren, Jin
    Ha, Mingming
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2022, 32 (13) : 7290 - 7303
  • [29] Event-based feedback control of disturbed input-affine systems
    Stoecker, Christian
    Lunze, Jan
    ZAMM-ZEITSCHRIFT FUR ANGEWANDTE MATHEMATIK UND MECHANIK, 2014, 94 (04) : 290 - 302
  • [30] A Nash Game Approach to Mixed H2/H∞ Control for Input-Affine Nonlinear Systems
    Mylvaganam, T.
    Astolfi, A.
    IFAC PAPERSONLINE, 2016, 49 (18) : 1024 - 1029