Data-Driven Policy Iteration for Nonlinear Optimal Control Problems

被引:4
|
作者
Possieri, Corrado [1 ]
Sassano, Mario [2 ]
机构
[1] CNR, Ist Anal Sistemi Informat A Ruberti, I-00185 Rome, Italy
[2] Univ Roma Tor Vergata, Dipartimento Ingn Civile & Ingn Informat, I-00133 Rome, Italy
关键词
Optimal control; Costs; Neural networks; Real-time systems; Nonlinear dynamical systems; Closed loop systems; Learning systems; Data-driven methods; nonlinear systems; optimal control; policy iteration; ADAPTIVE OPTIMAL-CONTROL; TIME LINEAR-SYSTEMS; IDENTIFICATION; ALGORITHM; DESIGN;
D O I
10.1109/TNNLS.2022.3142501
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The design of optimal control laws for nonlinear systems is tackled without knowledge of the underlying plant and of a functional description of the cost function. The proposed data-driven method is based only on real-time measurements of the state of the plant and of the (instantaneous) value of the reward signal and relies on a combination of ideas borrowed from the theories of optimal and adaptive control problems. As a result, the architecture implements a policy iteration strategy in which, hinging on the use of neural networks, the policy evaluation step and the computation of the relevant information instrumental for the policy improvement step are performed in a purely continuous-time fashion. Furthermore, the desirable features of the design method, including convergence rate and robustness properties, are discussed. Finally, the theory is validated via two benchmark numerical simulations.
引用
收藏
页码:7365 / 7376
页数:12
相关论文
共 50 条
  • [1] Data-driven policy iteration algorithm for continuous-time stochastic linear-quadratic optimal control problems
    Zhang, Heng
    Li, Na
    [J]. ASIAN JOURNAL OF CONTROL, 2024, 26 (01) : 481 - 489
  • [2] A data-driven indirect method for nonlinear optimal control
    Gao Tang
    Kris Hauser
    [J]. Astrodynamics, 2019, 3 : 345 - 359
  • [3] A data-driven indirect method for nonlinear optimal control
    Tang, Gao
    Hauser, Kris
    [J]. ASTRODYNAMICS, 2019, 3 (04) : 345 - 359
  • [4] A Data-driven Indirect Method for Nonlinear Optimal Control
    Tang, Gao
    Hauser, Kris
    [J]. 2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 4854 - 4861
  • [5] A data-driven online ADP control method for nonlinear system based on policy iteration and nonlinear MIMO decoupling ADRC
    Huang, Zhijian
    Zhang, Cheng
    Zhang, Yanyan
    Zhang, Guichen
    [J]. NEUROCOMPUTING, 2018, 310 : 28 - 37
  • [6] A Combined Policy Gradient and Q-learning Method for Data-driven Optimal Control Problems
    Lin, Mingduo
    Liu, Derong
    Zhao, Bo
    Dai, Qionghai
    Dong, Yi
    [J]. 2019 9TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST2019), 2019, : 6 - 10
  • [7] Data-Driven Nonlinear Adaptive Optimal Control of Connected Vehicles
    Gao, Weinan
    Jiang, Zhong-Ping
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2017), PT VI, 2017, 10639 : 122 - 129
  • [8] Data-Driven Nearly Optimal Control for Constrained Nonlinear Systems
    Yang, Xiong
    [J]. PROCEEDINGS OF 2020 IEEE 9TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS'20), 2020, : 105 - 110
  • [9] Data-driven policy iteration algorithm for optimal control of continuous-time Ito stochastic systems with Markovian jumps
    Song, Jun
    He, Shuping
    Liu, Fei
    Niu, Yugang
    Ding, Zhengtao
    [J]. IET CONTROL THEORY AND APPLICATIONS, 2016, 10 (12): : 1431 - 1439
  • [10] Value iteration and adaptive dynamic programming for data-driven adaptive optimal control design
    Bian, Tao
    Jiang, Zhong-Ping
    [J]. AUTOMATICA, 2016, 71 : 348 - 360