Data-Driven Adaptive Dynamic Programming for Optimal Control of Continuous-Time Multicontroller Systems With Unknown Dynamics

被引:7
|
作者
Zhao, Jingang [1 ]
机构
[1] Weifang Univ, Coll Informat & Control Engn, Weifang 261061, Shandong, Peoples R China
关键词
Optimal control; Games; Mathematical models; Control systems; Heuristic algorithms; Dynamic programming; Performance analysis; Adaptive dynamic programming; fully cooperative games; neural networks; multi-controller systems; unknown dynamics; ZERO-SUM GAMES; OPTIMAL TRACKING CONTROL; HORIZON OPTIMAL-CONTROL; NONLINEAR-SYSTEMS; ALGORITHMS;
D O I
10.1109/ACCESS.2022.3168032
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper investigates the optimal control of continuous-time multi-controller systems with completely unknown dynamics using data-driven adaptive dynamic programming (DD-ADP). In this investigation, all controllers take actions together as a team, and they have precisely the same cost function, which is actually a fully cooperative game. According to optimal control theory, the HJB equation corresponding to the fully cooperative game is derived. To obtain the solution to HJB equation, a model-based policy iteration (PI) algorithm is first presented. On the basis of the PI algorithm, a DD-ADP algorithm without requiring the system dynamics is developed, and the neural networks (NNs) implementation scheme of the developed DD-ADP algorithm is given. Stability and convergence analysis are derived by Lyapunov theory. Finally, numerical simulation examples on linear and nonlinear multi-controller systems demonstrate the effectiveness of the designed scheme.
引用
收藏
页码:41503 / 41511
页数:9
相关论文
共 50 条
  • [1] Data-Driven Tracking Control With Adaptive Dynamic Programming for a Class of Continuous-Time Nonlinear Systems
    Mu, Chaoxu
    Ni, Zhen
    Sun, Changyin
    He, Haibo
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (06) : 1460 - 1470
  • [2] Online optimal tracking control of continuous-time linear systems with unknown dynamics by using adaptive dynamic programming
    Qin, Chunbin
    Zhang, Huaguang
    Luo, Yanhong
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 2014, 87 (05) : 1000 - 1009
  • [3] Optimal dynamic output feedback control of unknown linear continuous-time systems by adaptive dynamic programming☆
    Xie, Kedi
    Zheng, Yiwei
    Jiang, Yi
    Lan, Weiyao
    Yu, Xiao
    [J]. AUTOMATICA, 2024, 163
  • [4] Data-Driven Iterative Adaptive Dynamic Programming Algorithm for Approximate Optimal Control of Unknown Nonlinear Systems
    Li, Hongliang
    Liu, Derong
    Wang, Ding
    Li, Chao
    [J]. PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 3265 - 3271
  • [5] Data-driven optimal tracking control for a class of affine non-linear continuous-time systems with completely unknown dynamics
    Xiao, Geyang
    Zhang, Huaguang
    Luo, Yanhong
    Jiang, He
    [J]. IET CONTROL THEORY AND APPLICATIONS, 2016, 10 (06): : 700 - 710
  • [6] Online adaptive optimal control for continuous-time nonlinear systems with completely unknown dynamics
    Lv, Yongfeng
    Na, Jing
    Yang, Qinmin
    Wu, Xing
    Guo, Yu
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 2016, 89 (01) : 99 - 112
  • [7] Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics
    Jiang, Yu
    Jiang, Zhong-Ping
    [J]. AUTOMATICA, 2012, 48 (10) : 2699 - 2704
  • [8] Distributed adaptive dynamic programming for data-driven optimal control
    Tang, Wentao
    Daoutidis, Prodromos
    [J]. SYSTEMS & CONTROL LETTERS, 2018, 120 : 36 - 43
  • [9] Internal reinforcement adaptive dynamic programming for optimal containment control of unknown continuous-time multi-agent systems
    Zhang, Jiefu
    Peng, Zhinan
    Hu, Jiangping
    Zhao, Yiyi
    Luo, Rui
    Ghosh, Bijoy Kumar
    [J]. NEUROCOMPUTING, 2020, 413 : 85 - 95
  • [10] Data-driven adaptive optimal control of linear uncertain systems with unknown jumping dynamics
    Zhang, Meng
    Gan, Ming-Gang
    Chen, Jie
    Jiang, Zhong-Ping
    [J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2019, 356 (12): : 6087 - 6105