Data-Based Optimal Control of Multiagent Systems: A Reinforcement Learning Design Approach

Cited by: 94

Authors:

Zhang, Jilie [1]

Wang, Zhanshan [2]

Zhang, Hongwei [3]

Affiliations:

[1] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu 611756, Sichuan, Peoples R China

[2] Northeastern Univ, Sch Informat Sci & Engn, Shenyang 110819, Liaoning, Peoples R China

[3] Southwest Jiaotong Univ, Sch Elect Engn, Key Lab Magnet Suspens Technol & Maglev Vehicle, Minist Educ, Chengdu 611756, Sichuan, Peoples R China

Source:

IEEE TRANSACTIONS ON CYBERNETICS | 2019 / Vol. 49 / No. 12

Funding:

National Natural Science Foundation of China;

Keywords:

Consensus; data-based control; optimal cooperative control; reinforcement learning; DIFFERENTIAL GRAPHICAL GAMES; ADAPTIVE OPTIMAL-CONTROL; SYNCHRONIZATION; FEEDBACK;

DOI:

10.1109/TCYB.2018.2868715

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

This paper studies an optimal consensus tracking problem of heterogeneous linear multiagent systems. By introducing tracking error dynamics, the optimal tracking problem is reformulated as finding a Nash-equilibrium solution to multiplayer games, which can be done by solving associated coupled Hamilton-Jacobi equations. A data-based error estimator is designed to obtain the data-based control for the multiagent systems. Using the quadratic functional to approximate every agent's value function, we can obtain the optimal cooperative control by the input-output (I/O) Q-learning algorithm with a value iteration technique in the least-square sense. The control law solves the optimal consensus problem for multiagent systems with measured I/O information, and does not rely on the model of multiagent systems. A numerical example is provided to illustrate the effectiveness of the proposed algorithm.


Pages: 4441-4449

Page count: 9

50 items in total

[1] Data-Based Optimal Consensus Control for Multiagent Systems With Policy Gradient Reinforcement Learning
Yang, Xindi
Zhang, Hao
Wang, Zhuping
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (08) : 3872 - 3883
[2] Data-based optimal control design with reinforcement learning for nonlinear PDE systems
Zheng, Yuqing
Zhang, Guoshan
[J]. 2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 345 - 350
[3] Input-Output Data-Based Output Antisynchronization Control of Multiagent Systems Using Reinforcement Learning Approach
Peng, Zhinan
Zhao, Yiyi
Hu, Jiangping
Luo, Rui
Ghosh, Bijoy Kumar
Nguang, Sing Kiong
[J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (11) : 7359 - 7367
[4] Data-Based Optimal Synchronization of Heterogeneous Multiagent Systems in Graphical Games via Reinforcement Learning
Xiong, Chunping
Ma, Qian
Guo, Jian
Lewis, Frank L.
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 9
[5] Data-based reinforcement learning approximate optimal control for an uncertain nonlinear system with control effectiveness faults
Deptula, Patryk
Bell, Zachary I.
Doucette, Emily A.
Curtis, J. Willard
Dixon, Warren E.
[J]. AUTOMATICA, 2020, 116
[6] Data-Based Reinforcement Learning Approximate Optimal Control for an Uncertain Nonlinear System with Partial Loss of Control Effectiveness
Deptula, Patryk
Bell, Zachary I.
Doucette, Emily A.
Curtis, J. Willard
Dixon, Warren E.
[J]. 2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018, : 2521 - 2526
[7] Design of a Networked Tracking Control System With a Data-based Approach
Tong, Shiwen
Qian, Dianwei
Yan, Xiaoyu
Fang, Jianjun
Liu, Wei
[J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2019, 6 (05) : 1261 - 1267
[8] Design of a Networked Tracking Control System With a Data-based Approach
Shiwen Tong
Dianwei Qian
Xiaoyu Yan
Jianjun Fang
Wei Liu
[J]. IEEE/CAA Journal of Automatica Sinica, 2019, 6 (05) : 1261 - 1267
[9] An Approach to Data-Based Linear Quadratic Optimal Control
Yan, Yitao
Bao, Jie
Huang, Biao
[J]. IEEE CONTROL SYSTEMS LETTERS, 2024, 8 : 1120 - 1125
[10] Data-based optimal control
Aangenent, W
Kostic, D
de Jager, B
van de Molengraft, R
Steinbuch, M
[J]. ACC: PROCEEDINGS OF THE 2005 AMERICAN CONTROL CONFERENCE, VOLS 1-7, 2005, : 1460 - 1465
