Data-Based Optimal Control of Multiagent System A Reinforcement Learning Design Approach

被引:94
|
作者
Zhang, Jilie [1 ]
Wang, Zhanshan [2 ]
Zhang, Hongwei [3 ]
机构
[1] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu 611756, Sichuan, Peoples R China
[2] Northeastern Univ, Sch Informat Sci & Engn, Shenyang 110819, Liaoning, Peoples R China
[3] Southwest Jiaotong Univ, Sch Elect Engn, Key Lab Magnet Suspens Technol & Maglev Vehicle, Minist Educ, Chengdu 611756, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Consensus; data-based control; optimal cooperative control; reinforcement learning; DIFFERENTIAL GRAPHICAL GAMES; ADAPTIVE OPTIMAL-CONTROL; SYNCHRONIZATION; FEEDBACK;
D O I
10.1109/TCYB.2018.2868715
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper studies an optimal consensus tracking problem of heterogeneous linear multiagent systems. By introducing tracking error dynamics, the optimal tracking problem is reformulated as finding a Nash-equilibrium solution to multiplayer games, which can be done by solving associated coupled Hamilton-Jacobi equations. A data-based error estimator is designed to obtain the data-based control for the multiagent systems. Using the quadratic functional to approximate every agent's value function, we can obtain the optimal cooperative control by the input-output (I/O) Q-learning algorithm with a value iteration technique in the least-square sense. The control law solves the optimal consensus problem for multiagent systems with measured I/O information, and does not rely on the model of multiagent systems. A numerical example is provided to illustrate the effectiveness of the proposed algorithm.
引用
收藏
页码:4441 / 4449
页数:9
相关论文
共 50 条
  • [1] Data-Based Optimal Consensus Control for Multiagent Systems With Policy Gradient Reinforcement Learning
    Yang, Xindi
    Zhang, Hao
    Wang, Zhuping
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (08) : 3872 - 3883
  • [2] Data-based optimal control design with reinforcement learning for nonlinear PDE systems
    Zheng, Yuqing
    Zhang, Guoshan
    [J]. 2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 345 - 350
  • [3] Input-Output Data-Based Output Antisynchronization Control of Multiagent Systems Using Reinforcement Learning Approach
    Peng, Zhinan
    Zhao, Yiyi
    Hu, Jiangping
    Luo, Rui
    Ghosh, Bijoy Kumar
    Nguang, Sing Kiong
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (11) : 7359 - 7367
  • [4] Data-Based Optimal Synchronization of Heterogeneous Multiagent Systems in Graphical Games via Reinforcement Learning
    Xiong, Chunping
    Ma, Qian
    Guo, Jian
    Lewis, Frank L.
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 9
  • [5] Data-based reinforcement learning approximate optimal control for an uncertain nonlinear system with control effectiveness faults
    Deptula, Patryk
    Bell, Zachary, I
    Doucette, Emily A.
    Curtis, J. Willard
    Dixon, Warren E.
    [J]. AUTOMATICA, 2020, 116
  • [6] Data-Based Reinforcement Learning Approximate Optimal Control for an Uncertain Nonlinear System with Partial Loss of Control Effectiveness
    Deptula, Patryk
    Bell, Zachary, I
    Doucette, Emily A.
    Curtis, J. Willard
    Dixon, Warren E.
    [J]. 2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018, : 2521 - 2526
  • [7] Design of a Networked Tracking Control System With a Data-based Approach
    Tong, Shiwen
    Qian, Dianwei
    Yan, Xiaoyu
    Fang, Jianjun
    Liu, Wei
    [J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2019, 6 (05) : 1261 - 1267
  • [8] Design of a Networked Tracking Control System With a Data-based Approach
    Shiwen Tong
    Dianwei Qian
    Xiaoyu Yan
    Jianjun Fang
    Wei Liu
    [J]. IEEE/CAA Journal of Automatica Sinica, 2019, 6 (05) : 1261 - 1267
  • [9] An Approach to Data-Based Linear Quadratic Optimal Control
    Yan, Yitao
    Bao, Jie
    Huang, Biao
    [J]. IEEE CONTROL SYSTEMS LETTERS, 2024, 8 : 1120 - 1125
  • [10] Data-based optimal control
    Aangenent, W
    Kostic, D
    de Jager, B
    van de Molengraft, R
    Steinbuch, M
    [J]. ACC: PROCEEDINGS OF THE 2005 AMERICAN CONTROL CONFERENCE, VOLS 1-7, 2005, : 1460 - 1465