Optimal Self-Learning Cooperative Control for Continuous-Time Heterogeneous Multi-Agent Systems

被引:0
|
作者
Wei Qinglai [1 ]
Liu Derong [1 ]
Song Ruizhuo [2 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
[2] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
关键词
Adaptive Critic Designs; Adaptive Dynamic Programming; Approximate Dynamic Programming; Heterogeneous Multi-Agents; Graphical Games; Policy Iteration; Synchronization; DYNAMIC-PROGRAMMING ALGORITHM; OPTIMAL TRACKING CONTROL; NONLINEAR-SYSTEMS; CONTROL SCHEME; REINFORCEMENT; GAMES;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, an optimal self-learning cooperative control for heterogeneous multi-agent systems by iterative adaptive dynamic programming (ADP) is developed. The main idea is to design an optimal control law by policy iteration based ADP technique which makes all the agents track a given dynamics and simultaneously makes the iterative performance index function reach the Nash equilibrium. The cooperative policy iteration algorithm for graphical differential games is developed to achieve the optimal control law for the agent of each node. Convergence properties are analyzed which make the performance index functions of heterogeneous multi-agent differential graphical games converge to the Nash equilibrium. Simulation example is given to show the effectiveness of the developed optimal self-learning control scheme.
引用
收藏
页码:3005 / 3010
页数:6
相关论文
共 50 条
  • [31] Distributed learning and cooperative control for multi-agent systems
    Choi, Jongeun
    Oh, Songhwai
    Horowitz, Roberto
    AUTOMATICA, 2009, 45 (12) : 2802 - 2814
  • [32] LQ inverse optimal consensus protocol for continuous-time multi-agent systems and its application to formation control
    Choi, Y.H. (yhchoi@kyonggi.ac.kr), 1600, Institute of Control, Robotics and Systems (20):
  • [33] Optimal Consensus Control for Continuous-time Multi-agent Systems via Actor-Critic Neural Networks
    Jia, Xiao
    Wolter, Katinka
    2022 8TH INTERNATIONAL CONFERENCE ON AUTOMATION, ROBOTICS AND APPLICATIONS (ICARA 2022), 2022, : 191 - 195
  • [34] Consensusability of Continuous-time Multi-agent Systems With Multiplicative Noises
    Qu, Cuiyun
    Wang, Zhongmei
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 5178 - 5181
  • [35] Consensus of Singular Multi-Agent Systems with Continuous-Time Dynamics
    Jiang, Tong Qiang
    He, Jia Wei
    Gao, Yan Ping
    MANUFACTURING ENGINEERING AND AUTOMATION II, PTS 1-3, 2012, 591-593 : 1506 - 1510
  • [36] Event-triggered consensus control of continuous-time stochastic multi-agent systems
    Cao, Xiangyang
    Zhang, Chenghui
    Zhao, Daduan
    Sun, Bo
    Li, Yan
    AUTOMATICA, 2022, 137
  • [37] Localized data-driven consensus control for continuous-time multi-agent systems
    Chang, Zeze
    Li, Zhongkui
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024,
  • [38] Cooperative robust optimal control of uncertain multi-agent systems
    Zhang, Zhuo
    Zhang, Shouxu
    Li, Huiping
    Yan, Weisheng
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2020, 357 (14): : 9467 - 9483
  • [39] A DDPG-based solution for optimal consensus of continuous-time linear multi-agent systems
    Li, Ye
    Liu, ZhongXin
    Lan, Ge
    Sader, Malika
    Chen, ZengQiang
    SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2023, 66 (08) : 2441 - 2453
  • [40] A DDPG-based solution for optimal consensus of continuous-time linear multi-agent systems
    Ye Li
    ZhongXin Liu
    Ge Lan
    Malika Sader
    ZengQiang Chen
    Science China Technological Sciences, 2023, 66 : 2441 - 2453