Optimal couple-group tracking control for the heterogeneous multi-agent systems with cooperative-competitive interactions via reinforcement learning method

被引:8
|
作者
Li, Jun [1 ,2 ]
Ji, Lianghao [1 ,2 ]
Zhang, Cuijuan [1 ,2 ]
Li, Huaqing [3 ]
机构
[1] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Computat Intelligence, Chongqing 400065, Peoples R China
[2] Chongqing Univ Posts & Telecommun, Sch Comp Sci & Technol, Chongqing 400065, Peoples R China
[3] Southwest Univ, Coll Elect & Informat Engn, Chongqing 400715, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning (RL); Cooperative-competitive interaction; Optimal couple-group tracking control (OCGTC); Heterogeneous multi-agent systems (HeMASs); OUTPUT SYNCHRONIZATION; CLUSTER CONSENSUS; TIME NETWORKS; GAMES; ALGORITHM;
D O I
10.1016/j.ins.2022.07.181
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we study a class of optimal couple-group tracking control (OCGTC) problems for heterogeneous multi-agent systems (HeMASs) based on reinforcement learning (RL) method, whose goal is to minimize the local tracking errors (states) and control inputs (ac-tions) of followers by learning the dynamic knowledge of a single leader. The weakly con-nected multi-agent network is randomly divided into coupled sub-networks, and each agent in the same sub-network cooperates to accomplish tracking control such that the positions and velocities of all the agents converge to the same value, while the agents from different subgroups compete with each other to dissimilar tracking goals. In particular, in the discussed HeMASs, we consider agents with unknown dynamics of first-order and second-order. To solve the algebraic Riccati equation (ARE), an policy-value-based actor -critic technique is applied. Using the Lyapunov-like theorem, we verify that the local track-ing error and the estimated weights of actor-critic neural networks are deduced to be uni-formly ultimately bounded. Eventually, several simulations demonstrate the correctness of the retrieved theoretical results. (c) 2022 Elsevier Inc. All rights reserved.
引用
收藏
页码:401 / 424
页数:24
相关论文
共 50 条
  • [31] The impact of sociality regimes on heterogeneous cooperative-competitive multi-agent reinforcement learning: a study with the predator-prey game
    Zhao, Yue
    Hernandez-Orallo, Jose
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2024,
  • [32] Bottom-up multi-agent reinforcement learning by reward shaping for cooperative-competitive tasks
    Aotani, Takumi
    Kobayashi, Taisuke
    Sugimoto, Kenji
    APPLIED INTELLIGENCE, 2021, 51 (07) : 4434 - 4452
  • [33] Couple-group consensus for heterogeneous MASs under switched topologies in cooperative-competitive systems: A hybrid pinning and delta operator skills
    Pu, Xingcheng
    Ren, Li
    Liu, Yi
    Pu, Rui
    NEUROCOMPUTING, 2021, 441 : 335 - 349
  • [34] Bottom-up multi-agent reinforcement learning by reward shaping for cooperative-competitive tasks
    Takumi Aotani
    Taisuke Kobayashi
    Kenji Sugimoto
    Applied Intelligence, 2021, 51 : 4434 - 4452
  • [35] A Novel Multi-Agent Parallel-Critic Network Architecture for Cooperative-Competitive Reinforcement Learning
    Sun, Yu
    Lai, Jun
    Cao, Lei
    Chen, Xiliang
    Xu, Zhixiong
    Xu, Yue
    IEEE ACCESS, 2020, 8 : 135605 - 135616
  • [36] Bipartite Consensus of Heterogeneous Multi-agent Systems Based on Event-triggered Control and Cooperative-competitive Relation
    Pu Xingcheng
    Mu, Yilan
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4112 - 4119
  • [37] Couple-group Consensus of Multi-agent Systems with Directed and Fixed Topology
    Tan Chong
    Liu Guo-Ping
    Duan Guang-Ren
    2011 30TH CHINESE CONTROL CONFERENCE (CCC), 2011, : 6515 - 6520
  • [38] Data-driven optimal cooperative tracking control for heterogeneous multi-agent systems
    Ma, Yong-Sheng
    Xu, Yong
    Sun, Jian
    Dou, Li-Hua
    ISA Transactions, 2024, 154 : 23 - 31
  • [39] Fully distributed event-triggered pinning group consensus control for heterogeneous multi-agent systems with cooperative-competitive interaction strength
    Li, Kangying
    Ji, Lianghao
    Zhang, Cuijuan
    Li, Huaqing
    NEUROCOMPUTING, 2021, 464 : 273 - 281
  • [40] Weighted Group Consensus for Discrete-Time Heterogeneous Multi-Agent Systems in the Cooperative-Competitive Network With Time Delays
    Pu, Xingcheng
    Zhao, Longlong
    Xiong, Chaowen
    IEEE ACCESS, 2019, 7 : 123679 - 123688