Optimal couple-group tracking control for the heterogeneous multi-agent systems with cooperative-competitive interactions via reinforcement learning method

被引：8

作者：

Li, Jun ^{[1
,2
]}

Ji, Lianghao ^{[1
,2
]}

Zhang, Cuijuan ^{[1
,2
]}

Li, Huaqing ^{[3
]}

机构：

[1] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Computat Intelligence, Chongqing 400065, Peoples R China

[2] Chongqing Univ Posts & Telecommun, Sch Comp Sci & Technol, Chongqing 400065, Peoples R China

[3] Southwest Univ, Coll Elect & Informat Engn, Chongqing 400715, Peoples R China

来源：

INFORMATION SCIENCES | 2022年 / 610卷

基金：

中国国家自然科学基金;

关键词：

Reinforcement learning (RL); Cooperative-competitive interaction; Optimal couple-group tracking control (OCGTC); Heterogeneous multi-agent systems (HeMASs); OUTPUT SYNCHRONIZATION; CLUSTER CONSENSUS; TIME NETWORKS; GAMES; ALGORITHM;

D O I：

10.1016/j.ins.2022.07.181

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, we study a class of optimal couple-group tracking control (OCGTC) problems for heterogeneous multi-agent systems (HeMASs) based on reinforcement learning (RL) method, whose goal is to minimize the local tracking errors (states) and control inputs (ac-tions) of followers by learning the dynamic knowledge of a single leader. The weakly con-nected multi-agent network is randomly divided into coupled sub-networks, and each agent in the same sub-network cooperates to accomplish tracking control such that the positions and velocities of all the agents converge to the same value, while the agents from different subgroups compete with each other to dissimilar tracking goals. In particular, in the discussed HeMASs, we consider agents with unknown dynamics of first-order and second-order. To solve the algebraic Riccati equation (ARE), an policy-value-based actor -critic technique is applied. Using the Lyapunov-like theorem, we verify that the local track-ing error and the estimated weights of actor-critic neural networks are deduced to be uni-formly ultimately bounded. Eventually, several simulations demonstrate the correctness of the retrieved theoretical results. (c) 2022 Elsevier Inc. All rights reserved.

引用

页码：401 / 424

页数：24

共 50 条

[31] The impact of sociality regimes on heterogeneous cooperative-competitive multi-agent reinforcement learning: a study with the predator-prey game
Zhao, Yue
Hernandez-Orallo, Jose
JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2024,
[32] Bottom-up multi-agent reinforcement learning by reward shaping for cooperative-competitive tasks
Aotani, Takumi
Kobayashi, Taisuke
Sugimoto, Kenji
APPLIED INTELLIGENCE, 2021, 51 (07) : 4434 - 4452
[33] Couple-group consensus for heterogeneous MASs under switched topologies in cooperative-competitive systems: A hybrid pinning and delta operator skills
Pu, Xingcheng
Ren, Li
Liu, Yi
Pu, Rui
NEUROCOMPUTING, 2021, 441 : 335 - 349
[34] Bottom-up multi-agent reinforcement learning by reward shaping for cooperative-competitive tasks
Takumi Aotani
Taisuke Kobayashi
Kenji Sugimoto
Applied Intelligence, 2021, 51 : 4434 - 4452
[35] A Novel Multi-Agent Parallel-Critic Network Architecture for Cooperative-Competitive Reinforcement Learning
Sun, Yu
Lai, Jun
Cao, Lei
Chen, Xiliang
Xu, Zhixiong
Xu, Yue
IEEE ACCESS, 2020, 8 : 135605 - 135616
[36] Bipartite Consensus of Heterogeneous Multi-agent Systems Based on Event-triggered Control and Cooperative-competitive Relation
Pu Xingcheng
Mu, Yilan
2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4112 - 4119
[37] Couple-group Consensus of Multi-agent Systems with Directed and Fixed Topology
Tan Chong
Liu Guo-Ping
Duan Guang-Ren
2011 30TH CHINESE CONTROL CONFERENCE (CCC), 2011, : 6515 - 6520
[38] Data-driven optimal cooperative tracking control for heterogeneous multi-agent systems
Ma, Yong-Sheng
Xu, Yong
Sun, Jian
Dou, Li-Hua
ISA Transactions, 2024, 154 : 23 - 31
[39] Fully distributed event-triggered pinning group consensus control for heterogeneous multi-agent systems with cooperative-competitive interaction strength
Li, Kangying
Ji, Lianghao
Zhang, Cuijuan
Li, Huaqing
NEUROCOMPUTING, 2021, 464 : 273 - 281
[40] Weighted Group Consensus for Discrete-Time Heterogeneous Multi-Agent Systems in the Cooperative-Competitive Network With Time Delays
Pu, Xingcheng
Zhao, Longlong
Xiong, Chaowen
IEEE ACCESS, 2019, 7 : 123679 - 123688

← 1 2 3 4 5 →