Enhancing Cooperative Multi-Agent Systems With Self-Advice and Near-Neighbor Priority Collision Control

被引：1

作者：

Palacios-Morocho, Elizabeth ^{[1
]}

Inca, Saul ^{[1
]}

Monserrat, Jose F. ^{[1
]}

机构：

[1] Univ Politecn Valencia, Inst Telecommun & Multime dia Applicat iTEAM, Valencia 46022, Spain

来源：

IEEE TRANSACTIONS ON INTELLIGENT VEHICLES | 2024年 / 9卷 / 01期

关键词：

Multi-agent systems; Behavioral sciences; Reinforcement learning; Training; Task analysis; Recurrent neural networks; Intelligent vehicles; Cooperative multi-agent system; reinforcement learning; independent learning; joint action learning; k-nearest neighbors; deep deterministic policy gradient;

D O I：

10.1109/TIV.2023.3293198

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The coordination of actions to be executed by multiple independent agents in a dynamic environment is one of the main challenges of multi-agent systems. To address this type of scenario, a key technology called Reinforcement Learning (RL) has emerged, which enables the training of optimal cooperative policies among agents. However, traditional value decomposition methods suffer from unstable convergence when the number of agents increases. To address this problem, this article proposes a novel algorithm based on centralized learning that employs a self-advice module to replace the joint action, thereby reducing the algorithmic complexity. The proposed algorithm uses the Joint Action Learning (JAL) concept to find an optimal approach and a collision controller module that was designed to further mitigate the risk of collisions. A comparison of the algorithm proposed is carried out with two benchmark algorithms. The first one focuses on decomposing the reward signal and the second one trains a different actor-critic network for each agent. Furthermore, multiple target points are defined to enhance cooperative scenarios during the learning process. According to the results, the proposed approach outperforms the two benchmarks by 8% and 49%, thus highlighting the effectiveness of the centralized learning approach in multi-agent systems.

引用

页码：2864 / 2877

页数：14

共 50 条

[21] Cooperative Control for Industrial Multi-agent Systems: Framework and Problems
Wang, Bohui
Fang, Xinpeng
Zhang, Bin
2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 6971 - 6976
[22] Cooperative robust optimal control of uncertain multi-agent systems
Zhang, Zhuo
Zhang, Shouxu
Li, Huiping
Yan, Weisheng
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2020, 357 (14): : 9467 - 9483
[23] Robust Cooperative Control Reconfiguration/Recovery in Multi-Agent Systems
Gallehdari, Zahra
Meskin, Nader
Khorasani, Khashayar
2014 EUROPEAN CONTROL CONFERENCE (ECC), 2014, : 1554 - 1561
[24] An H∞ cooperative fault recovery control of multi-agent systems*
Gallehdari, Zahra
Meskin, Nader
Khorasani, Khashayar
AUTOMATICA, 2017, 84 : 101 - 108
[25] Special issue on learning and control in cooperative multi-agent systems
Lewis, Frank L.
Jiang, Zhong-Ping
Liu, Tengfei
Control Theory and Technology, 2015, 13 (01) : 44 - 44
[26] Finite Horizon Cooperative Formation Control for Multi-agent Systems
Shi, Jiantao
PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 4849 - 4852
[27] Remote formation control and collision avoidance for multi-agent nonholonomic systems
Mastellone, Silvia
Stipanovic, Dusan A.
Spong, Mark W.
PROCEEDINGS OF THE 2007 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-10, 2007, : 1062 - +
[28] Collision avoidance control for discrete multi-agent systems with communication outages
Liu Z.
Liu Y.-C.
Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2023, 40 (12): : 2172 - 2178
[29] Secondary control of microgrids based on distributed cooperative control of multi-agent systems
Bidram, Ali
Davoudi, Ali
Lewis, Frank L.
Qu, Zhihua
IET GENERATION TRANSMISSION & DISTRIBUTION, 2013, 7 (08) : 822 - 831
[30] Containment Control for the Cooperative Output Regulation of Linear Multi-Agent Systems
Shen Yanchao
Yan Huaicheng
Zhang Hao
Shi Hongbo
2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 7296 - 7301

← 1 2 3 4 5 →