Enhancing Cooperative Multi-Agent Systems With Self-Advice and Near-Neighbor Priority Collision Control

被引:1
|
作者
Palacios-Morocho, Elizabeth [1 ]
Inca, Saul [1 ]
Monserrat, Jose F. [1 ]
机构
[1] Univ Politecn Valencia, Inst Telecommun & Multime dia Applicat iTEAM, Valencia 46022, Spain
来源
关键词
Multi-agent systems; Behavioral sciences; Reinforcement learning; Training; Task analysis; Recurrent neural networks; Intelligent vehicles; Cooperative multi-agent system; reinforcement learning; independent learning; joint action learning; k-nearest neighbors; deep deterministic policy gradient;
D O I
10.1109/TIV.2023.3293198
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The coordination of actions to be executed by multiple independent agents in a dynamic environment is one of the main challenges of multi-agent systems. To address this type of scenario, a key technology called Reinforcement Learning (RL) has emerged, which enables the training of optimal cooperative policies among agents. However, traditional value decomposition methods suffer from unstable convergence when the number of agents increases. To address this problem, this article proposes a novel algorithm based on centralized learning that employs a self-advice module to replace the joint action, thereby reducing the algorithmic complexity. The proposed algorithm uses the Joint Action Learning (JAL) concept to find an optimal approach and a collision controller module that was designed to further mitigate the risk of collisions. A comparison of the algorithm proposed is carried out with two benchmark algorithms. The first one focuses on decomposing the reward signal and the second one trains a different actor-critic network for each agent. Furthermore, multiple target points are defined to enhance cooperative scenarios during the learning process. According to the results, the proposed approach outperforms the two benchmarks by 8% and 49%, thus highlighting the effectiveness of the centralized learning approach in multi-agent systems.
引用
收藏
页码:2864 / 2877
页数:14
相关论文
共 50 条
  • [21] Cooperative Control for Industrial Multi-agent Systems: Framework and Problems
    Wang, Bohui
    Fang, Xinpeng
    Zhang, Bin
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 6971 - 6976
  • [22] Cooperative robust optimal control of uncertain multi-agent systems
    Zhang, Zhuo
    Zhang, Shouxu
    Li, Huiping
    Yan, Weisheng
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2020, 357 (14): : 9467 - 9483
  • [23] Robust Cooperative Control Reconfiguration/Recovery in Multi-Agent Systems
    Gallehdari, Zahra
    Meskin, Nader
    Khorasani, Khashayar
    2014 EUROPEAN CONTROL CONFERENCE (ECC), 2014, : 1554 - 1561
  • [24] An H∞ cooperative fault recovery control of multi-agent systems*
    Gallehdari, Zahra
    Meskin, Nader
    Khorasani, Khashayar
    AUTOMATICA, 2017, 84 : 101 - 108
  • [25] Special issue on learning and control in cooperative multi-agent systems
    Lewis, Frank L.
    Jiang, Zhong-Ping
    Liu, Tengfei
    Control Theory and Technology, 2015, 13 (01) : 44 - 44
  • [26] Finite Horizon Cooperative Formation Control for Multi-agent Systems
    Shi, Jiantao
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 4849 - 4852
  • [27] Remote formation control and collision avoidance for multi-agent nonholonomic systems
    Mastellone, Silvia
    Stipanovic, Dusan A.
    Spong, Mark W.
    PROCEEDINGS OF THE 2007 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-10, 2007, : 1062 - +
  • [28] Collision avoidance control for discrete multi-agent systems with communication outages
    Liu Z.
    Liu Y.-C.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2023, 40 (12): : 2172 - 2178
  • [29] Secondary control of microgrids based on distributed cooperative control of multi-agent systems
    Bidram, Ali
    Davoudi, Ali
    Lewis, Frank L.
    Qu, Zhihua
    IET GENERATION TRANSMISSION & DISTRIBUTION, 2013, 7 (08) : 822 - 831
  • [30] Containment Control for the Cooperative Output Regulation of Linear Multi-Agent Systems
    Shen Yanchao
    Yan Huaicheng
    Zhang Hao
    Shi Hongbo
    2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 7296 - 7301