Decomposing shared networks for separate cooperation with multi-agent reinforcement learning

被引:4
|
作者
Liu, Weiwei [1 ]
Peng, Linpeng [1 ]
Wen, Licheng [1 ]
Yang, Jian [2 ]
Liu, Yong [1 ]
机构
[1] Zhejiang Univ, Coll Control Sci & Engn, Adv Percept Robot & Intelligent Learning Lab, Hangzhou 310027, Peoples R China
[2] China Res & Dev Acad Machinery Equipment, Beijing, Peoples R China
基金
国家重点研发计划;
关键词
Multi-agent reinforcement learning; Neural network; Multi-agent systems; Navigation planning;
D O I
10.1016/j.ins.2023.119085
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sharing network parameters between agents is an essential and typical operation to improve the scalability of multi-agent reinforcement learning algorithms. However, agents with different tasks sharing the same network parameters are not conducive to distinguishing the agents' skills. In addition, the importance of communication between agents undertaking the same task is much higher than that with external agents. Therefore, we propose Dual Cooperation Networks (DCN). In order to distinguish whether agents undertake the same task, all agents are grouped according to their status through the graph neural network instead of the traditional proximity. The agent communicates within the group to achieve strong cooperation. After that, the global value function is decomposed by groups to facilitate cooperation between groups. Finally, we have verified it in simulation and physical hardware, and the algorithm has achieved excellent performance.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] DIFFER: Decomposing Individual Reward for Fair Experience Replay in Multi-Agent Reinforcement Learning
    Hu, Xunhan
    Zhao, Jian
    Zhou, Wengang
    Feng, Ruili
    Li, Houqiang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [32] Efficient Communications for Multi-Agent Reinforcement Learning in Wireless Networks
    Lv, Zefang
    Du, Yousong
    Chen, Yifan
    Xiao, Liang
    Han, Shuai
    Ji, Xiangyang
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 583 - 588
  • [33] Multi-agent reinforcement learning algorithm based on neural networks
    Tang, Lianggui
    Yang, Hu
    An, Bo
    Cheng, Daijie
    DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13E : 1569 - 1574
  • [34] A Survey on Multi-Agent Reinforcement Learning Methods for Vehicular Networks
    Althamary, Ibrahim
    Huang, Chih-Wei
    Lin, Phone
    2019 15TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2019, : 1154 - 1159
  • [35] Multi-Agent Reinforcement Learning with Shared Policy for Cloud Quota Management Problem
    Cheng, Tong
    Dong, Hang
    Wang, Lu
    Qiao, Bo
    Qin, Si
    Lin, Qingwei
    Zhang, Dongmei
    Rajmohan, Saravan
    Moscibroda, Thomas
    COMPANION OF THE WORLD WIDE WEB CONFERENCE, WWW 2023, 2023, : 391 - 395
  • [36] Multi-agent reinforcement learning: an approach based on agents' cooperation for a common goal
    Wang, GQ
    Yu, HB
    PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, VOL 1, 2004, : 336 - 339
  • [37] Emergent cooperation from mutual acknowledgment exchange in multi-agent reinforcement learning
    Phan, Thomy
    Sommer, Felix
    Ritz, Fabian
    Altmann, Philipp
    Nuesslein, Jonas
    Koelle, Michael
    Belzner, Lenz
    Linnhoff-Popien, Claudia
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2024, 38 (02)
  • [38] A fully value distributional deep reinforcement learning framework for multi-agent cooperation
    Fu, Mingsheng
    Huang, Liwei
    Li, Fan
    Qu, Hong
    Xu, Chengzhong
    NEURAL NETWORKS, 2025, 184
  • [39] Innovative Approach Towards Cooperation Models for Multi-agent Reinforcement Learning (CMMARL)
    Vidhate, Deepak A.
    Kulkarni, Parag
    SMART TRENDS IN INFORMATION TECHNOLOGY AND COMPUTER COMMUNICATIONS, SMARTCOM 2016, 2016, 628 : 468 - 478
  • [40] Partner Selection for the Emergence of Cooperation in Multi-Agent Systems Using Reinforcement Learning
    Anastassacos, Nicolas
    Hailes, Stephen
    Musolesi, Mirco
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 7047 - 7054