Improving scalability of multi-agent reinforcement learning with parameters sharing

被引:3
|
作者
Yang, Ning [1 ]
Shi, PeiChang [1 ]
Ding, Bo [1 ]
Feng, Dawei [1 ]
机构
[1] Natl Univ Def Technol, Coll Comp Sci, Changsha, Peoples R China
关键词
Deep reinforcement learning; Multi-agent system; Scalability;
D O I
10.1109/JCC56315.2022.00013
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Improving the scalability of a multi-agent system is one of the key challenges for applying reinforcement learning to learn an effective policy. Parameter sharing is a common approach used to improve the efficiency of learning by reducing the volume of policy network parameters that need to be updated. However, sharing parameters also reduces the variance between agents' policies, which further restricts the diversity of their behaviors. In this paper, we introduce a policy parameter sharing approach, it maintains a policy network for each agent, and only updates one of them. The differentiated behavior of agents is maintained by the policy, while sharing parameters are updated through a soft way. Experiments in foraging scenarios demonstrate that our method can effectively improve the performance and also the scalability of the multi-agent systems.
引用
收藏
页码:37 / 42
页数:6
相关论文
共 50 条
  • [1] Reinforcement learning for multi-agent formation navigation with scalability
    Gong, Yalei
    Xiong, Hongyun
    Li, Mengmeng
    Wang, Haibo
    Nian, Xiaohong
    [J]. APPLIED INTELLIGENCE, 2023, 53 (23) : 28207 - 28225
  • [2] Reinforcement learning for multi-agent formation navigation with scalability
    Yalei Gong
    Hongyun Xiong
    MengMeng Li
    Haibo Wang
    Xiaohong Nian
    [J]. Applied Intelligence, 2023, 53 : 28207 - 28225
  • [3] Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning
    Chen, Hao
    Yang, Guangkai
    Zhang, Junge
    Yin, Qiyue
    Huang, Kaiqi
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [4] A survey on scalability and transferability of multi-agent deep reinforcement learning
    Yan, Chao
    Xiang, Xiao-Jia
    Xu, Xin
    Wang, Chang
    Zhou, Han
    Shen, Lin-Cheng
    [J]. Kongzhi yu Juece/Control and Decision, 2022, 37 (12): : 3083 - 3102
  • [5] Rationality of reward sharing in multi-agent reinforcement learning
    Kazuteru Miyazaki
    Shigenobu Kobayashi
    [J]. New Generation Computing, 2001, 19 : 157 - 172
  • [6] Rationality of reward sharing in multi-agent reinforcement learning
    Miyazaki, K
    Kobayashi, S
    [J]. NEW GENERATION COMPUTING, 2001, 19 (02) : 157 - 172
  • [7] On the rationality of profit sharing in multi-agent reinforcement learning
    Miyazaki, K
    Kobayashi, S
    [J]. ICCIMA 2001: FOURTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, PROCEEDINGS, 2001, : 421 - 425
  • [8] On the rationality of Profit Sharing in multi-agent reinforcement learning
    Miyazaki, K
    Kobayashi, S
    [J]. ICCIMA 2001: FOURTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, PROCEEDINGS, 2001, : 123 - 127
  • [9] A Multi-agent Reinforcement Learning with Weighted Experience Sharing
    Yu, Lasheng
    Abdulai, Issahaku
    [J]. ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF ARTIFICIAL INTELLIGENCE, 2012, 6839 : 219 - 225
  • [10] Improving the scalability of multi-agent systems
    Turner, PJ
    Jennings, NR
    [J]. INFRASTRUCTURE FOR AGENTS, MULTI-AGENT SYSTEMS, AND SCALABLE MULTI-AGENT SYSTEMS, 2001, 1887 : 246 - 262