Celebrating Diversity in Shared Multi-Agent Reinforcement Learning

Authors
Li, Chenghao [1]
Wang, Tonghan [1]
Wu, Chengjie [1]
Zhao, Qianchuan [1]
Yang, Jun [1]
Zhang, Chongjie [1]
Affiliations
[1] Tsinghua University, Beijing, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recently, deep multi-agent reinforcement learning (MARL) has shown the promise to solve complex cooperative tasks. Its success is partly because of parameter sharing among agents. However, such sharing may lead agents to behave similarly and limit their coordination capacity. In this paper, we aim to introduce diversity in both optimization and representation of shared multi-agent reinforcement learning. Specifically, we propose an information-theoretical regularization to maximize the mutual information between agents' identities and their trajectories, encouraging extensive exploration and diverse individualized behaviors. In representation, we incorporate agent-specific modules in the shared neural network architecture, which are regularized by L1-norm to promote learning sharing among agents while keeping necessary diversity. Empirical results show that our method achieves state-of-the-art performance on Google Research Football and super hard StarCraft II micromanagement tasks.
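The two regularizers named in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not say how the mutual information is estimated, so the sketch assumes trajectories have already been reduced to discrete labels and uses a plain empirical estimate. The function names, the weights `beta` and `lam`, and the way the terms are combined are all hypothetical.

```python
import math
from collections import Counter


def mutual_information(pairs):
    """Empirical I(identity; trajectory) in nats.

    `pairs` is a list of (agent_id, trajectory_label) samples. Treating a
    trajectory as one discrete label is a simplifying assumption.
    """
    n = len(pairs)
    joint = Counter(pairs)
    id_counts = Counter(agent for agent, _ in pairs)
    traj_counts = Counter(traj for _, traj in pairs)
    mi = 0.0
    for (agent, traj), count in joint.items():
        p_joint = count / n
        # Ratio p(a, t) / (p(a) * p(t)) written with raw counts.
        ratio = count * n / (id_counts[agent] * traj_counts[traj])
        mi += p_joint * math.log(ratio)
    return mi


def l1_penalty(agent_params):
    """Sum of absolute values over all agent-specific parameters."""
    return sum(abs(w) for params in agent_params.values() for w in params)


def regularized_objective(task_loss, pairs, agent_params, beta=0.1, lam=0.01):
    """Hypothetical combined loss: reward high mutual information (diverse
    behavior) and penalize large agent-specific weights (keep sharing)."""
    return task_loss - beta * mutual_information(pairs) + lam * l1_penalty(agent_params)
```

When every agent produces the same trajectory label, the estimate is zero, so the diversity term gives no reward; when each agent has its own label, the estimate reaches its maximum of log(number of agents) for uniformly sampled agents.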
Pages: 12