Multi-agent deep reinforcement learning with type-based hierarchical group communication

被引:12
|
作者
Jiang, Hao [1 ]
Shi, Dianxi [2 ,3 ]
Xue, Chao [2 ,3 ]
Wang, Yajie [1 ]
Wang, Gongju [2 ]
Zhang, Yongjun [2 ]
机构
[1] Natl Univ Def Technol, Coll Comp, Changsha, Peoples R China
[2] Natl Innovat Inst Def Technol, Artificial Intelligence Res Ctr, Beijing, Peoples R China
[3] Tianjin Artificial Intelligence Innovat Ctr, Tianjin, Peoples R China
基金
中国博士后科学基金;
关键词
Multi-agent reinforcement learning; Group cognitive consistency; Group communication; Value decomposition;
D O I
10.1007/s10489-020-02065-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Real-world multi-agent tasks often involve varying types and quantities of agents. These agents connected by complex interaction relationships causes great difficulty for policy learning because they need to learn various interaction types to complete a given task. Therefore, simplifying the learning process is an important issue. In multi-agent systems, agents with a similar type often interact more with each other and exhibit behaviors more similar. That means there are stronger collaborations between these agents. Most existing multi-agent reinforcement learning (MARL) algorithms expect to learn the collaborative strategies of all agents directly in order to maximize the common rewards. This causes the difficulty of policy learning to increase exponentially as the number and types of agents increase. To address this problem, we propose a type-based hierarchical group communication (THGC) model. This model uses prior domain knowledge or predefine rule to group agents, and maintains the group's cognitive consistency through knowledge sharing. Subsequently, we introduce a group communication and value decomposition method to ensure cooperation between the various groups. Experiments demonstrate that our model outperforms state-of-the-art MARL methods on the widely adopted StarCraft II benchmarks across different scenarios, and also possesses potential value for large-scale real-world applications.
引用
收藏
页码:5793 / 5808
页数:16
相关论文
共 50 条
  • [1] Multi-agent deep reinforcement learning with type-based hierarchical group communication
    Hao Jiang
    Dianxi Shi
    Chao Xue
    Yajie Wang
    Gongju Wang
    Yongjun Zhang
    [J]. Applied Intelligence, 2021, 51 : 5793 - 5808
  • [2] Deep Hierarchical Communication Graph in Multi-Agent Reinforcement Learning
    Liu, Zeyang
    Wan, Lipeng
    Sui, Xue
    Chen, Zhuoran
    Sun, Kewu
    Lan, Xuguang
    [J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 208 - 216
  • [3] GHGC: Goal-based Hierarchical Group Communication in Multi-Agent Reinforcement Learning
    Jiang, Hao
    Shi, Dianxi
    Xue, Chao
    Wang, Yajie
    Wang, Gongju
    Zhang, Yongjun
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 3507 - 3514
  • [4] When Does Communication Learning Need Hierarchical Multi-Agent Deep Reinforcement Learning
    Ossenkopf, Marie
    Jorgensen, Mackenzie
    Geihs, Kurt
    [J]. CYBERNETICS AND SYSTEMS, 2019, 50 (08) : 672 - 692
  • [5] Multi-Agent Deep Reinforcement Learning with Emergent Communication
    Simoes, David
    Lau, Nuno
    Reis, Luis Paulo
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [6] Hierarchical multi-agent reinforcement learning
    Mohammad Ghavamzadeh
    Sridhar Mahadevan
    Rajbala Makar
    [J]. Autonomous Agents and Multi-Agent Systems, 2006, 13 : 197 - 229
  • [7] Hierarchical multi-agent reinforcement learning
    Ghavamzadeh, Mohammad
    Mahadevan, Sridhar
    Makar, Rajbala
    [J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2006, 13 (02) : 197 - 229
  • [8] Learning multi-agent communication with double attentional deep reinforcement learning
    Mao, Hangyu
    Zhang, Zhengchao
    Xiao, Zhen
    Gong, Zhibo
    Ni, Yan
    [J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2020, 34 (01)
  • [9] Learning multi-agent communication with double attentional deep reinforcement learning
    Hangyu Mao
    Zhengchao Zhang
    Zhen Xiao
    Zhibo Gong
    Yan Ni
    [J]. Autonomous Agents and Multi-Agent Systems, 2020, 34
  • [10] Multi-agent reinforcement learning based on local communication
    Zhang, Wenxu
    Ma, Lei
    Li, Xiaonan
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 6): : 15357 - 15366