Multi-agent deep reinforcement learning with type-based hierarchical group communication

被引：12

作者：

Jiang, Hao ^{[1
]}

Shi, Dianxi ^{[2
,3
]}

Xue, Chao ^{[2
,3
]}

Wang, Yajie ^{[1
]}

Wang, Gongju ^{[2
]}

Zhang, Yongjun ^{[2
]}

机构：

[1] Natl Univ Def Technol, Coll Comp, Changsha, Peoples R China

[2] Natl Innovat Inst Def Technol, Artificial Intelligence Res Ctr, Beijing, Peoples R China

[3] Tianjin Artificial Intelligence Innovat Ctr, Tianjin, Peoples R China

来源：

APPLIED INTELLIGENCE | 2021年 / 51卷 / 08期

基金：

中国博士后科学基金;

关键词：

Multi-agent reinforcement learning; Group cognitive consistency; Group communication; Value decomposition;

D O I：

10.1007/s10489-020-02065-9

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Real-world multi-agent tasks often involve varying types and quantities of agents. These agents connected by complex interaction relationships causes great difficulty for policy learning because they need to learn various interaction types to complete a given task. Therefore, simplifying the learning process is an important issue. In multi-agent systems, agents with a similar type often interact more with each other and exhibit behaviors more similar. That means there are stronger collaborations between these agents. Most existing multi-agent reinforcement learning (MARL) algorithms expect to learn the collaborative strategies of all agents directly in order to maximize the common rewards. This causes the difficulty of policy learning to increase exponentially as the number and types of agents increase. To address this problem, we propose a type-based hierarchical group communication (THGC) model. This model uses prior domain knowledge or predefine rule to group agents, and maintains the group's cognitive consistency through knowledge sharing. Subsequently, we introduce a group communication and value decomposition method to ensure cooperation between the various groups. Experiments demonstrate that our model outperforms state-of-the-art MARL methods on the widely adopted StarCraft II benchmarks across different scenarios, and also possesses potential value for large-scale real-world applications.

引用

页码：5793 / 5808

页数：16

共 50 条

[1] Multi-agent deep reinforcement learning with type-based hierarchical group communication
Hao Jiang
Dianxi Shi
Chao Xue
Yajie Wang
Gongju Wang
Yongjun Zhang
Applied Intelligence, 2021, 51 : 5793 - 5808
[2] Deep Hierarchical Communication Graph in Multi-Agent Reinforcement Learning
Liu, Zeyang
Wan, Lipeng
Sui, Xue
Chen, Zhuoran
Sun, Kewu
Lan, Xuguang
PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 208 - 216
[3] GHGC: Goal-based Hierarchical Group Communication in Multi-Agent Reinforcement Learning
Jiang, Hao
Shi, Dianxi
Xue, Chao
Wang, Yajie
Wang, Gongju
Zhang, Yongjun
2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 3507 - 3514
[4] When Does Communication Learning Need Hierarchical Multi-Agent Deep Reinforcement Learning
Ossenkopf, Marie
Jorgensen, Mackenzie
Geihs, Kurt
CYBERNETICS AND SYSTEMS, 2019, 50 (08) : 672 - 692
[5] Multi-Agent Deep Reinforcement Learning with Emergent Communication
Simoes, David
Lau, Nuno
Reis, Luis Paulo
2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
[6] A survey of multi-agent deep reinforcement learning with communication
Zhu, Changxi
Dastani, Mehdi
Wang, Shihan
AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2024, 38 (01)
[7] Hierarchical multi-agent reinforcement learning
Mohammad Ghavamzadeh
Sridhar Mahadevan
Rajbala Makar
Autonomous Agents and Multi-Agent Systems, 2006, 13 : 197 - 229
[8] Hierarchical multi-agent reinforcement learning
Ghavamzadeh, Mohammad
Mahadevan, Sridhar
Makar, Rajbala
AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2006, 13 (02) : 197 - 229
[9] Learning multi-agent communication with double attentional deep reinforcement learning
Mao, Hangyu
Zhang, Zhengchao
Xiao, Zhen
Gong, Zhibo
Ni, Yan
AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2020, 34 (01)
[10] Multi-agent communication cooperation based on deep reinforcement learning and information theory
Gao, Bing
Zhang, Zhejie
Zou, Qijie
Liu, Zhiguo
Zhao, Xiling
Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2024, 45 (18):

← 1 2 3 4 5 →