Cooperative Multi-agent Reinforcement Learning with Hierachical Communication Architecture

被引：0

作者：

Liu, Shifan ^{[1
]}

Yuan, Quan ^{[1
]}

Chen, Bo ^{[1
]}

Luo, Guiyang ^{[1
]}

Li, Jinglin ^{[1
]}

机构：

[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China

来源：

ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT II | 2022年 / 13530卷

关键词：

Multi-agent cooperation; Hierarchical architecture; Communication;

D O I：

10.1007/978-3-031-15931-2_2

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Communication is an essential way for multi-agent system to coordinate. By sharing local observations and intentions via communication channel, agents can better deal with dynamic environment and thus make optimal decisions. However, restricted by the limited communication channel, agents have to leverage less communication resources to transmit more informative messages. In this article, we propose a two-level hierarchical multi-agent reinforcement learning algorithm which utilizes different timescales in different levels. Communication happens only between high levels at a coarser time scale to generate sub-goals which convey the intention of agents for the low level. And the low level is responsible for implementing these sub-goals by controlling primitive actions at every tick of environment. Sub-goal is the core of this hierachical communication architecture which requires the high level to communicate efficiently and provide guidance for the low level to coordinate. This hierarchical communication architecture conveys several benefits: 1) It coarsens the collaborative granularity and reduces the requirement of communication since communication happens only in high level at a larger scale; 2) It enables the high level to focus on the coordination of goals without paying attention to implementation, thus improves the efficiency of communication; and 3) It makes better control by dividing a complex multi-agent cooperative task into multiple single-agent tasks. In experiments, we apply our approach in vehicle collision avoidance tasks and achieve better performance than baselines.

引用

页码：14 / 25

页数：12

共 50 条

[1] Cooperative Behavior by Multi-agent Reinforcement Learning with Abstractive Communication
Tanda, Jin
Moustafa, Ahmed
Ito, Takayuki
[J]. 2019 IEEE INTERNATIONAL CONFERENCE ON AGENTS (ICA), 2019, : 8 - 13
[2] DPMAC: Differentially Private Communication for Cooperative Multi-Agent Reinforcement Learning
Zhao, Canzhe
Ze, Yanjie
Dong, Jing
Wang, Baoxiang
Li, Shuai
[J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 4638 - 4646
[3] Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning
Chen, Hao
Yang, Guangkai
Zhang, Junge
Yin, Qiyue
Huang, Kaiqi
[J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[4] On the Robustness of Cooperative Multi-Agent Reinforcement Learning
Lin, Jieyu
Dzeparoska, Kristina
Zhang, Sai Qian
Leon-Garcia, Alberto
Papernot, Nicolas
[J]. 2020 IEEE SYMPOSIUM ON SECURITY AND PRIVACY WORKSHOPS (SPW 2020), 2020, : 62 - 68
[5] Consensus Learning for Cooperative Multi-Agent Reinforcement Learning
Xu, Zhiwei
Zhang, Bin
Li, Dapeng
Zhang, Zeren
Zhou, Guangchong
Chen, Hao
Fan, Guoliang
[J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10, 2023, : 11726 - 11734
[6] Multi-Agent Reinforcement Learning With Distributed Targeted Multi-Agent Communication
Xu, Chi
Zhang, Hui
Zhang, Ya
[J]. 2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 2915 - 2920
[7] Mixed Cooperative-Competitive Communication Using Multi-agent Reinforcement Learning
Vanneste, Astrid
Van Wijnsberghe, Wesley
Vanneste, Simon
Mets, Kevin
Mercelis, Siegfried
Latre, Steven
Hellinckx, Peter
[J]. ADVANCES ON P2P, PARALLEL, GRID, CLOUD AND INTERNET COMPUTING, 3PGCIC-2021, 2022, 343 : 197 - 206
[8] Learning structured communication for multi-agent reinforcement learning
Sheng, Junjie
Wang, Xiangfeng
Jin, Bo
Yan, Junchi
Li, Wenhao
Chang, Tsung-Hui
Wang, Jun
Zha, Hongyuan
[J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2022, 36 (02)
[9] Learning structured communication for multi-agent reinforcement learning
Junjie Sheng
Xiangfeng Wang
Bo Jin
Junchi Yan
Wenhao Li
Tsung-Hui Chang
Jun Wang
Hongyuan Zha
[J]. Autonomous Agents and Multi-Agent Systems, 2022, 36
[10] Learning Cooperative Intrinsic Motivation in Multi-Agent Reinforcement Learning
Hong, Seung-Jin
Lee, Sang-Kwang
[J]. 12TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC 2021): BEYOND THE PANDEMIC ERA WITH ICT CONVERGENCE INNOVATION, 2021, : 1697 - 1699

← 1 2 3 4 5 →