Deep Hierarchical Communication Graph in Multi-Agent Reinforcement Learning

被引:0
|
作者
Liu, Zeyang [1 ,2 ]
Wan, Lipeng [1 ,2 ]
Sui, Xue [1 ,2 ]
Chen, Zhuoran [1 ,2 ]
Sun, Kewu [3 ]
Lan, Xuguang [1 ,2 ]
机构
[1] Xi An Jiao Tong Univ, Natl Key Lab Human Machine Hybrid Augmented Intel, Natl Engn Res Ctr Visual Informat & Applicat, Xian, Peoples R China
[2] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian, Peoples R China
[3] Intelligent Sci & Technol Acad, Xian, Peoples R China
基金
国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sharing intentions is crucial for efficient cooperation in communication-enabled multi-agent reinforcement learning. Recent work applies static or undirected graphs to determine the order of interaction. However, the static graph is not general for complex cooperative tasks, and the parallel message-passing update in the undirected graph with cycles cannot guarantee convergence. To solve this problem, we propose Deep Hierarchical Communication Graph (DHCG) to learn the dependency relationships between agents based on their messages. The relationships are formulated as directed acyclic graphs (DAGs), where the selection of the proper topology is viewed as an action and trained in an end-to-end fashion. To eliminate the cycles in the graph, we apply an acyclicity constraint as intrinsic rewards and then project the graph in the admissible solution set of DAGs. As a result, DHCG removes redundant communication edges for cost improvement and guarantees convergence. To show the effectiveness of the learned graphs, we propose policy-based and value-based DHCG. Policy-based DHCG factorizes the joint policy in an auto-regressive manner, and value-based DHCG factorizes the joint value function to individual value functions and pairwise payoff functions. Empirical results show that our method improves performance across various cooperative multi-agent tasks, including Predator-Prey, Multi-Agent Coordination Challenge, and StarCraft Multi-Agent Challenge.
引用
收藏
页码:208 / 216
页数:9
相关论文
共 50 条
  • [1] When Does Communication Learning Need Hierarchical Multi-Agent Deep Reinforcement Learning
    Ossenkopf, Marie
    Jorgensen, Mackenzie
    Geihs, Kurt
    [J]. CYBERNETICS AND SYSTEMS, 2019, 50 (08) : 672 - 692
  • [2] MAGNet: Multi-agent Graph Network for Deep Multi-agent Reinforcement Learning
    Malysheva, Aleksandra
    Kudenko, Daniel
    Shpilman, Aleksei
    [J]. 2019 XVI INTERNATIONAL SYMPOSIUM PROBLEMS OF REDUNDANCY IN INFORMATION AND CONTROL SYSTEMS (REDUNDANCY), 2019, : 171 - 176
  • [3] Multi-Agent Deep Reinforcement Learning with Emergent Communication
    Simoes, David
    Lau, Nuno
    Reis, Luis Paulo
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [4] Multi-agent deep reinforcement learning with type-based hierarchical group communication
    Hao Jiang
    Dianxi Shi
    Chao Xue
    Yajie Wang
    Gongju Wang
    Yongjun Zhang
    [J]. Applied Intelligence, 2021, 51 : 5793 - 5808
  • [5] Multi-agent deep reinforcement learning with type-based hierarchical group communication
    Jiang, Hao
    Shi, Dianxi
    Xue, Chao
    Wang, Yajie
    Wang, Gongju
    Zhang, Yongjun
    [J]. APPLIED INTELLIGENCE, 2021, 51 (08) : 5793 - 5808
  • [6] Hierarchical multi-agent reinforcement learning
    Mohammad Ghavamzadeh
    Sridhar Mahadevan
    Rajbala Makar
    [J]. Autonomous Agents and Multi-Agent Systems, 2006, 13 : 197 - 229
  • [7] Hierarchical multi-agent reinforcement learning
    Ghavamzadeh, Mohammad
    Mahadevan, Sridhar
    Makar, Rajbala
    [J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2006, 13 (02) : 197 - 229
  • [8] Hierarchical graph multi-agent reinforcement learning for traffic signal control
    Yang, Shantian
    [J]. INFORMATION SCIENCES, 2023, 634 : 55 - 72
  • [9] Learning multi-agent communication with double attentional deep reinforcement learning
    Mao, Hangyu
    Zhang, Zhengchao
    Xiao, Zhen
    Gong, Zhibo
    Ni, Yan
    [J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2020, 34 (01)
  • [10] Learning multi-agent communication with double attentional deep reinforcement learning
    Hangyu Mao
    Zhengchao Zhang
    Zhen Xiao
    Zhibo Gong
    Yan Ni
    [J]. Autonomous Agents and Multi-Agent Systems, 2020, 34