VGN: Value Decomposition With Graph Attention Networks for Multiagent Reinforcement Learning

Cited by: 12
Authors
Wei, Qinglai [1, 2, 3]
Li, Yugu [1]
Zhang, Jie [1]
Wang, Fei-Yue [1, 3, 4]
Affiliations
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[3] Macau Univ Sci & Technol, Inst Syst Engn, Macau 999078, Peoples R China
[4] Qingdao Acad Intelligent Ind, Qingdao 266109, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Mathematical models; Task analysis; Games; Q-learning; Neural networks; Behavioral sciences; Training; Deep learning; graph attention networks (GATs); multiagent systems; reinforcement learning;
DOI
10.1109/TNNLS.2022.3172572
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Value decomposition networks and the follow-up value-based studies factorize the joint reward function into individual reward functions for a class of cooperative multiagent reinforcement learning problems in which each agent has its own local observation and all agents share a joint reward signal. Most of these earlier efforts, however, ignore the graphical information between agents. In this article, a new value decomposition with graph attention network (VGN) method is developed to solve the value functions by introducing the dynamical relationships between agents. In this approach, the decomposition factor of an agent can be influenced by the reward signals of all related agents, and two graph neural network-based algorithms (VGN-Linear and VGN-Nonlinear) are designed to solve the value function of each agent. It can be proved theoretically that the proposed methods satisfy the factorizable condition in the centralized training process. The performance of the proposed methods is evaluated on the StarCraft Multiagent Challenge (SMAC) benchmark. Experimental results show that the proposed methods outperform state-of-the-art value-based multiagent reinforcement learning algorithms, especially on tasks rated very hard that are challenging for existing methods.
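The factorizable condition mentioned in the abstract can be illustrated with a toy example. The sketch below is not the VGN-Linear or VGN-Nonlinear architecture from the article. It assumes a simple linear mixer whose weights come from a row-wise softmax over a fixed agent graph, and all names and numbers (`attention_weights`, `joint_q`, `scores`, `adjacency`, `agent_qs`) are hypothetical. It only shows that when every mixing weight is non-negative and each agent has a self-loop, the joint greedy action coincides with the per-agent greedy actions.

```python
import itertools
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def attention_weights(scores, adjacency):
    """Row-wise softmax restricted to each agent's neighbours.

    scores[i][j] is a hypothetical attention logit from agent i to agent j.
    Non-neighbours get weight 0, so every weight is non-negative.
    """
    weights = []
    for i, row in enumerate(adjacency):
        neighbours = [j for j, connected in enumerate(row) if connected]
        probs = softmax([scores[i][j] for j in neighbours])
        full = [0.0] * len(row)
        for j, p in zip(neighbours, probs):
            full[j] = p
        weights.append(full)
    return weights


def joint_q(agent_qs, actions, weights):
    """Assumed linear mixer: Q_tot = sum_i sum_j w[i][j] * Q_j(a_j)."""
    chosen = [agent_qs[j][actions[j]] for j in range(len(agent_qs))]
    return sum(w * chosen[j] for row in weights for j, w in enumerate(row))


# Hypothetical toy problem: 3 agents, 2 actions each, chain graph with self-loops.
agent_qs = [[1.0, 3.0], [2.0, 0.5], [0.1, 0.4]]
adjacency = [[1, 1, 0], [1, 1, 1], [0, 1, 1]]
scores = [[0.2, 1.0, 0.0], [0.5, 0.1, 0.3], [0.0, 0.7, 0.9]]

weights = attention_weights(scores, adjacency)

# Decentralized choice: each agent maximizes its own Q-values.
greedy = tuple(max(range(len(q)), key=q.__getitem__) for q in agent_qs)

# Centralized choice: brute-force search over all joint actions.
best_joint = max(
    itertools.product(range(2), repeat=len(agent_qs)),
    key=lambda a: joint_q(agent_qs, a, weights),
)

print(greedy, best_joint)
```

Because the mixer is a non-negative weighted sum with a strictly positive total weight on every agent, raising any single agent's chosen Q-value can only raise the joint value, which is why the two searches agree in this toy setting.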
Pages: 182 - 195
Page count: 14
Related papers
50 in total
  • [21] Solving uncapacitated P-Median problem with reinforcement learning assisted by graph attention networks
    Wang, Chenguang
    Han, Congying
    Guo, Tiande
    Ding, Man
    [J]. APPLIED INTELLIGENCE, 2023, 53 (02) : 2010 - 2025
  • [22] Solving uncapacitated P-Median problem with reinforcement learning assisted by graph attention networks
    Chenguang Wang
    Congying Han
    Tiande Guo
    Man Ding
    [J]. Applied Intelligence, 2023, 53 : 2010 - 2025
  • [23] Controlling Graph Dynamics with Reinforcement Learning and Graph Neural Networks
    Meirom, Eli A.
    Maron, Haggai
    Mannor, Shie
    Chechik, Gal
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [24] Multiagent reinforcement learning through merging individually learned value functions
    张化祥
    黄上腾
    [J]. Journal of Harbin Institute of Technology (New Series), 2005, (03) : 346 - 350
  • [25] Learning Graph Topology Representation with Attention Networks
    Qi, Yuanyuan
    Zhang, Jiayue
    Xu, Weiran
    Guo, Jun
    Zhang, Honggang
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 1 - 4
  • [26] SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multiagent Reinforcement Learning
    Yao, Xinghu
    Wen, Chao
    Wang, Yuhui
    Tan, Xiaoyang
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (01) : 52 - 63
  • [27] A Local-and-Global Attention Reinforcement Learning Algorithm for Multiagent Cooperative Navigation
    Song, Chunwei
    He, Zichen
    Dong, Lu
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (06) : 7767 - 7777
  • [28] Heterogeneous-graph Attention Reinforcement Learning for Football Matches
    Wang, Shijie
    Pan, Yi
    Pu, Zhiqiang
    Yi, Jianqiang
    Liang, Yanyan
    Zhang, Du
    [J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [29] Asymmetric multiagent reinforcement learning
    Könönen, V
    [J]. IEEE/WIC INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2003, : 336 - 342
  • [30] AVD-Net: Attention Value Decomposition Network For Deep Multi-Agent Reinforcement Learning
    Zhang, Yuanxin
    Ma, Huimin
    Wang, Yu
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7810 - 7816