A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement Learning

Cited by: 2
Authors
Fu, Qingxu [1, 2]
Qiu, Tenghai [1, 2]
Pu, Zhiqiang [1, 2]
Yi, Jianqiang [1, 2]
Yuan, Wanmai [3]
Affiliations
[1] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
[3] Corp Informat Sci Acad China, Elect Technol Grp, Beijing, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
multiagent system; reinforcement learning; sparse reward;
DOI
10.1109/IJCNN55064.2022.9891991
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Multiagent reinforcement learning (MARL) can solve complex cooperative tasks. However, the efficiency of existing MARL methods relies heavily on well-defined reward functions. Multiagent tasks with sparse reward feedback are especially challenging, not only because of the credit distribution problem but also because of the low probability of obtaining positive reward feedback. In this paper, we design a graph network called the Cooperation Graph (CG). The Cooperation Graph combines two simple bipartite graphs: the Agent Clustering subgraph (ACG) and the Cluster Designating subgraph (CDG). Based on this novel graph structure, we propose a Cooperation Graph Multiagent Reinforcement Learning (CG-MARL) algorithm that efficiently handles the sparse reward problem in multiagent tasks. In CG-MARL, agents are directly controlled by the Cooperation Graph, and a policy neural network is trained to manipulate this graph, implicitly guiding agents toward cooperation. This hierarchical structure leaves room for customized cluster-actions, which serve as an extensible interface for introducing fundamental cooperation knowledge. In experiments, CG-MARL shows state-of-the-art performance on sparse reward multiagent benchmarks, including the anti-invasion interception task and the multi-cargo delivery task.
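The two-subgraph structure named in the abstract can be pictured with a small sketch. The abstract gives no implementation details, so everything below is an assumption made for illustration: the class name `CooperationGraph`, the methods `move_agent`, `designate`, and `agent_targets`, the choice to store each bipartite subgraph as a list of edge endpoints, and the restriction that each agent joins exactly one cluster and each cluster is designated to exactly one target. It is not the authors' implementation, and no policy network is included; the two edit methods only stand in for the kind of graph manipulation a trained policy might output.

```python
from dataclasses import dataclass, field


@dataclass
class CooperationGraph:
    """Toy stand-in for a Cooperation Graph (hypothetical representation)."""
    n_agents: int
    n_clusters: int
    n_targets: int
    # ACG edges: acg[agent] = cluster. Assumption: one cluster per agent.
    acg: list = field(default_factory=list)
    # CDG edges: cdg[cluster] = target. Assumption: one target per cluster.
    cdg: list = field(default_factory=list)

    def __post_init__(self):
        # Default wiring: spread agents and clusters round-robin.
        if not self.acg:
            self.acg = [a % self.n_clusters for a in range(self.n_agents)]
        if not self.cdg:
            self.cdg = [c % self.n_targets for c in range(self.n_clusters)]

    def move_agent(self, agent, cluster):
        """Rewire one ACG edge (a graph edit a policy could emit)."""
        self.acg[agent] = cluster

    def designate(self, cluster, target):
        """Rewire one CDG edge (a graph edit a policy could emit)."""
        self.cdg[cluster] = target

    def agent_targets(self):
        """Follow agent -> cluster -> target to get each agent's target."""
        return [self.cdg[cluster] for cluster in self.acg]


cg = CooperationGraph(n_agents=4, n_clusters=2, n_targets=3)
cg.move_agent(1, 0)   # agent 1 joins cluster 0
cg.designate(0, 2)    # cluster 0 is sent to target 2
targets = cg.agent_targets()
```

In this sketch the agents never choose targets themselves; their targets follow from the two edge maps, which mirrors the abstract's description of agents being controlled by the graph while a policy edits the graph.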
Pages: 8
Related papers
50 in total
  • [41] Robust Reward-Free Actor-Critic for Cooperative Multiagent Reinforcement Learning
    Lin, Qifeng
    Ling, Qing
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023: 1 - 12
  • [42] Cascaded Attention: Adaptive and Gated Graph Attention Network for Multiagent Reinforcement Learning
    Qi, Shuhan
    Huang, Xinhao
    Peng, Peixi
    Huang, Xuzhong
    Zhang, Jiajia
    Wang, Xuan
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3769 - 3779
  • [43] Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach
    Zhang, Yudi
    Du, Yali
    Huang, Biwei
    Wang, Ziyan
    Wang, Jun
    Fang, Meng
    Pechenizkiy, Mykola
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
  • [44] A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning
    Lanctot, Marc
    Zambaldi, Vinicius
    Gruslys, Audrunas
    Lazaridou, Angeliki
    Tuyls, Karl
    Perolat, Julien
    Silver, David
    Graepel, Thore
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [45] A HYBRID MULTIAGENT REINFORCEMENT LEARNING APPROACH USING STRATEGIES AND FUSION
    Partalas, Ioannis
    Feneris, Ioannis
    Vlahavas, Ioannis
    [J]. INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2008, 17 (05) : 945 - 962
  • [46] A Multiagent Reinforcement Learning Approach for Wind Farm Frequency Control
    Liang, Yanchang
    Zhao, Xiaowei
    Sun, Li
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (02) : 1725 - 1734
  • [47] Graph Partitioning and Sparse Matrix Ordering using Reinforcement Learning and Graph Neural Networks
    Gatti, Alice
    Hu, Zhixiong
    Smidt, Tess
    Ng, Esmond G.
    Ghysels, Pieter
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23
  • [49] Asymmetric multiagent reinforcement learning
    Könönen, V
    [J]. IEEE/WIC INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2003: 336 - 342
  • [50] Exploration-Guided Reward Shaping for Reinforcement Learning under Sparse Rewards
    Devidze, Rati
    Kamalaruban, Parameswaran
    Singla, Adish
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022