A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement Learning

Cited by: 2
Authors
Fu, Qingxu [1, 2]
Qiu, Tenghai [1, 2]
Pu, Zhiqiang [1, 2]
Yi, Jianqiang [1, 2]
Yuan, Wanmai [3]
Affiliations
[1] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
[3] Corp Informat Sci Acad China, Elect Technol Grp, Beijing, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
multiagent system; reinforcement learning; sparse reward;
DOI
10.1109/IJCNN55064.2022.9891991
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Multiagent reinforcement learning (MARL) can solve complex cooperative tasks. However, the efficiency of existing MARL methods relies heavily on well-defined reward functions. Multiagent tasks with sparse reward feedback are especially challenging, not only because of the credit distribution problem but also because positive reward feedback is obtained with low probability. In this paper, we design a graph network called the Cooperation Graph (CG). The Cooperation Graph combines two simple bipartite graphs: the Agent Clustering subgraph (ACG) and the Cluster Designating subgraph (CDG). Based on this novel graph structure, we propose the Cooperation Graph Multiagent Reinforcement Learning (CG-MARL) algorithm, which can efficiently deal with the sparse reward problem in multiagent tasks. In CG-MARL, agents are directly controlled by the Cooperation Graph, and a policy neural network is trained to manipulate this graph, guiding agents to achieve cooperation in an implicit way. This hierarchical feature of CG-MARL provides space for customized cluster-actions, an extensible interface for introducing fundamental cooperation knowledge. In experiments, CG-MARL shows state-of-the-art performance on sparse reward multiagent benchmarks, including the anti-invasion interception task and the multi-cargo delivery task.
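The two-layer structure described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes, for simplicity, that each agent belongs to exactly one cluster (the ACG) and that each cluster is designated exactly one target (the CDG), which the abstract does not state. The function names `compose_cooperation_graph` and `rewire_agent` and the example data are hypothetical.

```python
# Illustrative sketch only. Assumption: one cluster per agent, one target per cluster.

def compose_cooperation_graph(agent_to_cluster, cluster_to_target):
    """Follow agent -> cluster -> target edges and return one target per agent."""
    return [cluster_to_target[cluster] for cluster in agent_to_cluster]


def rewire_agent(agent_to_cluster, agent, new_cluster):
    """Return a copy of the agent-clustering edges with one agent moved.

    This stands in for a single graph-manipulating action that a trained
    policy might choose (hypothetical action format).
    """
    updated = list(agent_to_cluster)
    updated[agent] = new_cluster
    return updated


# Hypothetical example: 5 agents, 3 clusters, 3 targets.
agent_to_cluster = [0, 0, 1, 1, 2]   # ACG edges: agent index -> cluster index
cluster_to_target = [2, 0, 1]        # CDG edges: cluster index -> target index

agent_targets = compose_cooperation_graph(agent_to_cluster, cluster_to_target)

# Move agent 4 into cluster 0; its target changes without addressing it directly.
rewired = rewire_agent(agent_to_cluster, agent=4, new_cluster=0)
rewired_targets = compose_cooperation_graph(rewired, cluster_to_target)
```

In this simplified picture, a policy acts on the edges of the two bipartite graphs rather than on individual agents, so changing one cluster-to-target edge redirects every agent in that cluster at once.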
Pages: 8