Hierarchical relationship modeling in multi-agent reinforcement learning for mixed cooperative-competitive environments

被引:1
|
作者
Xie, Shaorong [1 ]
Li, Yang [1 ]
Wang, Xinzhi [1 ]
Zhang, Han [1 ]
Zhang, Zhenyu [1 ]
Luo, Xiangfeng [1 ]
Yu, Hang [1 ,2 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, 99 Shangda Rd, Shanghai 200444, Peoples R China
[2] Univ Technol Sydney, Fac Engn & Informat Technol, POB 123,Broadway, Sydney, NSW 2007, Australia
基金
中国国家自然科学基金;
关键词
Multi-agent reinforcement learning; Mixed cooperative-competitive environment; Graph neural networks; Relationship modeling; SYSTEMS; GAME; GO;
D O I
10.1016/j.inffus.2024.102318
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In multi-agent reinforcement learning (MARL), information fusion through relationship modeling can effectively learn behavior strategies. However, the high dynamics among heterogeneous interactive agents in mixed cooperative-competitive environments pose difficulties for relational modeling. Traditional MARL solutions concatenate all agents' states based on the global relationship, which is unrealistic and unscalable under largescale conditions. Other methods fuse local information by modeling neighbor relationships, but local features lead to suboptimal strategies. From this perspective, we propose a novel relational information fusion approach for mixed tasks to fuse local and global features by modeling heterogeneous relationships through a hierarchical graph. During training, remote agents' global features are fused through second-order graph representation to help strategy optimization. During decision making, the practicality and scalability of strategy are improved by fusing neighbor agents' local features through first-order graph representation. Our approach consistently outperforms the state-of-the-art MARL methods in several multi-agent tasks, such as the Predator-Prey and Soccer Games. In particular, it achieves an 83.7% win rate, which is 11.5% higher than baselines in the 4 vs. 4 Soccer Game, and can scale from 4 vs. 4 to 13 vs. 13 in the Predator-Prey Game while maintaining good performance.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Bias Estimation Correction in Multi-Agent Reinforcement Learning for Mixed Cooperative-Competitive Environments
    Sarkar T.
    Kalita S.
    [J]. SN Computer Science, 5 (1)
  • [2] Scalable and Transferable Reinforcement Learning for Multi-Agent Mixed Cooperative-Competitive Environments Based on Hierarchical Graph Attention
    Chen, Yining
    Song, Guanghua
    Ye, Zhenhui
    Jiang, Xiaohong
    [J]. ENTROPY, 2022, 24 (04)
  • [3] Mixed Cooperative-Competitive Communication Using Multi-agent Reinforcement Learning
    Vanneste, Astrid
    Van Wijnsberghe, Wesley
    Vanneste, Simon
    Mets, Kevin
    Mercelis, Siegfried
    Latre, Steven
    Hellinckx, Peter
    [J]. ADVANCES ON P2P, PARALLEL, GRID, CLOUD AND INTERNET COMPUTING, 3PGCIC-2021, 2022, 343 : 197 - 206
  • [4] Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
    Lowe, Ryan
    Wu, Yi
    Tamar, Aviv
    Harb, Jean
    Abbeel, Pieter
    Mordatch, Igor
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [5] Demand response model: A cooperative-competitive multi-agent reinforcement learning approach
    Salazar, Eduardo J.
    Rosero, Veronica
    Gabrielski, Jawana
    Samper, Mauricio E.
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [6] Bottom-up multi-agent reinforcement learning by reward shaping for cooperative-competitive tasks
    Aotani, Takumi
    Kobayashi, Taisuke
    Sugimoto, Kenji
    [J]. APPLIED INTELLIGENCE, 2021, 51 (07) : 4434 - 4452
  • [7] Bottom-up multi-agent reinforcement learning by reward shaping for cooperative-competitive tasks
    Takumi Aotani
    Taisuke Kobayashi
    Kenji Sugimoto
    [J]. Applied Intelligence, 2021, 51 : 4434 - 4452
  • [8] A Novel Multi-Agent Parallel-Critic Network Architecture for Cooperative-Competitive Reinforcement Learning
    Sun, Yu
    Lai, Jun
    Cao, Lei
    Chen, Xiliang
    Xu, Zhixiong
    Xu, Yue
    [J]. IEEE ACCESS, 2020, 8 : 135605 - 135616
  • [9] Decentralized Counterfactual Value with Threat Detection for Multi-Agent Reinforcement Learning in mixed cooperative and competitive environments
    Dong, Shaokang
    Li, Chao
    Yang, Shangdong
    Li, Wenbin
    Gao, Yang
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 257
  • [10] Centralized reinforcement learning for multi-agent cooperative environments
    Chengxuan Lu
    Qihao Bao
    Shaojie Xia
    Chongxiao Qu
    [J]. Evolutionary Intelligence, 2024, 17 : 267 - 273