Cooperative multi-agent game based on reinforcement learning

被引：0

作者：

Liu, Hongbo ^{[1
]}

机构：

[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110819, Peoples R China

来源：

HIGH-CONFIDENCE COMPUTING | 2024年 / 4卷 / 01期

关键词：

Collaborative multi-agent; Reinforcement learning; Credit distribution; Multi-agent communication; Reward shaping;

D O I：

10.1016/j.hcc.2024.100205

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Multi-agent reinforcement learning holds tremendous potential for revolutionizing intelligent systems across diverse domains. However, it is also concomitant with a set of formidable challenges, which include the effective allocation of credit values to each agent, real-time collaboration among heterogeneous agents, and an appropriate reward function to guide agent behavior. To handle these issues, we propose an innovative solution named the Graph Attention Counterfactual Multiagent Actor-Critic algorithm (GACMAC). This algorithm encompasses several key components: First, it employs a multiagent actor-critic framework along with counterfactual baselines to assess the individual actions of each agent. Second, it integrates a graph attention network to enhance real-time collaboration among agents, enabling heterogeneous agents to effectively share information during handling tasks. Third, it incorporates prior human knowledge through a potential-based reward shaping method, thereby elevating the convergence speed and stability of the algorithm. We tested our algorithm on the StarCraft Multi-Agent Challenge (SMAC) platform, which is a recognized platform for testing multiagent algorithms, and our algorithm achieved a win rate of over 95% on the platform, comparable to the current state-of-the-art multi-agent controllers. (c) 2024 The Author(s). Published by Elsevier B.V. on behalf of Shandong University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

引用

页数：11

共 50 条

[31] Reinforcement learning of coordination in cooperative multi-agent systems
Kapetanakis, S
Kudenko, D
EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, 2002, : 326 - 331
[32] Training Cooperative Agents for Multi-Agent Reinforcement Learning
Bhalla, Sushrut
Subramanian, Sriram G.
Crowley, Mark
AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1826 - 1828
[33] Evolutionary game theory and multi-agent reinforcement learning
Tuyls, K
Nowé, A
KNOWLEDGE ENGINEERING REVIEW, 2005, 20 (01): : 63 - 90
[34] A reinforcement learning scheme for a multi-agent card game
Fujita, H
Matsuno, Y
Ishii, S
2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS, 2003, : 4071 - 4078
[35] Multi-Agent Reinforcement Learning for a Random Access Game
Lee, Dongwoo
Zhao, Yu
Seo, Jun-Bae
Lee, Joohyun
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (08) : 9119 - 9124
[36] Cooperative Reinforcement Learning Algorithm to Distributed Power System Based on Multi-Agent
Gao, La-mei
Zeng, Jun
Wu, Jie
Li, Min
2009 3RD INTERNATIONAL CONFERENCE ON POWER ELECTRONICS SYSTEMS AND APPLICATIONS: ELECTRIC VEHICLE AND GREEN ENERGY, 2009, : 53 - 53
[37] Inference-based Hierarchical Reinforcement Learning for Cooperative Multi-agent Navigation
Xia, Lijun
Yu, Chao
Wu, Zifan
2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 57 - 64
[38] Pursuit and evasion game between UVAs based on multi-agent reinforcement learning
Xu, Guangyan
Zhao, Yang
Liu, Hao
2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 1261 - 1266
[39] Learning Distinct Strategies for Heterogeneous Cooperative Multi-agent Reinforcement Learning
Wan, Kejia
Xu, Xinhai
Li, Yuan
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT IV, 2021, 12894 : 544 - 555
[40] Pacesetter Learning for Large Scale Cooperative Multi-Agent Reinforcement Learning
Zhou, Pingqi
Li, Chao
Qiu, Mengwei
Liu, Jun
Ma, Chennan
Yan, Ming
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VI, 2023, 14259 : 115 - 126

← 1 2 3 4 5 →