Learning cooperative strategies in StarCraft through role-based monotonic value function factorization

Cited by: 0

Authors:

Han, Kun [1]

Jiang, Feng [1,2]

Zhu, Haiqi [1]

Shao, Mengxuan [1]

Yan, Ruyu [3]

Affiliations:

[1] Harbin Inst Technol, Fac Comp, Harbin 150000, Peoples R China

[2] Harbin Inst Technol, Sch Med & Hlth, Harbin 150000, Peoples R China

[3] Harbin Inst Technol, Sch Management, Harbin 150000, Peoples R China

Source:

ELECTRONIC RESEARCH ARCHIVE | 2024 / Vol. 32 / Issue 02

Funding:

National Natural Science Foundation of China;

Keywords:

Q-learning; multi-agent reinforcement learning; machine learning; artificial intelligence; StarCraft multi-agent challenge;

DOI:

10.3934/era.2024037

Chinese Library Classification (CLC) number:

O1 [Mathematics];

Subject classification code:

0701 ; 070101 ;

Abstract:

StarCraft is a popular real-time strategy game that has been widely used as a research platform for artificial intelligence. Micromanagement refers to the process of making each unit perform appropriate actions separately, depending on the current state in the multi-agent system comprising all of the units, i.e., the fine-grained control of individual units for common benefit. Cooperation between different units is therefore crucial to improving the joint strategy. We have selected multi-agent deep reinforcement learning to tackle the problem of micromanagement. In this paper, we propose a method for learning cooperative strategies in StarCraft based on role-based monotonic value function factorization (RoMIX). RoMIX learns roles based on the potential impact of each agent on the multi-agent task; it then represents the action value of a role in a mixed way based on monotonic value function factorization. The final value is calculated by accumulating the action values of all roles. The role-based learning improves the cooperation between agents on the team, allowing them to learn the joint strategy more quickly and efficiently. In addition, RoMIX can also reduce storage resources to a certain extent. Experiments show that RoMIX can not only solve easy tasks, but it can also learn better cooperation strategies for more complex and difficult tasks.
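
The two-stage value computation described in the abstract (a monotonic mix within each role, then a sum over roles) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not specify the network architecture, so the single linear mixing layer, the fixed weights, the hard-coded role assignment, and the names `monotonic_mix`, `total_value`, `role_of_agent`, `role_weights`, and `role_biases` are all assumptions made for illustration. Monotonicity is enforced here by taking the absolute value of the mixing weights, a common way to keep each role value non-decreasing in every agent's action value.

```python
import numpy as np


def monotonic_mix(agent_qs, weights, bias):
    """Mix per-agent Q-values into one role value.

    Taking abs() of the weights keeps them non-negative, so the result
    never decreases when any single agent's Q-value increases.
    (Assumed single linear layer; a real mixer would be learned.)
    """
    return float(np.abs(weights) @ agent_qs + bias)


def total_value(agent_qs, role_of_agent, role_weights, role_biases):
    """Accumulate the mixed value of every role into one joint value."""
    total = 0.0
    for role, (w, b) in enumerate(zip(role_weights, role_biases)):
        members = np.flatnonzero(role_of_agent == role)  # agents in this role
        total += monotonic_mix(agent_qs[members], w, b)
    return total


# Hypothetical toy data: four agents split into two roles.
agent_qs = np.array([1.0, 2.0, 3.0, 4.0])
role_of_agent = np.array([0, 0, 1, 1])
role_weights = [np.array([0.5, -1.0]), np.array([2.0, 0.25])]
role_biases = [0.1, -0.2]

q_tot = total_value(agent_qs, role_of_agent, role_weights, role_biases)

# Raising one agent's Q-value must not lower the joint value.
bumped = agent_qs.copy()
bumped[1] += 1.0
q_tot_bumped = total_value(bumped, role_of_agent, role_weights, role_biases)
```

Because every weight is non-negative and the role values are simply summed, an agent that greedily maximizes its own action value also does not lower the joint value, which is the property monotonic factorization relies on for decentralized action selection.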


Pages: 779-798

Number of pages: 20
