Learning cooperative strategies in StarCraft through role-based monotonic value function factorization

被引:0
|
作者
Han, Kun [1 ]
Jiang, Feng [1 ,2 ]
Zhu, Haiqi [1 ]
Shao, Mengxuan [1 ]
Yan, Ruyu [3 ]
机构
[1] Harbin Inst Technol, Fac Comp, Harbin 150000, Peoples R China
[2] Harbin Inst Technol, Sch Med & Hlth, Harbin 150000, Peoples R China
[3] Harbin Inst Technol, Sch Management, Harbin 150000, Peoples R China
来源
ELECTRONIC RESEARCH ARCHIVE | 2024年 / 32卷 / 02期
基金
中国国家自然科学基金;
关键词
Q-learning; multi-agent reinforcement learning; machine learning; artificial intelligence; StarCraft multi-agent challenge;
D O I
10.3934/era.2024037
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
StarCraft is a popular real-time strategy game that has been widely used as a research platform for artificial intelligence. Micromanagement refers to the process of making each unit perform appropriate actions separately, depending on the current state in the the multi-agent system comprising all of the units, i.e., the fine-grained control of individual units for common benefit. Therefore, cooperation between different units is crucially important to improve the joint strategy. We have selected multi-agent deep reinforcement learning to tackle the problem of micromanagement. In this paper, we propose a method for learning cooperative strategies in StarCraft based on role-based montonic value function factorization (RoMIX). RoMIX learns roles based on the potential impact of each agent on the multi-agent task; it then represents the action value of a role in a mixed way based on monotonic value function factorization. The final value is calculated by accumulating the action value of all roles. The role-based learning improves the cooperation between agents on the team, allowing them to learn the joint strategy more quickly and efficiently. In addition, RoMIX can also reduce storage resources to a certain extent. Experiments show that RoMIX can not only solve easy tasks, but it can also learn better cooperation strategies for more complex and difficult tasks.
引用
收藏
页码:779 / 798
页数:20
相关论文
共 50 条
  • [1] Towards an Adaptive Regulation Scaffolding through Role-based Strategies
    Krishna, Sooraj
    Pelachaud, Catherine
    Kappas, Arvid
    [J]. PROCEEDINGS OF THE 19TH ACM INTERNATIONAL CONFERENCE ON INTELLIGENT VIRTUAL AGENTS (IVA' 19), 2019, : 263 - 266
  • [2] Research on role-based learning technologies
    Slator, BM
    Clark, J
    Juell, P
    McClean, P
    Saini-Eidukat, B
    Schwert, DP
    White, AR
    [J]. IEEE INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES, PROCEEDINGS, 2001, : 37 - 40
  • [3] Role-based autonomous and collaborative mechanism for cooperative behavior
    Sakashita, Y
    Ideguchi, T
    Sato, F
    Mizuno, T
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2000, E83D (06) : 1255 - 1265
  • [4] Cooperative automated lane merge with role-based negotiation
    Eiermann, Lucas
    Sawade, Oliver
    Bunk, Sebastian
    Breuel, Gabi
    Radusch, Ilja
    [J]. 2020 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2020, : 495 - 501
  • [5] The power of role-based e-learning
    Smyth, Robyn
    [J]. BRITISH JOURNAL OF EDUCATIONAL TECHNOLOGY, 2012, 43 (01) : E40 - E41
  • [6] ConcaveQ: Non-monotonic Value Function Factorization via Concave Representations in Deep Multi-Agent Reinforcement Learning
    Li, Huiqun
    Zhou, Hanhan
    Zou, Yifei
    Yu, Dongxiao
    Lan, Tian
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 17461 - 17468
  • [7] Automating Role-based Provisioning by Learning from Examples
    Ni, Qun
    Lobo, Jorge
    Calo, Seraphin
    Rohatgi, Pankaj
    Bertino, Elisa
    [J]. SACMAT'09: PROCEEDINGS OF THE 14TH ACM SYMPOSIUM ON ACCESS CONTROL MODELS AND TECHNOLOGIES, 2009, : 75 - 84
  • [8] From teacher-in-role to researcher-in-role: possibilities for repositioning children through role-based strategies in classroom research
    Aitken, V.
    [J]. RIDE-THE JOURNAL OF APPLIED THEATRE AND PERFORMANCE, 2014, 19 (03): : 255 - 271
  • [9] Role-based attention in deep reinforcement learning for games
    Yang, Dong
    Yang, Wenjing
    Li, Minglong
    Yang, Qiong
    [J]. COMPUTER ANIMATION AND VIRTUAL WORLDS, 2021, 32 (02)
  • [10] Role-based lateral movement detection with unsupervised learning
    Powell, Brian A.
    [J]. INTELLIGENT SYSTEMS WITH APPLICATIONS, 2022, 16