ASN: action semantics network for multiagent reinforcement learning

被引:2
|
作者
Yang, Tianpei [1 ,2 ,3 ]
Wang, Weixun [4 ]
Hao, Jianye [1 ,5 ]
Taylor, Matthew E. [2 ,3 ]
Liu, Yong [6 ]
Hao, Xiaotian [1 ]
Hu, Yujing [4 ]
Chen, Yingfeng [4 ]
Fan, Changjie [4 ]
Ren, Chunxu [4 ]
Huang, Ye [4 ]
Zhu, Jiangcheng [5 ]
Gao, Yang [6 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
[2] Univ Alberta, Dept Comp Sci, Edmonton, AB, Canada
[3] Alberta Machine Intelligence Inst Amii, Edmonton, AB, Canada
[4] Fuxi AI Lab, NetEase, Hangzhou, Peoples R China
[5] Huawei, Shenzhen, Peoples R China
[6] Nanjing Univ, Nanjing, Peoples R China
基金
加拿大自然科学与工程研究理事会; 中国国家自然科学基金;
关键词
Multiagent reinforcement learning; Multiagent coordination; Deep reinforcement learning;
D O I
10.1007/s10458-023-09628-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In multiagent systems (MASs), each agent makes individual decisions but all contribute globally to the system's evolution. Learning in MASs is difficult since each agent's selection of actions must take place in the presence of other co-learning agents. Moreover, the environmental stochasticity and uncertainties increase exponentially with the number of agents. Previous works borrow various multiagent coordination mechanisms for use in deep learning architectures to facilitate multiagent coordination. However, none of them explicitly consider that different actions can have different influence on other agents, which we call the action semantics. In this paper, we propose a novel network architecture, named Action Semantics Network (ASN), that explicitly represents such action semantics between agents. ASN characterizes different actions' influence on other agents using neural networks based on the action semantics between them. ASN can be easily combined with existing deep reinforcement learning (DRL) algorithms to boost their performance. Experimental results on StarCraft II micromanagement and Neural MMO show that ASN significantly improves the performance of state-of-the-art DRL approaches, compared with several other network architectures. We also successfully deploy ASN to a popular online MMORPG game called Justice Online, which indicates a promising future for ASN to be applied in even more complex scenarios.
引用
收藏
页数:37
相关论文
共 50 条
  • [1] ASN: action semantics network for multiagent reinforcement learning
    Tianpei Yang
    Weixun Wang
    Jianye Hao
    Matthew E. Taylor
    Yong Liu
    Xiaotian Hao
    Yujing Hu
    Yingfeng Chen
    Changjie Fan
    Chunxu Ren
    Ye Huang
    Jiangcheng Zhu
    Yang Gao
    Autonomous Agents and Multi-Agent Systems, 2023, 37
  • [2] Multiagent Reinforcement Learning With Heterogeneous Graph Attention Network
    Du, Wei
    Ding, Shifei
    Zhang, Chenglong
    Shi, Zhongzhi
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) : 6851 - 6860
  • [3] A Hetero-Relation Transformer Network for Multiagent Reinforcement Learning
    Park, Junho
    Yoon, Sukmin
    Kim, Yong-Duk
    IEEE TRANSACTIONS ON GAMES, 2025, 17 (01) : 138 - 147
  • [4] Distributed response to network intrusions using multiagent reinforcement learning
    Malialis, Kleanthis
    Kudenko, Daniel
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 41 : 270 - 284
  • [5] Distributed Multiagent Reinforcement Learning With Action Networks for Dynamic Economic Dispatch
    Hu, Chengfang
    Wen, Guanghui
    Wang, Shuai
    Fu, Junjie
    Yu, Wenwu
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (07) : 9553 - 9564
  • [6] Transformer-basedWorking Memory for Multiagent Reinforcement Learning with Action Parsing
    Yang, Yaodong
    Chen, Guangyong
    Wang, Weixun
    Hao, Xiaotian
    Hao, Jianye
    Heng, Pheng Ann
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [7] Asymmetric multiagent reinforcement learning
    Könönen, V
    IEEE/WIC INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2003, : 336 - 342
  • [8] Mean-Field Multiagent Reinforcement Learning: A Decentralized Network Approach
    Gu, Haotian
    Guo, Xin
    Wei, Xiaoli
    Xu, Renyuan
    MATHEMATICS OF OPERATIONS RESEARCH, 2025, 50 (01) : 506 - 536
  • [9] Virtual Network Embedding Based on Hierarchical Cooperative Multiagent Reinforcement Learning
    Lim, Hyun-Kyo
    Ullah, Ihsan
    Kim, Ju-Bong
    Han, Youn-Hee
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (05): : 8552 - 8568
  • [10] A Study on Cooperative Action Selection Considering Unfairness in Decentralized Multiagent Reinforcement Learning
    Matsui, Toshihiro
    Matsuo, Hiroshi
    ICAART: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1, 2017, : 88 - 95