Heterogeneous Skill Learning for Multi-agent Tasks

被引:0
|
作者
Liu, Yuntao [1 ]
Li, Yuan [1 ]
Xu, Xinhai [1 ]
Dou, Yong [2 ]
Liu, Donghong [1 ]
机构
[1] Acad Mil Sci, Beijing, Peoples R China
[2] Natl Univ Def Technol, Changsha, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Heterogeneous behaviours are widespread in many multi-agent tasks, which have not been paid much attention in the community of multi-agent reinforcement learning. It would be a key factor for improving the learning performance to efficiently characterize and automatically find heterogeneous behaviours. In this paper, we introduce the concept of the skill to explore the ability of heterogeneous behaviours. We propose a novel skill-based multi-agent reinforcement learning framework to enable agents to master diverse skills. Specifically, our framework consists of the skill representation mechanism, the skill selector and the skill-based policy learning mechanism. We design an auto-encoder model to generate the latent variable as the skill representation by incorporating the environment information, which ensures the distinguishable of agents for skill selection and the discriminability for skill learning. With the representation, a skill selection mechanism is invented to realize the assignment from agents to skills. Meanwhile, diverse skill-based policies are generated through a novel skill-based policy learning method. To promote efficient skill discovery, a mutual information based intrinsic reward function is constructed. Empirical results show that our framework obtains the best performance on three challenging benchmarks, i.e., StarCraft II micromanagement tasks, Google Research Football and GoBigger, over state-of-the-art MARL methods.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] A Multi-agent System that Searches for Learning Objects in Heterogeneous Repositories
    De la Prieta, Fernando
    Belen Gil, Ana
    [J]. TRENDS IN PRACTICAL APPLICATIONS OF AGENTS AND MULTIAGENT SYSTEMS, 2010, 71 : 355 - 362
  • [22] Reinforcement learning of coordination in heterogeneous cooperative multi-agent systems
    Kapetanakis, S
    Kudenko, D
    [J]. ADAPTIVE AGENTS AND MULTI-AGENT SYSTEMS II: ADAPTATION AND MULTI-AGENT LEARNING, 2005, 3394 : 119 - 131
  • [23] Conservative Multi-agent Online Kernel Learning in Heterogeneous Networks
    Pradhan, Hrusikesha
    Bedi, Amrit Singh
    Koppel, Alec
    Rajawat, Ketan
    [J]. 2020 54TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2020, : 53 - 57
  • [24] Constraint-based multi-agent reinforcement learning for collaborative tasks
    Shang, Xiumin
    Xu, Tengyu
    Karamouzas, Ioannis
    Kallmann, Marcelo
    [J]. COMPUTER ANIMATION AND VIRTUAL WORLDS, 2023, 34 (3-4)
  • [25] Efficient Training Techniques for Multi-Agent Reinforcement Learning in Combat Tasks
    Zhang, Guanyu
    Li, Yuan
    Xu, Xinhai
    Dai, Huadong
    [J]. IEEE ACCESS, 2019, 7 : 109301 - 109310
  • [26] MRRC: Multi-agent Reinforcement Learning with Rectification Capability in Cooperative Tasks
    Yu, Sheng
    Zhu, Wei
    Liu, Shuhong
    Gong, Zhengwen
    Chen, Haoran
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2023, PT II, 2024, 14448 : 204 - 218
  • [27] Discovering Latent Variables for the Tasks With Confounders in Multi-Agent Reinforcement Learning
    Kun Jiang
    Wenzhang Liu
    Yuanda Wang
    Lu Dong
    Changyin Sun
    [J]. IEEE/CAA Journal of Automatica Sinica, 2024, 11 (07) : 1591 - 1604
  • [28] Fast Adaptation via Meta Learning in Multi-Agent Cooperative Tasks
    Jia, Hongda
    Ding, Bo
    Wang, Huaimin
    Gong, Xudong
    Zhou, Xing
    [J]. 2019 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI 2019), 2019, : 707 - 714
  • [29] Discovering Latent Variables for the Tasks With Confounders in Multi-Agent Reinforcement Learning
    Jiang, Kun
    Liu, Wenzhang
    Wang, Yuanda
    Dong, Lu
    Sun, Changyin
    [J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2024, 11 (07) : 1591 - 1604
  • [30] Dynamic scheduling of tasks in cloud manufacturing with multi-agent reinforcement learning
    Wang, Xiaohan
    Zhang, Lin
    Liu, Yongkui
    Li, Feng
    Chen, Zhen
    Zhao, Chun
    Bai, Tian
    [J]. JOURNAL OF MANUFACTURING SYSTEMS, 2022, 65 : 130 - 145