Hierarchical Multi-Agent Skill Discovery

Cited by: 0
Authors
Yang, Mingyu [1]
Yang, Yaodong [2]
Lu, Zhenbo [3]
Zhou, Wengang [1,3]
Li, Houqiang [1,3]
Affiliations
[1] Univ Sci & Technol China, Chengdu, Sichuan, Peoples R China
[2] Peking Univ, Inst AI, Beijing, Peoples R China
[3] Hefei Comprehens Natl Sci Ctr, Inst Artificial Intelligence, Hefei, Peoples R China
Source
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023
Funding
National Natural Science Foundation of China; National Key Research and Development Program of China
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Skill discovery has shown significant progress in unsupervised reinforcement learning. This approach enables the discovery of a wide range of skills without any extrinsic reward, which can be effectively combined to tackle complex tasks. However, such unsupervised skill learning has not been well applied to multi-agent reinforcement learning (MARL) due to two primary challenges. One is how to learn skills not only for the individual agents but also for the entire team, and the other is how to coordinate the skills of different agents to accomplish multi-agent tasks. To address these challenges, we present Hierarchical Multi-Agent Skill Discovery (HMASD), a two-level hierarchical algorithm for discovering both team and individual skills in MARL. The high-level policy employs a transformer structure to realize sequential skill assignment, while the low-level policy learns to discover valuable team and individual skills. We evaluate HMASD on sparse reward multi-agent benchmarks, and the results show that HMASD achieves significant performance improvements compared to strong MARL baselines.
Pages: 18
Related papers
50 in total
  • [1] Discovery of emergent natural laws by hierarchical multi-agent systems
    Stolk, H
    Gates, K
    Hanan, J
    IEEE/WIC INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2003, : 75 - 82
  • [2] Skill Emergence and Transfer in Multi-Agent Environments
    Kanitscheider, Ingmar
    Baker, Bowen
    Markov, Todor
    Mordatch, Igor
    PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCO'19 COMPANION), 2019, : 55 - 56
  • [3] Heterogeneous Skill Learning for Multi-agent Tasks
    Liu, Yuntao
    Li, Yuan
    Xu, Xinhai
    Dou, Yong
    Liu, Donghong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [4] Hierarchical multi-agent reinforcement learning
    Mohammad Ghavamzadeh
    Sridhar Mahadevan
    Rajbala Makar
    Autonomous Agents and Multi-Agent Systems, 2006, 13 : 197 - 229
  • [5] On the hierarchical structure of multi-agent systems
    Tian, H
    Unland, R
    MULTI-AGENT-SYSTEMS IN PRODUCTION, 2000, : 207 - 212
  • [6] Hierarchical multi-agent reinforcement learning
    Ghavamzadeh, Mohammad
    Mahadevan, Sridhar
    Makar, Rajbala
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2006, 13 (02) : 197 - 229
  • [7] Multi-agent architecture for Knowledge Discovery
    Pop, Daniel
    Negru, Viorel
    Sandru, Calin
    SYNASC 2006: EIGHTH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING, PROCEEDINGS, 2007, : 217 - +
  • [8] Deep Skill Chaining with Diversity for Multi-agent Systems
    Xie, Zaipeng
    Ji, Cheng
    Zhang, Yufeng
    ARTIFICIAL INTELLIGENCE, CICAI 2022, PT III, 2022, 13606 : 208 - 220
  • [9] Multi-Agent System With Hierarchical Private Key
    Flonta, Stelian
    Vegh, Laura
    Miclea, Liviu Cristian
    Stefan, Iulia
    Enyedi, Szilard
    2014 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION, QUALITY AND TESTING, ROBOTICS, 2014,
  • [10] Hierarchical structure design for multi-agent consensus
    Xi, Yu-Geng
    Li, Xiao-Li
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2015, 32 (09) : 1191 - 1199