Hierarchical Multi-Agent Skill Discovery

被引：0

作者：

Yang, Mingyu ^{[1
]}

Yang, Yaodong ^{[2
]}

Lu, Zhenbo ^{[3
]}

Zhou, Wengang ^{[1
,3
]}

Li, Houqiang ^{[1
,3
]}

机构：

[1] Univ Sci & Technol China, Chengdu, Sichuan, Peoples R China

[2] Peking Univ, Inst AI, Beijing, Peoples R China

[3] Hefei Comprehens Natl Sci Ctr, Inst Artificial Intelligence, Hefei, Peoples R China

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Skill discovery has shown significant progress in unsupervised reinforcement learning. This approach enables the discovery of a wide range of skills without any extrinsic reward, which can be effectively combined to tackle complex tasks. However, such unsupervised skill learning has not been well applied to multi-agent reinforcement learning (MARL) due to two primary challenges. One is how to learn skills not only for the individual agents but also for the entire team, and the other is how to coordinate the skills of different agents to accomplish multi-agent tasks. To address these challenges, we present Hierarchical Multi-Agent Skill Discovery (HMASD), a two-level hierarchical algorithm for discovering both team and individual skills in MARL. The high-level policy employs a transformer structure to realize sequential skill assignment, while the low-level policy learns to discover valuable team and individual skills. We evaluate HMASD on sparse reward multi-agent benchmarks, and the results show that HMASD achieves significant performance improvements compared to strong MARL baselines.

引用

页数：18

共 50 条

[1] Discovery of emergent natural laws by hierarchical multi-agent systems
Stolk, H
Gates, K
Hanan, J
IEEE/WIC INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2003, : 75 - 82
[2] Skill Emergence and Transfer in Multi-Agent Environments
Kanitscheider, Ingmar
Baker, Bowen
Markov, Todor
Mordatch, Igor
PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCCO'19 COMPANION), 2019, : 55 - 56
[3] Heterogeneous Skill Learning for Multi-agent Tasks
Liu, Yuntao
Li, Yuan
Xu, Xinhai
Dou, Yong
Liu, Donghong
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[4] Hierarchical multi-agent reinforcement learning
Mohammad Ghavamzadeh
Sridhar Mahadevan
Rajbala Makar
Autonomous Agents and Multi-Agent Systems, 2006, 13 : 197 - 229
[5] On the hierarchical structure of multi-agent systems
Tian, H
Unland, R
MULTI-AGENT-SYSTEMS IN PRODUCTION, 2000, : 207 - 212
[6] Hierarchical multi-agent reinforcement learning
Ghavamzadeh, Mohammad
Mahadevan, Sridhar
Makar, Rajbala
AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2006, 13 (02) : 197 - 229
[7] Multi-agent architecture for Knowledge Discovery
Pop, Daniel
Negru, Viorel
Sandru, Calin
SYNASC 2006: EIGHTH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING, PROCEEDINGS, 2007, : 217 - +
[8] Deep Skill Chaining with Diversity for Multi-agent Systems
Xie, Zaipeng
Ji, Cheng
Zhang, Yufeng
ARTIFICIAL INTELLIGENCE, CICAI 2022, PT III, 2022, 13606 : 208 - 220
[9] Multi-Agent System With Hierarchical Private Key
Flonta, Stelian
Vegh, Laura
Miclea, Liviu Cristian
Stefan, Iulia
Enyedi, Szilard
2014 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION, QUALITY AND TESTING, ROBOTICS, 2014,
[10] Hierarchical structure design for multi-agent consensus
Xi, Yu-Geng
Li, Xiao-Li
Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2015, 32 (09): : 1191 - 1199

← 1 2 3 4 5 →