Hierarchical Multi-Agent Skill Discovery

被引：0

作者：

Yang, Mingyu ^{[1
]}

Yang, Yaodong ^{[2
]}

Lu, Zhenbo ^{[3
]}

Zhou, Wengang ^{[1
,3
]}

Li, Houqiang ^{[1
,3
]}

机构：

[1] Univ Sci & Technol China, Chengdu, Sichuan, Peoples R China

[2] Peking Univ, Inst AI, Beijing, Peoples R China

[3] Hefei Comprehens Natl Sci Ctr, Inst Artificial Intelligence, Hefei, Peoples R China

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Skill discovery has shown significant progress in unsupervised reinforcement learning. This approach enables the discovery of a wide range of skills without any extrinsic reward, which can be effectively combined to tackle complex tasks. However, such unsupervised skill learning has not been well applied to multi-agent reinforcement learning (MARL) due to two primary challenges. One is how to learn skills not only for the individual agents but also for the entire team, and the other is how to coordinate the skills of different agents to accomplish multi-agent tasks. To address these challenges, we present Hierarchical Multi-Agent Skill Discovery (HMASD), a two-level hierarchical algorithm for discovering both team and individual skills in MARL. The high-level policy employs a transformer structure to realize sequential skill assignment, while the low-level policy learns to discover valuable team and individual skills. We evaluate HMASD on sparse reward multi-agent benchmarks, and the results show that HMASD achieves significant performance improvements compared to strong MARL baselines.

引用

页数：18

共 50 条

[21] Multi-Agent Hierarchical Reinforcement Learning with Dynamic Termination
Han, Dongge
Boehmer, Wendelin
Wooldridge, Michael
Rogers, Alex
AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2006 - 2008
[22] A Hierarchical Framework for Cooperative Tasks in Multi-agent Systems
Zhu, Yuanning
Yang, Qingkai
Tian, Daiying
Fang, Hao
2024 IEEE INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS, CIS AND IEEE INTERNATIONAL CONFERENCE ON ROBOTICS, AUTOMATION AND MECHATRONICS, RAM, CIS-RAM 2024, 2024, : 480 - 485
[23] Hybrid coordination of multi-agent networks with hierarchical leaders
He, Ding-Xin
Xu, Guang-Hui
Guan, Zhi-Hong
Chi, Ming
Zheng, Ding-Fu
COMMUNICATIONS IN NONLINEAR SCIENCE AND NUMERICAL SIMULATION, 2015, 27 (1-3) : 110 - 119
[24] A hierarchical multi-agent system for natural language diagnosis
Balsa, J
ECAI 1998: 13TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 1998, : 195 - 196
[25] Hierarchical Influence Maximization for Advertising in Multi-agent Markets
Maghami, Mahsa
Sukthankar, Gita
2013 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2013, : 27 - 33
[26] Hierarchical Multi-Agent Training Based on Reinforcement Learning
Wang, Guanghua
Li, Wenjie
Wu, Zhanghua
Guo, Xian
2024 9TH ASIA-PACIFIC CONFERENCE ON INTELLIGENT ROBOT SYSTEMS, ACIRS, 2024, : 11 - 18
[27] Performance Measure of Hierarchical Structures for Multi-agent Systems
Ali Raza
Muhammad Iqbal
Jun Moon
Shun-Ichi Azuma
International Journal of Control, Automation and Systems, 2022, 20 : 780 - 788
[28] Multi-agent hierarchical reinforcement learning for energy management
Jendoubi, Imen
Bouffard, Francois
APPLIED ENERGY, 2023, 332
[29] ALMA: Hierarchical Learning for Composite Multi-Agent Tasks
Iqbal, Shariq
Costales, Robby
Sha, Fei
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[30] Minimizing communication costs in hierarchical multi-agent systems
Bhutani, KR
Khan, B
PROCEEDINGS OF THE 6TH JOINT CONFERENCE ON INFORMATION SCIENCES, 2002, : 1435 - 1442

← 1 2 3 4 5 →