Hierarchical Multi-Agent Skill Discovery

被引:0
|
作者
Yang, Mingyu [1 ]
Yang, Yaodong [2 ]
Lu, Zhenbo [3 ]
Zhou, Wengang [1 ,3 ]
Li, Houqiang [1 ,3 ]
机构
[1] Univ Sci & Technol China, Chengdu, Sichuan, Peoples R China
[2] Peking Univ, Inst AI, Beijing, Peoples R China
[3] Hefei Comprehens Natl Sci Ctr, Inst Artificial Intelligence, Hefei, Peoples R China
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Skill discovery has shown significant progress in unsupervised reinforcement learning. This approach enables the discovery of a wide range of skills without any extrinsic reward, which can be effectively combined to tackle complex tasks. However, such unsupervised skill learning has not been well applied to multi-agent reinforcement learning (MARL) due to two primary challenges. One is how to learn skills not only for the individual agents but also for the entire team, and the other is how to coordinate the skills of different agents to accomplish multi-agent tasks. To address these challenges, we present Hierarchical Multi-Agent Skill Discovery (HMASD), a two-level hierarchical algorithm for discovering both team and individual skills in MARL. The high-level policy employs a transformer structure to realize sequential skill assignment, while the low-level policy learns to discover valuable team and individual skills. We evaluate HMASD on sparse reward multi-agent benchmarks, and the results show that HMASD achieves significant performance improvements compared to strong MARL baselines.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Multi-Agent Hierarchical Reinforcement Learning with Dynamic Termination
    Han, Dongge
    Boehmer, Wendelin
    Wooldridge, Michael
    Rogers, Alex
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2006 - 2008
  • [22] A Hierarchical Framework for Cooperative Tasks in Multi-agent Systems
    Zhu, Yuanning
    Yang, Qingkai
    Tian, Daiying
    Fang, Hao
    2024 IEEE INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS, CIS AND IEEE INTERNATIONAL CONFERENCE ON ROBOTICS, AUTOMATION AND MECHATRONICS, RAM, CIS-RAM 2024, 2024, : 480 - 485
  • [23] Hybrid coordination of multi-agent networks with hierarchical leaders
    He, Ding-Xin
    Xu, Guang-Hui
    Guan, Zhi-Hong
    Chi, Ming
    Zheng, Ding-Fu
    COMMUNICATIONS IN NONLINEAR SCIENCE AND NUMERICAL SIMULATION, 2015, 27 (1-3) : 110 - 119
  • [24] A hierarchical multi-agent system for natural language diagnosis
    Balsa, J
    ECAI 1998: 13TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 1998, : 195 - 196
  • [25] Hierarchical Influence Maximization for Advertising in Multi-agent Markets
    Maghami, Mahsa
    Sukthankar, Gita
    2013 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2013, : 27 - 33
  • [26] Hierarchical Multi-Agent Training Based on Reinforcement Learning
    Wang, Guanghua
    Li, Wenjie
    Wu, Zhanghua
    Guo, Xian
    2024 9TH ASIA-PACIFIC CONFERENCE ON INTELLIGENT ROBOT SYSTEMS, ACIRS, 2024, : 11 - 18
  • [27] Performance Measure of Hierarchical Structures for Multi-agent Systems
    Ali Raza
    Muhammad Iqbal
    Jun Moon
    Shun-Ichi Azuma
    International Journal of Control, Automation and Systems, 2022, 20 : 780 - 788
  • [28] Multi-agent hierarchical reinforcement learning for energy management
    Jendoubi, Imen
    Bouffard, Francois
    APPLIED ENERGY, 2023, 332
  • [29] ALMA: Hierarchical Learning for Composite Multi-Agent Tasks
    Iqbal, Shariq
    Costales, Robby
    Sha, Fei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [30] Minimizing communication costs in hierarchical multi-agent systems
    Bhutani, KR
    Khan, B
    PROCEEDINGS OF THE 6TH JOINT CONFERENCE ON INFORMATION SCIENCES, 2002, : 1435 - 1442