Hierarchical Reinforcement Learning: A Comprehensive Survey

Cited by: 139
Authors
Pateria, Shubham [1]
Subagdja, Budhitama [2]
Tan, Ah-hwee [2]
Quek, Chai [1]
Affiliations
[1] Nanyang Technol Univ, Sch Comp Sci & Engn, 50 Nanyang Ave, Singapore 639798, Singapore
[2] Singapore Management Univ, Sch Comp & Informat Syst, 80 Stamford Rd, Singapore 178902, Singapore
Funding
National Research Foundation, Singapore
Keywords
Hierarchical reinforcement learning; subtask discovery; skill discovery; hierarchical reinforcement learning survey; hierarchical reinforcement learning taxonomy; temporal abstraction; framework; options
DOI
10.1145/3453160
Chinese Library Classification number
TP301 [Theory and Methods]
Discipline classification code
081202
Abstract
Hierarchical Reinforcement Learning (HRL) enables autonomous decomposition of challenging long-horizon decision-making tasks into simpler subtasks. During the past years, the landscape of HRL research has grown profoundly, resulting in copious approaches. A comprehensive overview of this vast landscape is necessary to study HRL in an organized manner. We provide a survey of the diverse HRL approaches concerning the challenges of learning hierarchical policies, subtask discovery, transfer learning, and multi-agent learning using HRL. The survey is presented according to a novel taxonomy of the approaches. Based on the survey, a set of important open problems is proposed to motivate the future research in HRL. Furthermore, we outline a few suitable task domains for evaluating the HRL approaches and a few interesting examples of the practical applications of HRL in the Supplementary Material.
Pages: 35