Recent advances in hierarchical reinforcement learning

Cited by: 3
Authors
Barto, AG [1]
Mahadevan, S [1]
Affiliations
[1] Univ Massachusetts, Dept Comp Sci, Autonomous Learning Lab, Amherst, MA 01003 USA
DOI
10.1023/A:1022140919877
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Reinforcement learning is bedeviled by the curse of dimensionality: the number of parameters to be learned grows exponentially with the size of any compact encoding of a state. Recent attempts to combat the curse of dimensionality have turned to principled ways of exploiting temporal abstraction, where decisions are not required at each step, but rather invoke the execution of temporally-extended activities which follow their own policies until termination. This leads naturally to hierarchical control architectures and associated learning algorithms. We review several approaches to temporal abstraction and hierarchical organization that machine learning researchers have recently developed. Common to these approaches is a reliance on the theory of semi-Markov decision processes, which we emphasize in our review. We then discuss extensions of these ideas to concurrent activities, multiagent coordination, and hierarchical memory for addressing partial observability. Concluding remarks address open challenges facing the further development of reinforcement learning in a hierarchical setting.
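The abstract's central idea, temporally-extended activities that follow their own policies until termination and are analyzed with semi-Markov decision processes, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the six-cell corridor environment, the `Option` class, the option names, and the hyperparameters are all hypothetical. The update rule is the standard SMDP Q-learning form, in which an option that runs for k steps discounts the next-state value by gamma to the power k.

```python
import random
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

# Hypothetical toy task: a corridor of 6 cells with the goal at the right end.
N_STATES = 6
GOAL = N_STATES - 1
GAMMA = 0.9


def step(state: int, action: int) -> Tuple[int, float]:
    """One primitive step; action is -1 (left) or +1 (right)."""
    next_state = min(max(state + action, 0), GOAL)
    reward = 1.0 if next_state == GOAL else 0.0
    return next_state, reward


@dataclass(frozen=True)
class Option:
    """A temporally-extended activity: its own policy plus a termination test."""
    name: str
    policy: Callable[[int], int]
    terminates: Callable[[int], bool]


def run_option(state: int, option: Option, max_steps: int = 50) -> Tuple[int, float, int]:
    """Follow the option's policy until it terminates.

    Returns (final state, discounted reward accumulated, number of steps k).
    """
    total, discount, k = 0.0, 1.0, 0
    while k < max_steps:
        state, reward = step(state, option.policy(state))
        total += discount * reward
        discount *= GAMMA
        k += 1
        if option.terminates(state) or state == GOAL:
            break
    return state, total, k


# One extended option and two single-step options (all hypothetical).
OPTIONS: List[Option] = [
    Option("run_right", lambda s: +1, lambda s: s == GOAL),
    Option("right", lambda s: +1, lambda s: True),
    Option("left", lambda s: -1, lambda s: True),
]


def smdp_q_learning(options: List[Option], episodes: int = 200, alpha: float = 0.5,
                    epsilon: float = 0.1, seed: int = 0) -> Dict[Tuple[int, str], float]:
    """Epsilon-greedy SMDP Q-learning over options."""
    rng = random.Random(seed)
    q = {(s, o.name): 0.0 for s in range(N_STATES) for o in options}
    for _ in range(episodes):
        s = 0
        while s != GOAL:
            if rng.random() < epsilon:
                o = rng.choice(options)
            else:
                o = max(options, key=lambda opt: q[(s, opt.name)])
            s2, r, k = run_option(s, o)
            best_next = 0.0 if s2 == GOAL else max(q[(s2, x.name)] for x in options)
            # SMDP backup: discount the successor value by GAMMA ** k.
            q[(s, o.name)] += alpha * (r + GAMMA ** k * best_next - q[(s, o.name)])
            s = s2
    return q


if __name__ == "__main__":
    q_values = smdp_q_learning(OPTIONS)
    for opt in OPTIONS:
        print(opt.name, round(q_values[(0, opt.name)], 4))
```

In this sketch the decision maker chooses among options only when the previous option has terminated, so a single choice of `run_right` from the leftmost cell covers five primitive steps. That is the sense in which decisions are not required at each step.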
Pages: 41-77
Page count: 37