On Efficiency in Hierarchical Reinforcement Learning

被引:0
|
作者
Wen, Zheng [1 ]
Precup, Doina [1 ]
Ibrahimi, Morteza [1 ]
Barreto, Andre [1 ]
Van Roy, Benjamin [1 ]
Singh, Satinder [1 ]
机构
[1] DeepMind, London, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hierarchical Reinforcement Learning (HRL) approaches promise to provide more efficient solutions to sequential decision making problems, both in terms of statistical as well as computational efficiency. While this has been demonstrated empirically over time in a variety of tasks, theoretical results quantifying the benefits of such methods are still few and far between. In this paper, we discuss the kind of structure in a Markov decision process which gives rise to efficient HRL methods. Specifically, we formalize the intuition that HRL can exploit well repeating "subMDPs", with similar reward and transition structure. We show that, under reasonable assumptions, a model-based Thompson sampling-style HRL algorithm that exploits this structure is statistically efficient, as established through a finite-time regret bound. We also establish conditions under which planning with structure-induced options is near-optimal and computationally efficient.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] A Sample Efficiency Improved Method via Hierarchical Reinforcement Learning Networks
    Chen, Qinghua
    Dallas, Evan
    Shahverdi, Pourya
    Korneder, Jessica
    Rawashdeh, Osamah A.
    Louie, Wing-Yue Geoffrey
    [J]. 2022 31ST IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (IEEE RO-MAN 2022), 2022, : 1498 - 1505
  • [2] Concurrent Hierarchical Reinforcement Learning
    Marthi, Bhaskara
    Russell, Stuart
    Latham, David
    Guestrin, Carlos
    [J]. 19TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-05), 2005, : 779 - 785
  • [3] Improving Energy Efficiency in Green Femtocell Networks: A Hierarchical Reinforcement Learning Framework
    Chen, Xianfu
    Zhang, Honggang
    Chen, Tao
    Lasanen, Mika
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2013,
  • [4] Hierarchical reinforcement learning with OMQ
    Shen, Jing
    Liu, Haibo
    Gu, Guochang
    [J]. PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS, VOLS 1 AND 2, 2006, : 584 - 588
  • [5] Hierarchical Imitation and Reinforcement Learning
    Le, Hoang M.
    Jiang, Nan
    Agarwal, Alekh
    Dudik, Miroslav
    Yue, Yisong
    Daume, Hal, III
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [6] Budgeted Hierarchical Reinforcement Learning
    Leon, Aurelia
    Denoyer, Ludovic
    [J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [7] Enhancing Efficiency in Hierarchical Reinforcement Learning through Topological-Sorted Potential Calculation
    Zhou, Ziyun
    Shang, Jingwei
    Li, Yimang
    [J]. ELECTRONICS, 2023, 12 (17)
  • [8] A Hierarchical Deep Reinforcement Learning Framework With High Efficiency and Generalization for Fast and Safe Navigation
    Zhu, Wei
    Hayashibe, Mitsuhiro
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, 70 (05) : 4962 - 4971
  • [9] Hierarchical Reinforcement Learning: A Comprehensive Survey
    Pateria, Shubham
    Subagdja, Budhitama
    Tan, Ah-hwee
    Quek, Chai
    [J]. ACM COMPUTING SURVEYS, 2021, 54 (05)
  • [10] Deep Reinforcement Learning with Hierarchical Structures
    Li, Siyuan
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 4899 - 4900