FeUdal Networks for Hierarchical Reinforcement Learning

被引:0
|
作者
Vezhnevets, Alexander Sasha [1 ]
Osindero, Simon [1 ]
Schaul, Tom [1 ]
Heess, Nicolas [1 ]
Jaderberg, Max [1 ]
Silver, David [1 ]
Kavukcuoglu, Koray [1 ]
机构
[1] DeepMind, London, England
来源
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70 | 2017年 / 70卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce FeUdal Networks (FuNs): a novel architecture for hierarchical reinforcement learning. Our approach is inspired by the feudal reinforcement learning proposal of Dayan and Hinton, and gains power and efficacy by decoupling end-to-end learning across multiple levels - allowing it to utilise different resolutions of time. Our framework employs a Manager module and a Worker module. The Manager operates at a lower temporal resolution and sets abstract goals which are conveyed to and enacted by the Worker. The Worker generates primitive actions at every tick of the environment. The decoupled structure of FuN conveys several benefits - in addition to facilitating very long timescale credit assignment it also encourages the emergence of sub-policies associated with different goals set by the Manager. These properties allow FuN to dramatically outperform a strong baseline agent on tasks that involve long-term credit assignment or memorisation.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Recent advances in hierarchical reinforcement learning
    Barto, AG
    Mahadevan, S
    DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2003, 13 (04): : 343 - 379
  • [22] A neural model of hierarchical reinforcement learning
    Rasmussen, Daniel
    Voelker, Aaron
    Eliasmith, Chris
    PLOS ONE, 2017, 12 (07):
  • [23] Hierarchical Reinforcement Learning for Quadruped Locomotion
    Jain, Deepali
    Iscen, Atil
    Caluwaerts, Ken
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 7551 - 7557
  • [24] Hierarchical Reinforcement Learning With Timed Subgoals
    Guertler, Nico
    Buechler, Dieter
    Martius, Georg
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [25] Reinforcement Active Learning Hierarchical Loops
    Gordon, Goren
    Ahissar, Ehud
    2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 3008 - 3015
  • [26] Recent Advances in Hierarchical Reinforcement Learning
    Andrew G. Barto
    Sridhar Mahadevan
    Discrete Event Dynamic Systems, 2003, 13 : 41 - 77
  • [27] Recent Advances in Hierarchical Reinforcement Learning
    Andrew G. Barto
    Sridhar Mahadevan
    Discrete Event Dynamic Systems, 2003, 13 (4) : 341 - 379
  • [28] Reinforcement Learning From Hierarchical Critics
    Cao, Zehong
    Lin, Chin-Teng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (02) : 1066 - 1073
  • [29] Hierarchical Adversarial Inverse Reinforcement Learning
    Chen, Jiayu
    Lan, Tian
    Aggarwal, Vaneet
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (12) : 17549 - 17558
  • [30] Partial Order Hierarchical Reinforcement Learning
    Hengst, Bernhard
    AI 2008: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2008, 5360 : 138 - 149