FeUdal Networks for Hierarchical Reinforcement Learning

被引:0
|
作者
Vezhnevets, Alexander Sasha [1 ]
Osindero, Simon [1 ]
Schaul, Tom [1 ]
Heess, Nicolas [1 ]
Jaderberg, Max [1 ]
Silver, David [1 ]
Kavukcuoglu, Koray [1 ]
机构
[1] DeepMind, London, England
来源
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70 | 2017年 / 70卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce FeUdal Networks (FuNs): a novel architecture for hierarchical reinforcement learning. Our approach is inspired by the feudal reinforcement learning proposal of Dayan and Hinton, and gains power and efficacy by decoupling end-to-end learning across multiple levels - allowing it to utilise different resolutions of time. Our framework employs a Manager module and a Worker module. The Manager operates at a lower temporal resolution and sets abstract goals which are conveyed to and enacted by the Worker. The Worker generates primitive actions at every tick of the environment. The decoupled structure of FuN conveys several benefits - in addition to facilitating very long timescale credit assignment it also encourages the emergence of sub-policies associated with different goals set by the Manager. These properties allow FuN to dramatically outperform a strong baseline agent on tasks that involve long-term credit assignment or memorisation.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Scalable Evolutionary Hierarchical Reinforcement Learning
    Abramowitz, Sasha
    Nitschke, Geoff
    PROCEEDINGS OF THE 2022 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2022, 2022, : 272 - 275
  • [42] A Neural Signature of Hierarchical Reinforcement Learning
    Ribas-Fernandes, Jose J. F.
    Solway, Alec
    Diuk, Carlos
    McGuire, Joseph T.
    Barto, Andrew G.
    Niv, Yael
    Botvinick, Matthew M.
    NEURON, 2011, 71 (02) : 370 - 379
  • [43] A Novel Artificial Hydrocarbon Networks Based Value Function Approximation in Hierarchical Reinforcement Learning
    Ponce, Hiram
    ADVANCES IN SOFT COMPUTING, MICAI 2016, PT II, 2017, 10062 : 211 - 225
  • [44] Real-Time Object Navigation With Deep Neural Networks and Hierarchical Reinforcement Learning
    Staroverov, Aleksey
    Yudin, Dmitry A.
    Belkin, Ilya
    Adeshkin, Vasily
    Solomentsev, Yaroslav K.
    Panov, Aleksandr I.
    IEEE ACCESS, 2020, 8 : 195608 - 195621
  • [45] Dynamic task scheduling method for relay satellite networks based on hierarchical reinforcement learning
    Liu R.
    Ma T.
    Wu W.
    Yao C.
    Yang Q.
    Tongxin Xuebao/Journal on Communications, 2023, 44 (07): : 207 - 217
  • [46] Optimal Hierarchical Learning Path Design With Reinforcement Learning
    Li, Xiao
    Xu, Hanchen
    Zhang, Jinming
    Chang, Hua-hua
    APPLIED PSYCHOLOGICAL MEASUREMENT, 2021, 45 (01) : 54 - 70
  • [47] Learning Generalizable Locomotion Skills with Hierarchical Reinforcement Learning
    Li, Tianyu
    Lambert, Nathan
    Calandra, Roberto
    Meier, Franziska
    Rai, Akshara
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 413 - 419
  • [48] Effectively Learning Initiation Sets in Hierarchical Reinforcement Learning
    Bagaria, Akhil
    Abbatematteo, Ben
    Gottesman, Omer
    Corsaro, Matt
    Rammohan, Sreehari
    Konidaris, George
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [49] Feudal Latent Space Exploration for Coordinated Multi-Agent Reinforcement Learning
    Liu, Xiangyu
    Tan, Ying
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) : 7775 - 7783
  • [50] Reinforcement Learning with Adaptive Networks
    Sasaki, Tomoki
    Yamada, Satoshi
    2017 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION SCIENCES (ICRAS), 2017, : 1 - 5