FeUdal Networks for Hierarchical Reinforcement Learning

被引：0

作者：

Vezhnevets, Alexander Sasha ^{[1
]}

Osindero, Simon ^{[1
]}

Schaul, Tom ^{[1
]}

Heess, Nicolas ^{[1
]}

Jaderberg, Max ^{[1
]}

Silver, David ^{[1
]}

Kavukcuoglu, Koray ^{[1
]}

机构：

[1] DeepMind, London, England

来源：

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70 | 2017年 / 70卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We introduce FeUdal Networks (FuNs): a novel architecture for hierarchical reinforcement learning. Our approach is inspired by the feudal reinforcement learning proposal of Dayan and Hinton, and gains power and efficacy by decoupling end-to-end learning across multiple levels - allowing it to utilise different resolutions of time. Our framework employs a Manager module and a Worker module. The Manager operates at a lower temporal resolution and sets abstract goals which are conveyed to and enacted by the Worker. The Worker generates primitive actions at every tick of the environment. The decoupled structure of FuN conveys several benefits - in addition to facilitating very long timescale credit assignment it also encourages the emergence of sub-policies associated with different goals set by the Manager. These properties allow FuN to dramatically outperform a strong baseline agent on tasks that involve long-term credit assignment or memorisation.

引用

页数：10

共 50 条

[41] Scalable Evolutionary Hierarchical Reinforcement Learning
Abramowitz, Sasha
Nitschke, Geoff
PROCEEDINGS OF THE 2022 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2022, 2022, : 272 - 275
[42] A Neural Signature of Hierarchical Reinforcement Learning
Ribas-Fernandes, Jose J. F.
Solway, Alec
Diuk, Carlos
McGuire, Joseph T.
Barto, Andrew G.
Niv, Yael
Botvinick, Matthew M.
NEURON, 2011, 71 (02) : 370 - 379
[43] A Novel Artificial Hydrocarbon Networks Based Value Function Approximation in Hierarchical Reinforcement Learning
Ponce, Hiram
ADVANCES IN SOFT COMPUTING, MICAI 2016, PT II, 2017, 10062 : 211 - 225
[44] Real-Time Object Navigation With Deep Neural Networks and Hierarchical Reinforcement Learning
Staroverov, Aleksey
Yudin, Dmitry A.
Belkin, Ilya
Adeshkin, Vasily
Solomentsev, Yaroslav K.
Panov, Aleksandr I.
IEEE ACCESS, 2020, 8 : 195608 - 195621
[45] Dynamic task scheduling method for relay satellite networks based on hierarchical reinforcement learning
Liu R.
Ma T.
Wu W.
Yao C.
Yang Q.
Tongxin Xuebao/Journal on Communications, 2023, 44 (07): : 207 - 217
[46] Optimal Hierarchical Learning Path Design With Reinforcement Learning
Li, Xiao
Xu, Hanchen
Zhang, Jinming
Chang, Hua-hua
APPLIED PSYCHOLOGICAL MEASUREMENT, 2021, 45 (01) : 54 - 70
[47] Learning Generalizable Locomotion Skills with Hierarchical Reinforcement Learning
Li, Tianyu
Lambert, Nathan
Calandra, Roberto
Meier, Franziska
Rai, Akshara
2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 413 - 419
[48] Effectively Learning Initiation Sets in Hierarchical Reinforcement Learning
Bagaria, Akhil
Abbatematteo, Ben
Gottesman, Omer
Corsaro, Matt
Rammohan, Sreehari
Konidaris, George
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[49] Feudal Latent Space Exploration for Coordinated Multi-Agent Reinforcement Learning
Liu, Xiangyu
Tan, Ying
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) : 7775 - 7783
[50] Reinforcement Learning with Adaptive Networks
Sasaki, Tomoki
Yamada, Satoshi
2017 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION SCIENCES (ICRAS), 2017, : 1 - 5

← 1 2 3 4 5 →