FeUdal Networks for Hierarchical Reinforcement Learning

被引：0

作者：

Vezhnevets, Alexander Sasha ^{[1
]}

Osindero, Simon ^{[1
]}

Schaul, Tom ^{[1
]}

Heess, Nicolas ^{[1
]}

Jaderberg, Max ^{[1
]}

Silver, David ^{[1
]}

Kavukcuoglu, Koray ^{[1
]}

机构：

[1] DeepMind, London, England

来源：

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70 | 2017年 / 70卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We introduce FeUdal Networks (FuNs): a novel architecture for hierarchical reinforcement learning. Our approach is inspired by the feudal reinforcement learning proposal of Dayan and Hinton, and gains power and efficacy by decoupling end-to-end learning across multiple levels - allowing it to utilise different resolutions of time. Our framework employs a Manager module and a Worker module. The Manager operates at a lower temporal resolution and sets abstract goals which are conveyed to and enacted by the Worker. The Worker generates primitive actions at every tick of the environment. The decoupled structure of FuN conveys several benefits - in addition to facilitating very long timescale credit assignment it also encourages the emergence of sub-policies associated with different goals set by the Manager. These properties allow FuN to dramatically outperform a strong baseline agent on tasks that involve long-term credit assignment or memorisation.

引用

页数：10

共 50 条

[21] Recent advances in hierarchical reinforcement learning
Barto, AG
Mahadevan, S
DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2003, 13 (04): : 343 - 379
[22] A neural model of hierarchical reinforcement learning
Rasmussen, Daniel
Voelker, Aaron
Eliasmith, Chris
PLOS ONE, 2017, 12 (07):
[23] Hierarchical Reinforcement Learning for Quadruped Locomotion
Jain, Deepali
Iscen, Atil
Caluwaerts, Ken
2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 7551 - 7557
[24] Hierarchical Reinforcement Learning With Timed Subgoals
Guertler, Nico
Buechler, Dieter
Martius, Georg
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[25] Reinforcement Active Learning Hierarchical Loops
Gordon, Goren
Ahissar, Ehud
2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 3008 - 3015
[26] Recent Advances in Hierarchical Reinforcement Learning
Andrew G. Barto
Sridhar Mahadevan
Discrete Event Dynamic Systems, 2003, 13 : 41 - 77
[27] Recent Advances in Hierarchical Reinforcement Learning
Andrew G. Barto
Sridhar Mahadevan
Discrete Event Dynamic Systems, 2003, 13 (4) : 341 - 379
[28] Reinforcement Learning From Hierarchical Critics
Cao, Zehong
Lin, Chin-Teng
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (02) : 1066 - 1073
[29] Hierarchical Adversarial Inverse Reinforcement Learning
Chen, Jiayu
Lan, Tian
Aggarwal, Vaneet
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (12) : 17549 - 17558
[30] Partial Order Hierarchical Reinforcement Learning
Hengst, Bernhard
AI 2008: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2008, 5360 : 138 - 149

← 1 2 3 4 5 →