Finding hidden hierarchy in reinforcement learning

被引：0

作者：

Poulton, G

Guo, Y

Lu, W

机构：

[1] CSIRO, Autonomous Syst Informat & Commun Technol Ctr, Epping, NSW 1710, Australia

[2] Univ New S Wales, Kensington, NSW 2033, Australia

来源：

KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 3, PROCEEDINGS | 2005年 / 3683卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

HEXQ is a reinforcement learning algorithm that decomposes a problem into subtasks and constructs a hierarchy using state variables. The maximum number of levels is constrained by the number of variables representing a state. In HEXQ, values learned for a subtask can be reused in different contexts if the subtasks are identical. If not, values for non-identical subtasks need to be trained separately. This paper introduces a method that tackles these two restrictions. Experimental results show that this method can save the training time dramatically.

引用

页码：554 / 561

页数：8

共 50 条

[21] Unconscious reinforcement learning of hidden brain states supported by confidence
Aurelio Cortese
Hakwan Lau
Mitsuo Kawato
Nature Communications, 11
[22] Expectation-Maximization for Inverse Reinforcement Learning with Hidden Data
Bogert, Kenneth
Lin, Jonathan Feng-Shun
Doshi, Prashant
Kulic, Dana
AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 1034 - 1042
[23] Path-finding Using Reinforcement Learning and Affective States
Feldmaier, Johannes
Diepold, Klaus
2014 23RD IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (IEEE RO-MAN), 2014, : 543 - 548
[24] Reinforcement learning of a path-finding behaviour by a mobile robot
Malmstrom, K
Munday, L
Sitte, J
ANZIIS 96 - 1996 AUSTRALIAN NEW ZEALAND CONFERENCE ON INTELLIGENT INFORMATION SYSTEMS, PROCEEDINGS, 1996, : 334 - 337
[25] A reinforcement learning approach involving a shortest path finding algorithm
Kwon, WY
Lee, S
Suh, IH
IROS 2003: PROCEEDINGS OF THE 2003 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, 2003, : 436 - 441
[26] Finding intrinsic rewards by embodied evolution and constrained reinforcement learning
Uchibe, Eiji
Doya, Kenji
NEURAL NETWORKS, 2008, 21 (10) : 1447 - 1455
[27] FINDING THE OPTIMAL SEQUENCE OF FEATURES SELECTION BASED ON REINFORCEMENT LEARNING
Bi, Song
Liu, Lei
Han, Cunwu
Sun, Dehui
2014 IEEE 3RD INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2014, : 347 - 350
[28] A Hierarchy of Deep Reinforcement Learning Agents for Decision Making in Blockchain Nodes
Abu Mallouh, Arafat
Abuzaghleh, Omar
Qawaqneh, Zakariya
IEEE EUROCON 2021 - 19TH INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES, 2021, : 197 - 202
[29] Hidden Treasure in the Linnean Hierarchy
John Dupré
Biology and Philosophy, 2002, 17 (3) : 423 - 433
[30] A hidden hierarchy of neutrino masses
Jezabek, M
Urban, P
PHYSICS LETTERS B, 2002, 541 (1-2) : 142 - 150

← 1 2 3 4 5 →