Finding hidden hierarchy in reinforcement learning

被引:0
|
作者
Poulton, G
Guo, Y
Lu, W
机构
[1] CSIRO, Autonomous Syst Informat & Commun Technol Ctr, Epping, NSW 1710, Australia
[2] Univ New S Wales, Kensington, NSW 2033, Australia
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
HEXQ is a reinforcement learning algorithm that decomposes a problem into subtasks and constructs a hierarchy using state variables. The maximum number of levels is constrained by the number of variables representing a state. In HEXQ, values learned for a subtask can be reused in different contexts if the subtasks are identical. If not, values for non-identical subtasks need to be trained separately. This paper introduces a method that tackles these two restrictions. Experimental results show that this method can save the training time dramatically.
引用
收藏
页码:554 / 561
页数:8
相关论文
共 50 条
  • [21] Unconscious reinforcement learning of hidden brain states supported by confidence
    Aurelio Cortese
    Hakwan Lau
    Mitsuo Kawato
    Nature Communications, 11
  • [22] Expectation-Maximization for Inverse Reinforcement Learning with Hidden Data
    Bogert, Kenneth
    Lin, Jonathan Feng-Shun
    Doshi, Prashant
    Kulic, Dana
    AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 1034 - 1042
  • [23] Path-finding Using Reinforcement Learning and Affective States
    Feldmaier, Johannes
    Diepold, Klaus
    2014 23RD IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (IEEE RO-MAN), 2014, : 543 - 548
  • [24] Reinforcement learning of a path-finding behaviour by a mobile robot
    Malmstrom, K
    Munday, L
    Sitte, J
    ANZIIS 96 - 1996 AUSTRALIAN NEW ZEALAND CONFERENCE ON INTELLIGENT INFORMATION SYSTEMS, PROCEEDINGS, 1996, : 334 - 337
  • [25] A reinforcement learning approach involving a shortest path finding algorithm
    Kwon, WY
    Lee, S
    Suh, IH
    IROS 2003: PROCEEDINGS OF THE 2003 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, 2003, : 436 - 441
  • [26] Finding intrinsic rewards by embodied evolution and constrained reinforcement learning
    Uchibe, Eiji
    Doya, Kenji
    NEURAL NETWORKS, 2008, 21 (10) : 1447 - 1455
  • [27] FINDING THE OPTIMAL SEQUENCE OF FEATURES SELECTION BASED ON REINFORCEMENT LEARNING
    Bi, Song
    Liu, Lei
    Han, Cunwu
    Sun, Dehui
    2014 IEEE 3RD INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2014, : 347 - 350
  • [28] A Hierarchy of Deep Reinforcement Learning Agents for Decision Making in Blockchain Nodes
    Abu Mallouh, Arafat
    Abuzaghleh, Omar
    Qawaqneh, Zakariya
    IEEE EUROCON 2021 - 19TH INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES, 2021, : 197 - 202
  • [29] Hidden Treasure in the Linnean Hierarchy
    John Dupré
    Biology and Philosophy, 2002, 17 (3) : 423 - 433
  • [30] A hidden hierarchy of neutrino masses
    Jezabek, M
    Urban, P
    PHYSICS LETTERS B, 2002, 541 (1-2) : 142 - 150