Finding hidden hierarchy in reinforcement learning

被引:0
|
作者
Poulton, G
Guo, Y
Lu, W
机构
[1] CSIRO, Autonomous Syst Informat & Commun Technol Ctr, Epping, NSW 1710, Australia
[2] Univ New S Wales, Kensington, NSW 2033, Australia
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
HEXQ is a reinforcement learning algorithm that decomposes a problem into subtasks and constructs a hierarchy using state variables. The maximum number of levels is constrained by the number of variables representing a state. In HEXQ, values learned for a subtask can be reused in different contexts if the subtasks are identical. If not, values for non-identical subtasks need to be trained separately. This paper introduces a method that tackles these two restrictions. Experimental results show that this method can save the training time dramatically.
引用
收藏
页码:554 / 561
页数:8
相关论文
共 50 条
  • [31] Finding hidden tumors
    Technol Rev, 2006, 6 NOVEMBER
  • [32] Finding hidden tumors
    Bourzac, Katherine
    TECHNOLOGY REVIEW, 2006, 109 (05) : 78 - 80
  • [33] Finding Hidden Motives
    Lu, Cindy
    CELL, 2016, 165 (01) : 5 - 7
  • [34] Finding hidden assets
    Forger, Gary R.
    Modern Materials Handling, 2002, 57 (03)
  • [35] FINDING HIDDEN AUDIENCES
    YEP, BH
    RIGGS, NP
    JOURNAL OF EXTENSION, 1978, 16 (JUL-): : 5 - 10
  • [36] THE HIDDEN HIERARCHY - RESSNER,U
    UNDERWOOD, J
    JOURNAL OF SOCIAL POLICY, 1988, 17 : 261 - 262
  • [37] Finding Hidden Cases
    Bowdoin, C. D.
    Buchanan, C. S.
    AMERICAN JOURNAL OF PUBLIC HEALTH AND THE NATIONS HEALTH, 1949, 39 (11): : 1441 - 1445
  • [38] FINDING HIDDEN CASES
    BOWDOIN, CD
    BUCHANAN, CS
    AMERICAN JOURNAL OF PUBLIC HEALTH, 1949, 39 (11) : 1441 - 1445
  • [39] Autonomous motion recognition by combining reinforcement learning and hidden Markov model
    Graduate School of Science and Engineering, Tokyo Institute of Technology, Yokohama, 226-8503, Japan
    Syst Comput Jpn, 2006, 14 (34-43):
  • [40] A hidden anti-jamming method based on deep reinforcement learning
    Wang, Yifan
    Liu, Xin
    Wang, Mei
    Yu, Yu
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2021, 15 (09): : 3444 - 3457