Finding hidden hierarchy in reinforcement learning

Citations: 0

Authors:

Poulton, G

Guo, Y

Lu, W

Affiliations:

[1] CSIRO, Autonomous Syst Informat & Commun Technol Ctr, Epping, NSW 1710, Australia

[2] Univ New S Wales, Kensington, NSW 2033, Australia

Source:

KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 3, PROCEEDINGS | 2005 / Vol. 3683

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

HEXQ is a reinforcement learning algorithm that decomposes a problem into subtasks and constructs a hierarchy using state variables. The maximum number of levels in the hierarchy is therefore limited by the number of variables representing a state. In HEXQ, values learned for a subtask can be reused in different contexts only if the subtasks are identical; values for non-identical subtasks must be trained separately. This paper introduces a method that addresses both restrictions. Experimental results show that the method can reduce training time dramatically.


Pages: 554-561

Page count: 8

