A Method for Finding Multiple Subgoals for Reinforcement Learning

Cited by: 0

Authors:

Ogihara, Fuminori [1]

Murata, Junichi [1]

Affiliations:

[1] Kyushu Univ, Nishi Ku, 744 Motooka, Fukuoka, Fukuoka, Japan

Source:

PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL LIFE AND ROBOTICS (AROB 16TH '11) | 2011

Keywords:

reinforcement learning; subgoal discovery; the state visiting frequency; the particular state;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper proposes a new method for automatically discovering multiple subgoals to accelerate reinforcement learning. Several methods for subgoal discovery have been proposed. Some use state visiting frequencies in the trajectories that reach the goal state: when a state's visiting frequency is very high, that state is regarded as a subgoal. Because such methods require the goal state to be reached many times in order to collect trajectories, they take a long time to discover subgoals. In addition, they cannot discover potential subgoals that would become appropriate subgoals if the goal state changed. Other methods identify subgoals by partitioning local state transition graphs, but these require a large amount of computation. We propose a new method that addresses these drawbacks. The new method also uses state visiting frequencies, but it collects trajectories that pass through particular non-goal states selected at random. Trajectories are collected for each particular state. Most trajectories reach a particular state more easily than they reach the goal state. Therefore, the method is expected to discover subgoals quickly and to discover multiple subgoals together.
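The idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the exploration policy, the frequency threshold, or how trajectories are stored, so the random-walk policy, the `threshold` parameter, the toy `corridor_step` environment, and all function names below are assumptions introduced purely for illustration.

```python
import random
from collections import Counter


def visit_frequencies(trajectories):
    """Return, for each state, the fraction of trajectories that visit it."""
    counts = Counter()
    for traj in trajectories:
        counts.update(set(traj))  # count each state once per trajectory
    total = len(trajectories)
    return {s: c / total for s, c in counts.items()} if total else {}


def subgoal_candidates(trajectories, target, threshold=0.8):
    """States other than the target visited by at least `threshold` of trajectories.

    The threshold value is an assumption; the abstract only says "very high".
    """
    freq = visit_frequencies(trajectories)
    return {s for s, f in freq.items() if f >= threshold and s != target}


def collect_trajectories(step, states, target, n_traj, max_steps, rng):
    """Random walks from random starts; keep only those that reach `target`."""
    starts = [s for s in states if s != target]
    kept, attempts = [], 0
    while len(kept) < n_traj and attempts < 50 * n_traj:
        attempts += 1
        state = rng.choice(starts)
        traj = [state]
        for _ in range(max_steps):
            state = step(state, rng)
            traj.append(state)
            if state == target:
                kept.append(traj)
                break
    return kept


def discover_subgoals(step, states, goal, n_targets, n_traj=50,
                      max_steps=200, threshold=0.8, seed=0):
    """Pick random non-goal "particular states" and analyse each one separately."""
    rng = random.Random(seed)
    non_goal = [s for s in states if s != goal]
    targets = rng.sample(non_goal, n_targets)
    return {
        t: subgoal_candidates(
            collect_trajectories(step, states, t, n_traj, max_steps, rng),
            t, threshold)
        for t in targets
    }


def corridor_step(state, rng):
    """Hypothetical toy environment: 1-D corridor with states 0..4."""
    return min(4, max(0, state + rng.choice((-1, 1))))
```

For example, given the hand-written trajectories `[[0, 1, 2, 3], [4, 2, 3], [5, 2, 3]]` and target state `3`, state `2` lies on every trajectory, so `subgoal_candidates` returns it as the only candidate. Calling `discover_subgoals(corridor_step, list(range(5)), goal=4, n_targets=2)` runs the same analysis for two randomly chosen non-goal states without ever requiring the goal itself to be reached.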


Pages: 804-807

Page count: 4
