A Method for Finding Multiple Subgoals for Reinforcement Learning

被引：0

作者：

Ogihara, Fuminori ^{[1
]}

Murata, Junichi ^{[1
]}

机构：

[1] Kyushu Univ, Nishi Ku, 744 Motooka, Fukuoka, Fukuoka, Japan

来源：

PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL LIFE AND ROBOTICS (AROB 16TH '11) | 2011年

关键词：

reinforcement learning; subgoal discovery; the state visiting frequency; the particular state;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper proposes a new method for discovering multiple subgoals automatically to accelerate reinforcement learning. There have been proposed several methods for discovery of subgoals. Some use state visiting frequencies in the trajectories that reach the goal state. When a state visiting frequency is very high, this state is regarded as the subgoal. Because this kind of methods need that the goal state is reached many times to collect trajectories, they take a long time for discovering subgoals. In addition, they cannot discover the potential subgoals that will become appropriate subgoals when the goal state changes. On the other hand, some methods identify subgoals by partitioning local state transition graphs. But this kind of methods require large calculation amounts. We propose a new method that solves the above drawbacks. The new method utilizes state visiting frequencies. But we collect trajectories that go through particular non-goal states selected at random. For each particular state, trajectories are collected. Most of the trajectories reach the particular state more easily that the goal state. Therefore, it is expected that we can discover subgoals quickly and discover multiple subgoals together.

引用

页码：804 / 807

页数：4

共 50 条

[21] Hierarchical Reinforcement Learning-Based End-to-End Visual Servoing With Smooth Subgoals
He, Yaozhen
Gao, Jian
Li, Huiping
Chen, Yimin
Li, Yufeng
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2024, 71 (09) : 11009 - 11018
[22] A LOAD BALANCE PERSONALIZED PATH FINDING WITH MULTIPLE-AGENT DEEP REINFORCEMENT LEARNING
LI, Naipeng
Guo, Yuchun
Chen, Yishuai
Guo, Hengyuan
Soradi-zeid, Samaneh
FRACTALS-COMPLEX GEOMETRY PATTERNS AND SCALING IN NATURE AND SOCIETY, 2023, 31 (06)
[23] Reinforcement learning method for attribute fusion in multiple hypothesis tracking
Korpisaari, P
Saarinen, J
FUSION'98: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MULTISOURCE-MULTISENSOR INFORMATION FUSION, VOLS 1 AND 2, 1998, : 215 - 222
[24] FINDING GEODESICS ON GRAPHS USING REINFORCEMENT LEARNING
Kious, Daniel
Mailler, Cecile
Schapira, Bruno
ANNALS OF APPLIED PROBABILITY, 2022, 32 (05): : 3889 - 3929
[25] ON CHOICE OF SUBGOALS FOR LEARNING CONTROL SYSTEMS
JONES, LE
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1968, AC13 (06) : 613 - +
[26] Learning to generate subgoals for action sequences
Schmidhuber, J.
Proceedings of the International Conference on Artificial Neural Networks, 1991,
[27] Federated Reinforcement Learning Acceleration Method for Precise Control of Multiple Devices
Lim, Hyun-Kyo
Kim, Ju-Bong
Ullah, Ihsan
Heo, Joo-Seong
Han, Youn-Hee
IEEE ACCESS, 2021, 9 : 76296 - 76306
[28] A collaborative siege method of multiple unmanned vehicles based on reinforcement learning
Su, Muqing
Pu, Ruimin
Wang, Yin
Yu, Meng
INTELLIGENCE & ROBOTICS, 2024, 4 (01): : 39 - 60
[29] New reinforcement learning method using multiple Q-tables
Park, MS
Choi, AY
6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XI, PROCEEDINGS: COMPUTER SCIENCE II, 2002, : 88 - 92
[30] Cooperative Search Method for Multiple UAVs Based on Deep Reinforcement Learning
Gao, Mingsheng
Zhang, Xiaoxuan
SENSORS, 2022, 22 (18)

← 1 2 3 4 5 →