A Method for Finding Multiple Subgoals for Reinforcement Learning

Cited by: 0

Authors:

Ogihara, Fuminori [1]

Murata, Junichi [1]

Affiliations:

[1] Kyushu Univ, Nishi Ku, 744 Motooka, Fukuoka, Fukuoka, Japan

Source:

PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL LIFE AND ROBOTICS (AROB 16TH '11) | 2011

Keywords:

reinforcement learning; subgoal discovery; the state visiting frequency; the particular state;

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper proposes a new method for automatically discovering multiple subgoals to accelerate reinforcement learning. Several methods for subgoal discovery have been proposed. Some use state visiting frequencies in the trajectories that reach the goal state: a state whose visiting frequency is very high is regarded as a subgoal. Because methods of this kind require the goal state to be reached many times in order to collect trajectories, they take a long time to discover subgoals. In addition, they cannot discover potential subgoals, that is, states that would become appropriate subgoals if the goal state changed. Other methods identify subgoals by partitioning local state transition graphs, but these require a large amount of computation. We propose a new method that addresses these drawbacks. The new method also uses state visiting frequencies, but it collects trajectories that pass through particular non-goal states selected at random, with a separate set of trajectories collected for each particular state. Most trajectories reach a particular state more easily than they reach the goal state. It is therefore expected that subgoals can be discovered quickly and that multiple subgoals can be discovered together.
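The idea in the abstract can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the two-room grid `GRID`, the uniform random-walk policy, the function names, and the `threshold`, `episodes`, and `max_steps` values. The sketch picks a few non-goal "particular" states at random, collects random-walk trajectories that reach each of them, and flags states that appear in a large fraction of those trajectories as subgoal candidates.

```python
import random
from collections import Counter

# Hypothetical two-room grid: '#' is a wall, '.' is free. The single gap in
# the middle wall, at (2, 4), is a doorway between the rooms.
GRID = [
    "#########",
    "#...#...#",
    "#.......#",
    "#...#...#",
    "#########",
]
MOVES = [(-1, 0), (1, 0), (0, -1), (0, 1)]


def free_states(grid):
    """Return all non-wall cells as (row, col) tuples."""
    return [(r, c) for r, row in enumerate(grid)
            for c, ch in enumerate(row) if ch == "."]


def random_walk(grid, start, target, max_steps, rng):
    """Walk at random from start; return the path if target is reached, else None."""
    state, path = start, [start]
    for _ in range(max_steps):
        if state == target:
            return path
        dr, dc = rng.choice(MOVES)
        nxt = (state[0] + dr, state[1] + dc)
        if grid[nxt[0]][nxt[1]] != "#":  # bumping into a wall leaves the state unchanged
            state = nxt
        path.append(state)
    return path if state == target else None


def visit_frequencies(grid, start, target, episodes, max_steps, rng):
    """Fraction of successful trajectories that visit each state.

    The start and target states are excluded because they are visited trivially.
    """
    counts, successes = Counter(), 0
    for _ in range(episodes):
        path = random_walk(grid, start, target, max_steps, rng)
        if path is None:
            continue
        successes += 1
        counts.update(set(path) - {start, target})
    if successes == 0:
        return {}
    return {s: n / successes for s, n in counts.items()}


def find_subgoal_candidates(grid, start, goal, n_particular=3, episodes=200,
                            max_steps=200, threshold=0.9, seed=0):
    """Pick particular non-goal states at random and pool the high-frequency states."""
    rng = random.Random(seed)
    pool = [s for s in free_states(grid) if s not in (start, goal)]
    particular = rng.sample(pool, n_particular)
    candidates = {}
    for target in particular:
        freq = visit_frequencies(grid, start, target, episodes, max_steps, rng)
        for s, f in freq.items():
            if f >= threshold:
                candidates[s] = max(f, candidates.get(s, 0.0))
    return particular, candidates


if __name__ == "__main__":
    particular, candidates = find_subgoal_candidates(GRID, start=(1, 1), goal=(1, 7))
    print("particular states:", particular)
    print("subgoal candidates:", sorted(candidates))
```

In this toy grid, any trajectory that crosses between the rooms has to pass through the doorway, so the doorway and the cells next to it tend to score highly whenever a particular state lies in the room opposite the start. A real implementation would need some rule for separating true bottlenecks from cells that simply sit close to the start or to a particular state; the abstract does not say how the paper handles this.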


Pages: 804-807

Number of pages: 4
