A Method for Finding Multiple Subgoals for Reinforcement Learning

被引:0
|
作者
Ogihara, Fuminori [1 ]
Murata, Junichi [1 ]
机构
[1] Kyushu Univ, Nishi Ku, 744 Motooka, Fukuoka, Fukuoka, Japan
关键词
reinforcement learning; subgoal discovery; the state visiting frequency; the particular state;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a new method for discovering multiple subgoals automatically to accelerate reinforcement learning. There have been proposed several methods for discovery of subgoals. Some use state visiting frequencies in the trajectories that reach the goal state. When a state visiting frequency is very high, this state is regarded as the subgoal. Because this kind of methods need that the goal state is reached many times to collect trajectories, they take a long time for discovering subgoals. In addition, they cannot discover the potential subgoals that will become appropriate subgoals when the goal state changes. On the other hand, some methods identify subgoals by partitioning local state transition graphs. But this kind of methods require large calculation amounts. We propose a new method that solves the above drawbacks. The new method utilizes state visiting frequencies. But we collect trajectories that go through particular non-goal states selected at random. For each particular state, trajectories are collected. Most of the trajectories reach the particular state more easily that the goal state. Therefore, it is expected that we can discover subgoals quickly and discover multiple subgoals together.
引用
收藏
页码:804 / 807
页数:4
相关论文
共 50 条
  • [21] Hierarchical Reinforcement Learning-Based End-to-End Visual Servoing With Smooth Subgoals
    He, Yaozhen
    Gao, Jian
    Li, Huiping
    Chen, Yimin
    Li, Yufeng
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2024, 71 (09) : 11009 - 11018
  • [22] A LOAD BALANCE PERSONALIZED PATH FINDING WITH MULTIPLE-AGENT DEEP REINFORCEMENT LEARNING
    LI, Naipeng
    Guo, Yuchun
    Chen, Yishuai
    Guo, Hengyuan
    Soradi-zeid, Samaneh
    FRACTALS-COMPLEX GEOMETRY PATTERNS AND SCALING IN NATURE AND SOCIETY, 2023, 31 (06)
  • [23] Reinforcement learning method for attribute fusion in multiple hypothesis tracking
    Korpisaari, P
    Saarinen, J
    FUSION'98: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MULTISOURCE-MULTISENSOR INFORMATION FUSION, VOLS 1 AND 2, 1998, : 215 - 222
  • [24] FINDING GEODESICS ON GRAPHS USING REINFORCEMENT LEARNING
    Kious, Daniel
    Mailler, Cecile
    Schapira, Bruno
    ANNALS OF APPLIED PROBABILITY, 2022, 32 (05): : 3889 - 3929
  • [25] ON CHOICE OF SUBGOALS FOR LEARNING CONTROL SYSTEMS
    JONES, LE
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1968, AC13 (06) : 613 - +
  • [26] Learning to generate subgoals for action sequences
    Schmidhuber, J.
    Proceedings of the International Conference on Artificial Neural Networks, 1991,
  • [27] Federated Reinforcement Learning Acceleration Method for Precise Control of Multiple Devices
    Lim, Hyun-Kyo
    Kim, Ju-Bong
    Ullah, Ihsan
    Heo, Joo-Seong
    Han, Youn-Hee
    IEEE ACCESS, 2021, 9 : 76296 - 76306
  • [28] A collaborative siege method of multiple unmanned vehicles based on reinforcement learning
    Su, Muqing
    Pu, Ruimin
    Wang, Yin
    Yu, Meng
    INTELLIGENCE & ROBOTICS, 2024, 4 (01): : 39 - 60
  • [29] New reinforcement learning method using multiple Q-tables
    Park, MS
    Choi, AY
    6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XI, PROCEEDINGS: COMPUTER SCIENCE II, 2002, : 88 - 92
  • [30] Cooperative Search Method for Multiple UAVs Based on Deep Reinforcement Learning
    Gao, Mingsheng
    Zhang, Xiaoxuan
    SENSORS, 2022, 22 (18)