A Method for Finding Multiple Subgoals for Reinforcement Learning

被引:0
|
作者
Ogihara, Fuminori [1 ]
Murata, Junichi [1 ]
机构
[1] Kyushu Univ, Nishi Ku, 744 Motooka, Fukuoka, Fukuoka, Japan
关键词
reinforcement learning; subgoal discovery; the state visiting frequency; the particular state;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a new method for discovering multiple subgoals automatically to accelerate reinforcement learning. There have been proposed several methods for discovery of subgoals. Some use state visiting frequencies in the trajectories that reach the goal state. When a state visiting frequency is very high, this state is regarded as the subgoal. Because this kind of methods need that the goal state is reached many times to collect trajectories, they take a long time for discovering subgoals. In addition, they cannot discover the potential subgoals that will become appropriate subgoals when the goal state changes. On the other hand, some methods identify subgoals by partitioning local state transition graphs. But this kind of methods require large calculation amounts. We propose a new method that solves the above drawbacks. The new method utilizes state visiting frequencies. But we collect trajectories that go through particular non-goal states selected at random. For each particular state, trajectories are collected. Most of the trajectories reach the particular state more easily that the goal state. Therefore, it is expected that we can discover subgoals quickly and discover multiple subgoals together.
引用
收藏
页码:804 / 807
页数:4
相关论文
共 50 条
  • [1] Hierarchical Reinforcement Learning With Timed Subgoals
    Guertler, Nico
    Buechler, Dieter
    Martius, Georg
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [2] Introduction and control of subgoals in reinforcement learning
    Murata, Junichi
    Abe, Yasuomi
    Ota, Keisuke
    PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND APPLICATIONS, 2007, : 329 - +
  • [3] Curricular Subgoals for Inverse Reinforcement Learning
    Liu, Shunyu
    Qing, Yunpeng
    Xu, Shuqi
    Wu, Hongyan
    Zhang, Jiangtao
    Cong, Jingyuan
    Chen, Tianhao
    Liu, Yun-Fu
    Song, Mingli
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025, 26 (03) : 3016 - 3027
  • [4] COMBINATIONS OF MICRO-MACRO STATES AND SUBGOALS DISCOVERY IN HIERARCHICAL REINFORCEMENT LEARNING FOR PATH FINDING
    Setyawan, Gembong Edhi
    Sawada, Hideyuki
    Hartono, Pitoyo
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2022, 18 (02): : 447 - 462
  • [5] Autonomic discovery of subgoals in hierarchical reinforcement learning
    XIAO Ding
    LI Yi-tong
    SHI Chuan
    The Journal of China Universities of Posts and Telecommunications, 2014, (05) : 94 - 104
  • [6] Autonomic discovery of subgoals in hierarchical reinforcement learning
    XIAO Ding
    LI Yi-tong
    SHI Chuan
    TheJournalofChinaUniversitiesofPostsandTelecommunications, 2014, 21 (05) : 94 - 104
  • [7] Hierarchical reinforcement learning with subpolicies specializing for learned subgoals
    Bakker, B
    Schmidhuber, J
    Proceedings of the Second IASTED International Conference on Neural Networks and Computational Intelligence, 2004, : 125 - 130
  • [8] Goal-Conditioned Reinforcement Learning with Imagined Subgoals
    Chane-Sane, Elliot
    Schmid, Cordelia
    Laptev, Ivan
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [9] Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning
    Zhang, Tianren
    Guo, Shangqi
    Tan, Tian
    Hu, Xiaolin
    Chen, Feng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [10] A Local Graph Clustering Algorithm for Discovering Subgoals in Reinforcement Learning
    Entezari, Negin
    Shiri, Mohammad Ebrahim
    Moradi, Parham
    COMMUNICATION AND NETWORKING, PT II, 2010, 120 : 41 - 50