A Method for Finding Multiple Subgoals for Reinforcement Learning

被引:0
|
作者
Ogihara, Fuminori [1 ]
Murata, Junichi [1 ]
机构
[1] Kyushu Univ, Nishi Ku, 744 Motooka, Fukuoka, Fukuoka, Japan
关键词
reinforcement learning; subgoal discovery; the state visiting frequency; the particular state;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a new method for discovering multiple subgoals automatically to accelerate reinforcement learning. There have been proposed several methods for discovery of subgoals. Some use state visiting frequencies in the trajectories that reach the goal state. When a state visiting frequency is very high, this state is regarded as the subgoal. Because this kind of methods need that the goal state is reached many times to collect trajectories, they take a long time for discovering subgoals. In addition, they cannot discover the potential subgoals that will become appropriate subgoals when the goal state changes. On the other hand, some methods identify subgoals by partitioning local state transition graphs. But this kind of methods require large calculation amounts. We propose a new method that solves the above drawbacks. The new method utilizes state visiting frequencies. But we collect trajectories that go through particular non-goal states selected at random. For each particular state, trajectories are collected. Most of the trajectories reach the particular state more easily that the goal state. Therefore, it is expected that we can discover subgoals quickly and discover multiple subgoals together.
引用
收藏
页码:804 / 807
页数:4
相关论文
共 50 条
  • [41] Distribution Data Across Multiple Cloud Storage using Reinforcement Learning Method
    Algarni, Abdullah
    Kudenko, Daniel
    ICAART: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2017, : 431 - 438
  • [42] A Multiple-Goal Reinforcement Learning Method for Complex Vehicle Overtaking Maneuvers
    Ngai, Daniel Chi Kit
    Yung, Nelson Hon Ching
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2011, 12 (02) : 509 - 522
  • [43] A Novel Method for Multiple Biomedical Events Extraction with Reinforcement Learning and Knowledge Bases
    Zhao, Weizhong
    Zhao, Yao
    Jiang, Xingpeng
    He, Tingting
    Liu, Fan
    Li, Ning
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 402 - 407
  • [44] Cooperative encirclement method for multiple unmanned ground vehicles based on reinforcement learning
    Su M.
    Wang Y.
    Pu R.
    Yu M.
    Gongcheng Kexue Xuebao/Chinese Journal of Engineering, 2024, 46 (07): : 1237 - 1250
  • [45] LEARNING SUBGOALS AND METHODS FOR SOLVING PROBABILITY PROBLEMS
    CATRAMBONE, R
    HOLYOAK, KJ
    MEMORY & COGNITION, 1990, 18 (06) : 593 - 603
  • [46] The RBMLE method for Reinforcement Learning
    Mete, Akshay
    Singh, Rahul
    Kumar, P. R.
    2022 56TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2022, : 107 - 112
  • [47] Aggregation of multiple reinforcement learning algorithms
    Jiang, Ju
    Kamel, Mohamed S.
    Chen, Lei
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2006, 15 (05) : 855 - 861
  • [48] Reinforcement learning using multiple actions
    Nakama, Hayato
    Asano, Tsubasa
    Yamada, Satoshi
    NEUROSCIENCE RESEARCH, 2010, 68 : E330 - E330
  • [49] Reinforcement Learning for Control with Multiple Frequencies
    Lee, Jongmin
    Lee, Byung-Jun
    Kim, Kee-Eung
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [50] Offline Reinforcement Learning at Multiple Frequencies
    Burns, Kaylee
    Yu, Tianhe
    Finn, Chelsea
    Hausman, Karol
    CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 2041 - 2051