A memory-based reinforcement learning model utilizing macro-actions

被引:2
|
作者
Murata, M [1 ]
Ozawa, S [1 ]
机构
[1] Kobe Univ, Grad Sch Sci & Technol, Kobe, Hyogo, Japan
关键词
D O I
10.1007/3-211-27389-1_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the difficulties in reinforcement learning (RL) is that an optimal policy is acquired through enormous trials. As a solution to reduce waste explorations in learning, recently the exploitation of macro-actions has been focused. In this paper, we propose a memory-based reinforcement learning model in which macro-actions are generated and exploited effectively. Through the experiments for two standard tasks, we confirmed that our proposed method could decrease waste explorations especially in the early training stage. This property contributes to enhancing training efficiency in RL tasks.
引用
收藏
页码:78 / 81
页数:4
相关论文
共 50 条
  • [1] Learning macro-actions in reinforcement learning
    Randlov, J
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 11, 1999, 11 : 1045 - 1051
  • [2] An acquiring method of macro-actions in reinforcement learning
    Yoshikawa, Takeshi
    Kurihara, Masahito
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-6, PROCEEDINGS, 2006, : 4813 - +
  • [3] Automatic construction and evaluation of macro-actions in reinforcement learning
    Farahani, Marzieh Davoodabadi
    Mozayani, Nasser
    [J]. APPLIED SOFT COMPUTING, 2019, 82
  • [4] Automatic generation of macro-actions using genetic algorithm for reinforcement learning
    Tateyama, T
    Kawata, S
    Oguchi, T
    [J]. SICE 2002: PROCEEDINGS OF THE 41ST SICE ANNUAL CONFERENCE, VOLS 1-5, 2002, : 286 - 289
  • [5] Multi-Agent/Robot Deep Reinforcement Learning with Macro-Actions
    Xiao, Yuchen
    Hoffman, Joshua
    Xia, Tian
    Amato, Christopher
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13965 - 13966
  • [6] A Method for Learning Macro-Actions for Virtual Characters Using Programming by Demonstration and Reinforcement Learning
    Sung, Yunsick
    Cho, Kyungeun
    [J]. JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2012, 8 (03): : 409 - 420
  • [7] A Reinforcement Learning Model Using Macro-actions in Multi-task Grid-World Problems
    Onda, Hiroshi
    Ozawa, Seiichi
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 3088 - 3093
  • [8] Strategic Attentive Writer for Learning Macro-Actions
    Vezhnevets, Alexander
    Mnih, Volodymyr
    Agapiou, John
    Osindero, Simon
    Graves, Alex
    Vinyals, Oriol
    Kavukcuoglu, Koray
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [9] Implicit Learning of Compiled Macro-Actions for Planning
    Newton, M. A. Hakim
    Levine, John
    [J]. ECAI 2010 - 19TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2010, 215 : 323 - 328
  • [10] Memory-Based Explainable Reinforcement Learning
    Cruz, Francisco
    Dazeley, Richard
    Vamplew, Peter
    [J]. AI 2019: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 11919 : 66 - 77