An acquiring method of macro-actions in reinforcement learning

被引:4
|
作者
Yoshikawa, Takeshi [1 ]
Kurihara, Masahito [1 ]
机构
[1] Hokkaido Univ, Grad Sch Informat Sci & Technol, Kita Ku, Sapporo, Hokkaido 0600814, Japan
关键词
D O I
10.1109/ICSMC.2006.385067
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning is a framing of enabling agents to learn from interaction with environments. It has focused generally on Markov decision process (MDP) domains, but a domain may be non-Markovian in the real world. In this paper, we introduce a new description of macro-actions with tree structure in reinforcement learning. The macro-action is an action control structure which provides an agent with control which applies a collection of related microscopic actions as a single action unit. And we propose a simple method for dynamically acquiring macro-actions from the experiences of agents during reinforcement learning process.
引用
收藏
页码:4813 / +
页数:2
相关论文
共 50 条
  • [1] Learning macro-actions in reinforcement learning
    Randlov, J
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 11, 1999, 11 : 1045 - 1051
  • [2] Automatic construction and evaluation of macro-actions in reinforcement learning
    Farahani, Marzieh Davoodabadi
    Mozayani, Nasser
    [J]. APPLIED SOFT COMPUTING, 2019, 82
  • [3] A Method for Learning Macro-Actions for Virtual Characters Using Programming by Demonstration and Reinforcement Learning
    Sung, Yunsick
    Cho, Kyungeun
    [J]. JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2012, 8 (03): : 409 - 420
  • [4] Multi-Agent/Robot Deep Reinforcement Learning with Macro-Actions
    Xiao, Yuchen
    Hoffman, Joshua
    Xia, Tian
    Amato, Christopher
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13965 - 13966
  • [5] Automatic generation of macro-actions using genetic algorithm for reinforcement learning
    Tateyama, T
    Kawata, S
    Oguchi, T
    [J]. SICE 2002: PROCEEDINGS OF THE 41ST SICE ANNUAL CONFERENCE, VOLS 1-5, 2002, : 286 - 289
  • [6] A memory-based reinforcement learning model utilizing macro-actions
    Murata, M
    Ozawa, S
    [J]. ADAPTIVE AND NATURAL COMPUTING ALGORITHMS, 2005, : 78 - 81
  • [7] Strategic Attentive Writer for Learning Macro-Actions
    Vezhnevets, Alexander
    Mnih, Volodymyr
    Agapiou, John
    Osindero, Simon
    Graves, Alex
    Vinyals, Oriol
    Kavukcuoglu, Koray
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [8] Implicit Learning of Compiled Macro-Actions for Planning
    Newton, M. A. Hakim
    Levine, John
    [J]. ECAI 2010 - 19TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2010, 215 : 323 - 328
  • [9] MAGIC: Learning Macro-Actions for Online POMDP Planning
    Lee, Yiyuan
    Cai, Panpan
    Hsu, David
    [J]. ROBOTICS: SCIENCE AND SYSTEM XVII, 2021,
  • [10] A Reinforcement Learning Model Using Macro-actions in Multi-task Grid-World Problems
    Onda, Hiroshi
    Ozawa, Seiichi
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 3088 - 3093