Reinforcement learning of dynamic motor sequence: Learning to stand up

被引:0
|
作者
Morimoto, J [1 ]
Doya, K [1 ]
机构
[1] Nara Inst Sci & Technol, Grad Sch Informat Sci, ATR, Human Informat Proc Lab, Nara 6300101, Japan
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a learning method for implementing human-like sequential movements in robots. As an example of dynamic sequential movement, we consider the "stand-up" task for a two-joint, three-link: robot. In contrast to the case of steady walking or standing, the desired trajectory for such a transient behavior is very difficult to derive. The goal of the task is to find a path that links a lying state to an upright state under the constraints of the system dynamics. The geometry of the robot is such that there is no static solution; the robot has to stand up dynamically utilizing the momentum of its body. We use reinforcement learning, in particular, a continuous time and state temporal difference (TD) learning method. For successful results, we use 1) an efficient method of value function approximation in a high-dimensional state space, and 2) a hierarchical architecture which divides a large state space into a few smaller pieces.
引用
收藏
页码:1721 / 1726
页数:6
相关论文
共 50 条
  • [41] Dynamic cortical involvement in implicit and explicit motor sequence learning -: A PET study
    Honda, M
    Deiber, MP
    Ibáñez, V
    Pascual-Leone, A
    Zhuang, P
    Hallett, M
    BRAIN, 1998, 121 : 2159 - 2173
  • [42] Interactions between motor exploration and reinforcement learning
    Uehara, Shintaro
    Mawase, Firas
    Therrien, Amanda S.
    Cherry-Allen, Kendra M.
    Celnik, Pablo
    JOURNAL OF NEUROPHYSIOLOGY, 2019, 122 (02) : 797 - 808
  • [43] Reinforcement learning of motor skills with policy gradients
    Peters, Jan
    Schaal, Stefan
    NEURAL NETWORKS, 2008, 21 (04) : 682 - 697
  • [44] Reinforcement Learning in Latent Action Sequence Space
    Kim, Heecheol
    Yamada, Masanori
    Miyoshi, Kosuke
    Iwata, Tomoharu
    Yamakawa, Hiroshi
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5497 - 5503
  • [45] Improving reinforcement learning by using sequence trees
    Girgin, Sertan
    Polat, Faruk
    Alhajj, Reda
    MACHINE LEARNING, 2010, 81 (03) : 283 - 331
  • [46] Sequence labeling with reinforcement learning and ranking algorithms
    Maes, Francis
    Denoyer, Ludovic
    Gallinari, Patrick
    MACHINE LEARNING: ECML 2007, PROCEEDINGS, 2007, 4701 : 648 - +
  • [47] Reinforcement learning for disassembly sequence planning optimization
    Allagui, Amal
    Belhadj, Imen
    Plateaux, Regis
    Hammadi, Moncef
    Penas, Olivia
    Aifaoui, Nizar
    COMPUTERS IN INDUSTRY, 2023, 151
  • [48] Improving reinforcement learning by using sequence trees
    Sertan Girgin
    Faruk Polat
    Reda Alhajj
    Machine Learning, 2010, 81 : 283 - 331
  • [49] Reinforcement Learning for on-line Sequence Transformation
    Rypesc, Grzegorz
    Lepak, Lukasz
    Wawrzynski, Pawel
    PROCEEDINGS OF THE 2022 17TH CONFERENCE ON COMPUTER SCIENCE AND INTELLIGENCE SYSTEMS (FEDCSIS), 2022, : 133 - 139
  • [50] AlphaSeq: Sequence Discovery With Deep Reinforcement Learning
    Shao, Yulin
    Liew, Soung Chang
    Wang, Taotao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (09) : 3319 - 3333