Natural policy gradient reinforcement learning for a CPG control of a biped robot

Cited by: 0
Authors
Nakamura, Y [1]
Mori, T [1]
Ishii, S [1]
Affiliation
[1] Nara Inst Sci & Technol, Nara 63001, Japan
Keywords
DOI
Not available
Chinese Library Classification number
TP301 [Theory, methods];
Discipline classification code
081202;
Abstract
Motivated by the view that animals' rhythmic movements, such as locomotion, are controlled by neural circuits called central pattern generators (CPGs), researchers have studied CPG-based motor control mechanisms. As an autonomous learning framework for a CPG controller, we previously proposed a reinforcement learning (RL) method called the CPG-actor-critic method. In this article, we propose a natural policy gradient learning algorithm for the CPG-actor-critic method and apply it to an automatic control problem using a biped robot simulator. Computer simulations show that our RL method enables the biped robot to walk stably on various terrains.
Pages: 972-981
Number of pages: 10