Natural policy gradient reinforcement learning for a CPG control of a biped robot

Cited by: 0
Authors
Nakamura, Y [1]
Mori, T [1]
Ishii, S [1]
Affiliations
[1] Nara Inst Sci & Technol, Nara 63001, Japan
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Motivated by the view that animals' rhythmic movements, such as locomotion, are controlled by neural circuits called central pattern generators (CPGs), researchers have studied CPG-based motor control mechanisms. As an autonomous learning framework for a CPG controller, we previously proposed a reinforcement learning (RL) method called the CPG-actor-critic method. In this article, we propose a natural policy gradient learning algorithm for the CPG-actor-critic method and apply it to an automatic control problem using a biped robot simulator. Computer simulations show that our RL method makes the biped robot walk stably on various terrains.
Pages: 972-981
Page count: 10