Natural policy gradient reinforcement learning for a CPG control of a biped robot

被引：0

作者：

Nakamura, Y ^{[1
]}

Mori, T ^{[1
]}

Ishii, S ^{[1
]}

机构：

[1] Nara Inst Sci & Technol, Nara 63001, Japan

来源：

PARALLEL PROBLEM SOLVING FROM NATURE - PPSN VIII | 2004年 / 3242卷

关键词：

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Motivated by the perspective that animals' rhythmic movements such as locomotion are controlled by neural circuits called central pattern generators (CPGs), motor control mechanisms by CPG have been studied. As an autonomous learning framework for a CPG controller, we previously proposed a reinforcement learning (RL) method called the CPG-actor-critic method. In this article, we propose a natural policy gradient learning algorithm for the CPG-actor-critic method, and applied our RL to an automatic control problem by a biped robot simulator. Computer simulations show that our RL makes the biped robot walk stably on various terrain.

引用

页码：972 / 981

页数：10

共 50 条

[41] Curvilinear Bipedal Walk Learning in Nao Humanoid Robot using a CPG Based Policy Gradient Method
Shahbazi, Hamed
Jamshidi, Kemal
Monadjemi, Amir Hasan
MECHANICAL AND AEROSPACE ENGINEERING, PTS 1-7, 2012, 110-116 : 5161 - 5166
[42] A Stochastic Policy Gradient Based Adaptive Control for Biped Walking
Song, Sumian
Yan, Gangfeng
Tang, Chong
Wang, Zidong
Lin, Zhiyun
2015 34TH CHINESE CONTROL CONFERENCE (CCC), 2015, : 3224 - 3229
[43] Online Control for Biped Robot with Incremental Learning Mechanism
Yang, Liang
Lai, Guanyu
Chen, Yong
Guo, Zhihui
APPLIED SCIENCES-BASEL, 2021, 11 (18):
[44] A modification of gradient policy in reinforcement learning procedure
Abas, Marcel
Skripcak, Tomas
2012 15TH INTERNATIONAL CONFERENCE ON INTERACTIVE COLLABORATIVE LEARNING (ICL), 2012,
[45] Policy Gradient Method For Robust Reinforcement Learning
Wang, Yue
Zou, Shaofeng
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[46] Reinforcement Learning to Rank with Pairwise Policy Gradient
Xu, Jun
Wei, Zeng
Xia, Long
Lan, Yanyan
Yin, Dawei
Cheng, Xueqi
Wen, Ji-Rong
PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 509 - 518
[47] Scalable Multitask Policy Gradient Reinforcement Learning
El Bsat, Salam
Ammar, Haitham Bou
Taylor, Matthew E.
THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1847 - 1853
[48] Active structural control framework using policy-gradient reinforcement learning
Eshkevari, Soheila Sadeghi
Eshkevari, Soheil Sadeghi
Sen, Debarshi
Pakzad, Shamim N.
ENGINEERING STRUCTURES, 2022, 274
[49] Continuous Parameter Control in Genetic Algorithms using Policy Gradient Reinforcement Learning
de Miguel Gomez, Alejandro
Toosi, Farshad Ghassemi
PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTATIONAL INTELLIGENCE (IJCCI), 2021, : 115 - 122
[50] Control Randomisation Approach for Policy Gradient and Application to Reinforcement Learning in Optimal Switching
Denkert, Robert
Pham, Huyên
Warin, Xavier
Applied Mathematics and Optimization, 2025, 91 (01):

← 1 2 3 4 5 →