Reinforcement learning method-based stable gait synthesis for biped robot

Cited by: 0
Authors
Hu, LY [1]
Sun, ZQ [1]
Affiliations
[1] Tsinghua Univ, Comp Sci & Technol Dept, State Key Lab Intelligent Technol & Syst, Beijing 100084, Peoples R China
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper proposes a stable gait generation algorithm based on a T-S (Takagi-Sugeno) type fuzzy learning network. Gait generation is divided into two stages: model construction and error learning. A reference gait model and a dynamic model are first constructed from basic geometric knowledge of the gait. A reinforcement learning method is then introduced into the T-S type fuzzy network to learn the gain parameters for hip trajectory adjustment. Only a few fuzzy rules encoding ZMP (zero moment point) stability knowledge are needed to formulate the nonlinear relation between the ZMP curve and the hip trajectory. The problem of searching for multiple variables in a continuous space is also simplified to searching for independent action gains simultaneously. Simulation results on a biped robot demonstrate the feasibility of the approach.
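As a rough illustration of how a small T-S rule base can map a ZMP error to a hip-adjustment gain, the following Python sketch implements zero-order Takagi-Sugeno inference. It is not the authors' network: the Gaussian membership functions, the three rule centers, the width, and the consequent gains are all hypothetical values chosen for illustration. In the approach the abstract describes, the consequent gains would be tuned by reinforcement learning rather than fixed by hand.

```python
import math

# Hypothetical rule base: (center of the ZMP-error membership in metres,
# consequent gain for hip adjustment). These values are illustrative only.
RULES = [(-0.05, -1.0), (0.0, 0.0), (0.05, 1.0)]
WIDTH = 0.03  # hypothetical width shared by all Gaussian memberships


def gaussian(x, center, width):
    """Gaussian membership degree of x for a fuzzy set centred at `center`."""
    return math.exp(-((x - center) / width) ** 2)


def ts_gain(zmp_error, rules=RULES, width=WIDTH):
    """Zero-order T-S inference.

    Returns the firing-strength-weighted average of the rule consequents.
    """
    weights = [gaussian(zmp_error, center, width) for center, _ in rules]
    total = sum(weights)
    if total == 0.0:
        # All memberships underflowed (error far outside the rule range):
        # fall back to the consequent of the nearest rule.
        _, gain = min(rules, key=lambda rule: abs(zmp_error - rule[0]))
        return gain
    return sum(w * gain for w, (_, gain) in zip(weights, rules)) / total


if __name__ == "__main__":
    for err in (-0.05, -0.02, 0.0, 0.02, 0.05):
        print(f"ZMP error {err:+.2f} m -> hip gain {ts_gain(err):+.3f}")
```

Because the output is a normalized weighted average, the gain always stays within the range of the consequents, which is one reason a handful of rules can be enough to cover the working range of the ZMP error.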
Pages: 1017-1022
Number of pages: 6
Related papers
50 in total
  • [11] A Stable Gait Planning Method of Biped Robot Based on Ankle motion Smooth Fitting
    Dong, En Zeng
    Wang, Dan Dan
    Tong, Ji Gang
    Chen, Chao
    Wang, Zeng Hui
    [J]. INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2018, 16 (01) : 284 - 294
  • [12] A Surrogate Model based Gait Learning for Biped Robot
    Luo, Dingsheng
    Wang, Yi
    Wu, Xihong
    [J]. ADVANCES IN MECHATRONICS AND CONTROL ENGINEERING II, PTS 1-3, 2013, 433-435 : 138 - 145
  • [13] A Stable Gait Planning Method of Biped Robot Based on Ankle motion Smooth Fitting
    En Zeng Dong
    Dan Dan Wang
    Ji Gang Tong
    Chao Chen
    Zeng Hui Wang
    [J]. International Journal of Control, Automation and Systems, 2018, 16 : 284 - 294
  • [14] Reinforcement learning for a biped robot based on a CPG-actor-critic method
    Nakamura, Yutaka
    Mori, Takeshi
    Sato, Masa-Aki
    Ishii, Shin
    [J]. NEURAL NETWORKS, 2007, 20 (06) : 723 - 735
  • [15] A Disturbance Rejection Control Method Based on Deep Reinforcement Learning for a Biped Robot
    Liu, Chuzhao
    Gao, Junyao
    Tian, Dingkui
    Zhang, Xuefeng
    Liu, Huaxin
    Meng, Libo
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (04) : 1 - 17
  • [16] Prescribed synergy method-based hybrid intelligent gait synthesis for biped robot
    Zhou, CJ
    Jagannathan, K
    Myint, T
    [J]. ICRA '99: IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-4, PROCEEDINGS, 1999, : 1384 - 1389
  • [17] Path planning for a statically stable biped robot using PRM and reinforcement learning
    Kulkarni, Prasad
    Goswami, Dip
    Guha, Prithwijit
    Dutta, Ashish
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2006, 47 (03) : 197 - 214
  • [18] Path Planning for a Statically Stable Biped Robot Using PRM and Reinforcement Learning
    Prasad Kulkarni
    Dip Goswami
    Prithwijit Guha
    Ashish Dutta
    [J]. Journal of Intelligent and Robotic Systems, 2006, 47 : 197 - 214
  • [19] Stable Polynomial Gait of a Biped Robot with Toe Joint
    Panwar, Ruchi
    Sukavanam, N.
    [J]. 2017 4TH IEEE UTTAR PRADESH SECTION INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND ELECTRONICS (UPCON), 2017, : 382 - 387
  • [20] Optimal Control for Stable Walking Gait of a Biped Robot
    Nhat Dang Khoa Nguyen
    Ba Long Chu
    Van Tien Anh Nguyen
    Van Hien Nguyen
    Tan Tien Nguyen
    [J]. 2017 14TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAI), 2017, : 309 - 313