A Heuristics-Based Reinforcement Learning Method to Control Bipedal Robots

被引:2
|
作者
Qin, Daoling [1 ]
Zhang, Guoteng [1 ]
Zhu, Zhengguo [1 ]
Chen, Teng [1 ]
Zhu, Weiliang [1 ]
Rong, Xuewen [1 ]
Xie, Anhuan [2 ]
Li, Yibin [1 ]
机构
[1] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Peoples R China
[2] Zhejiang Lab, Intelligent Robot Res Ctr, Hangzhou 311121, Peoples R China
基金
中国国家自然科学基金;
关键词
Legged robots; bipedal locomotion; reinforcement learning;
D O I
10.1142/S0219843623500135
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
A new method is proposed to control bipedal robots to achieve flexible omni-directional motion and robust locomotion under complex disturbances, called the heuristics-based reinforcement learning (HBRL) framework. HBRL shows great training efficiency in simulation. Heuristic reference trajectories play a crucial role in HBRL, which guide the training process. Exploration rewards, leg-foot reset condition, and command curriculum are three significant components to optimize the training process. An estimator network is utilized to supply linear velocities and foot contact information. We train controllers on flat ground in simulation. To demonstrate robustness and versatility, the trained controllers were tested on BRAVER, a point-foot bipedal robot with three joints on each leg. The controllers enabled BRAVER to perform omni-directional locomotion with the maximum forward speed reaching 2m/s. The robot could also maintain balance under external pushing and over uneven terrains.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Hybrid Bipedal Locomotion Based on Reinforcement Learning and Heuristics
    Wang, Zhicheng
    Wei, Wandi
    Xie, Anhuan
    Zhang, Yifeng
    Wu, Jun
    Zhu, Qiuguo
    [J]. MICROMACHINES, 2022, 13 (10)
  • [2] Optimal Balancing Control of Bipedal Robots Using Reinforcement Learning
    Peng, Fang
    Ding, Lijia
    Li, Zhijun
    Yang, Chenguang
    Su, Chun-Yi
    [J]. PROCEEDINGS OF THE 2016 12TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2016, : 2186 - 2191
  • [3] Reinforcement Learning for Robust Parameterized Locomotion Control of Bipedal Robots
    Li, Zhongyu
    Cheng, Xuxin
    Peng, Xue Bin
    Abbeel, Pieter
    Levine, Sergey
    Berseth, Glen
    Sreenath, Koushil
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 2811 - 2817
  • [4] A Multi-Agent Reinforcement Learning Method for Omnidirectional Walking of Bipedal Robots
    Mou, Haiming
    Xue, Jie
    Liu, Jian
    Feng, Zhen
    Li, Qingdu
    Zhang, Jianwei
    [J]. BIOMIMETICS, 2023, 8 (08)
  • [5] A Behavior-Based Reinforcement Learning Approach to Control Walking Bipedal Robots Under Unknown Disturbances
    Beranek, Richard
    Karimi, Masoud
    Ahmadi, Mojtaba
    [J]. IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2022, 27 (05) : 2710 - 2720
  • [6] Heuristics-based parallelization of a class of control systems
    Rabbath, CA
    Lechevin, N
    El Cadi, AA
    Lapierre, SD
    Abdoune, M
    Crainic, T
    [J]. PROCEEDINGS OF THE 2003 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2003, : 2347 - 2352
  • [7] Heuristics-based learning approach for choquistic regression models
    Tehrani, Ali Fallah
    [J]. PATTERN RECOGNITION LETTERS, 2021, 149 : 137 - 142
  • [8] Heuristics-based design process
    Calle-Escobar, Manuela
    Mejia-Gutierrez, Ricardo
    Nadeau, Jean-Pierre
    Pailhes, Jerome
    [J]. INTERNATIONAL JOURNAL OF INTERACTIVE DESIGN AND MANUFACTURING - IJIDEM, 2016, 10 (04): : 369 - 386
  • [9] Heuristics-based PON deployment
    Khan, SU
    [J]. IEEE COMMUNICATIONS LETTERS, 2005, 9 (09) : 847 - 849
  • [10] A Collaborative Control Method of Dual-Arm Robots Based on Deep Reinforcement Learning
    Liu, Luyu
    Liu, Qianyuan
    Song, Yong
    Pang, Bao
    Yuan, Xianfeng
    Xu, Qingyang
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (04): : 1 - 16