Reinforcement Learning for Robust Parameterized Locomotion Control of Bipedal Robots

被引:76
|
作者
Li, Zhongyu [1 ]
Cheng, Xuxin [1 ]
Peng, Xue Bin [1 ]
Abbeel, Pieter [1 ]
Levine, Sergey [1 ]
Berseth, Glen [1 ]
Sreenath, Koushil [1 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
基金
美国国家科学基金会; 加拿大自然科学与工程研究理事会;
关键词
CAPTURABILITY-BASED ANALYSIS; LEGGED LOCOMOTION;
D O I
10.1109/ICRA48506.2021.9560769
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Developing robust walking controllers for bipedal robots is a challenging endeavor. Traditional model-based locomotion controllers require simplifying assumptions and careful modelling; any small errors can result in unstable control. To address these challenges for bipedal locomotion, we present a model-free reinforcement learning framework for training robust locomotion policies in simulation, which can then be transferred to a real bipedal Cassie robot. To facilitate sim-toreal transfer, domain randomization is used to encourage the policies to learn behaviors that are robust across variations in system dynamics. The learned policies enable Cassie to perform a set of diverse and dynamic behaviors, while also being more robust than traditional controllers and prior learning-based methods that use residual control. We demonstrate this on versatile walking behaviors such as tracking a target walking velocity, walking height, and turning yaw. (Video(1))
引用
收藏
页码:2811 / 2817
页数:7
相关论文
共 50 条
  • [1] Reinforcement learning for versatile, dynamic, and robust bipedal locomotion control
    Li, Zhongyu
    Peng, Xue Bin
    Abbeel, Pieter
    Levine, Sergey
    Berseth, Glen
    Sreenath, Koushil
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2024,
  • [2] Optimal Balancing Control of Bipedal Robots Using Reinforcement Learning
    Peng, Fang
    Ding, Lijia
    Li, Zhijun
    Yang, Chenguang
    Su, Chun-Yi
    PROCEEDINGS OF THE 2016 12TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2016, : 2186 - 2191
  • [3] A Heuristics-Based Reinforcement Learning Method to Control Bipedal Robots
    Qin, Daoling
    Zhang, Guoteng
    Zhu, Zhengguo
    Chen, Teng
    Zhu, Weiliang
    Rong, Xuewen
    Xie, Anhuan
    Li, Yibin
    INTERNATIONAL JOURNAL OF HUMANOID ROBOTICS, 2024, 21 (03)
  • [4] Computed-Torque Control of a Simulated Bipedal Robot with Locomotion by Reinforcement Learning
    Valle, Carlos Magno C. O.
    Tanscheit, Ricardo
    Forero Mendoza, Leonardo A.
    2016 IEEE LATIN AMERICAN CONFERENCE ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2016,
  • [5] Hybrid Bipedal Locomotion Based on Reinforcement Learning and Heuristics
    Wang, Zhicheng
    Wei, Wandi
    Xie, Anhuan
    Zhang, Yifeng
    Wu, Jun
    Zhu, Qiuguo
    MICROMACHINES, 2022, 13 (10)
  • [6] Robust compound control of dynamic bipedal robots
    Arcos-Legarda, Jaime
    Cortes-Romero, John
    Tovar, Andres
    MECHATRONICS, 2019, 59 : 154 - 167
  • [7] Robust Bipedal Locomotion Based on a Hierarchical Control Structure
    Luo, Jianwen
    Su, Yao
    Ruan, Lecheng
    Zhao, Ye
    Kim, Donghyun
    Sentis, Luis
    Fu, Chenglong
    ROBOTICA, 2019, 37 (10) : 1750 - 1767
  • [8] Comparative Analysis of Reinforcement Learning Algorithms for Bipedal Robot Locomotion
    Aydogmus, Omur
    Yilmaz, Musa
    IEEE ACCESS, 2024, 12 : 7490 - 7499
  • [9] Reinforcement learning of periodical gaits in locomotion robots
    Svinin, M
    Yamada, K
    Ushio, S
    Ueda, K
    SENSOR FUSION AND DECENTRALIZED CONTROL IN ROBOTIC SYSTEMS II, 1999, 3839 : 180 - 190
  • [10] A Behavior Based Locomotion Controller with Learning for Disturbance Compensation in Bipedal Robots
    Beranek, Richard
    Ahmadi, Mojtaba
    2012 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2012, : 1915 - 1920