Learning Torque Control for Quadrupedal Locomotion

被引:0
|
作者
Chen, Shuxiao [1 ]
Zhang, Bike [1 ,2 ]
Mueller, Mark W. [1 ]
Rai, Akshara [2 ]
Sreenath, Koushil [1 ]
机构
[1] Univ Calif Berkeley, Dept Mech Engn, Berkeley, CA 94720 USA
[2] Meta AI, Menlo Pk, CA 94025 USA
关键词
DYNAMICS;
D O I
10.1109/HUMANOIDS57100.2023.10375154
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning (RL) has become a promising approach to developing controllers for quadrupedal robots. Conventionally, an RL design for locomotion follows a position-based paradigm, wherein an RL policy outputs target joint positions at a low frequency that are then tracked by a high-frequency proportional-derivative (PD) controller to produce joint torques. In contrast, for the model-based control of quadrupedal locomotion, there has been a paradigm shift from position-based control to torque-based control. In light of the recent advances in model-based control, we explore an alternative to the position-based RL paradigm, by introducing a torque-based RL framework, where an RL policy directly predicts joint torques at a high frequency, thus circumventing the use of a PD controller. The proposed learning torque control framework is validated with extensive experiments, in which a quadruped is capable of traversing various terrain and resisting external disturbances while following user-specified commands. Furthermore, compared to learning position control, learning torque control demonstrates the potential to achieve a higher reward and is more robust to significant external disturbances. To our knowledge, this is the first sim-to-real attempt for end-to-end learning torque control of quadrupedal locomotion.
引用
下载
收藏
页数:8
相关论文
共 50 条
  • [31] Reference-Free Model Predictive Control for Quadrupedal Locomotion
    Lunardi, Gianni
    Corberes, Thomas
    Mastalli, Carlos
    Mansard, Nicolas
    Flayols, Thomas
    Tonneau, Steve
    Del Prete, Andrea
    IEEE ACCESS, 2024, 12 : 689 - 698
  • [32] Reference-Free Model Predictive Control for Quadrupedal Locomotion
    Lunardi, Gianni
    Corberes, Thomas
    Mastalli, Carlos
    Mansard, Nicolas
    Flayols, Thomas
    Tonneau, Steve
    Prete, Andrea Del
    IEEE Access, 2024, 12 : 689 - 698
  • [33] Decentralized autonomous control of a quadrupedal locomotion robot using oscillators
    Katsuyoshi Tsujita
    Kazuo Tsuchiya
    Ahmet Onat
    Artificial Life and Robotics, 2001, 5 (3) : 152 - 158
  • [34] SPEED CONTROL IN QUADRUPEDAL LOCOMOTION - PRINCIPLES OF LIMB COORDINATION IN THE DOG
    BLASZCZYK, JW
    DOBRZECKA, C
    ACTA NEUROBIOLOGIAE EXPERIMENTALIS, 1989, 49 (2-3) : 105 - 124
  • [35] Robust Footstep Planning and LQR Control for Dynamic Quadrupedal Locomotion
    Xin, Guiyang
    Xin, Songyan
    Cebe, Oguzhan
    Pollayil, Mathew Jose
    Angelini, Franco
    Garabini, Manolo
    Vijayakumar, Sethu
    Mistry, Michael
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (03) : 4488 - 4495
  • [36] THE DEVELOPMENT OF QUADRUPEDAL LOCOMOTION IN THE KITTEN
    HOWLAND, DR
    BREGMAN, BS
    GOLDBERGER, ME
    EXPERIMENTAL NEUROLOGY, 1995, 135 (02) : 93 - 107
  • [37] SayTap: Language to Quadrupedal Locomotion
    Tang, Yujin
    Yu, Wenhao
    Tan, Jie
    Zen, Heiga
    Faust, Aleksandra
    Harada, Tatsuya
    CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [38] Bipedal and quadrupedal locomotion in chimpanzees
    Pontzer, Herman
    Raichlen, David A.
    Rodman, Peter S.
    JOURNAL OF HUMAN EVOLUTION, 2014, 66 : 64 - 82
  • [39] CPG-Based Hierarchical Locomotion Control for Modular Quadrupedal Robots Using Deep Reinforcement Learning
    Wang, Jiayu
    Hu, Chuxiong
    Zhu, Yu
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04) : 7193 - 7200
  • [40] QUADRUPEDAL AND BIPEDAL LOCOMOTION OF LIZARDS
    SNYDER, RC
    COPEIA, 1952, (02) : 64 - 70