A Hierarchical Framework for Quadruped Omnidirectional Locomotion Based on Reinforcement Learning

被引:2
|
作者
Tan, Wenhao [1 ]
Fang, Xing [1 ]
Zhang, Wei [1 ]
Song, Ran [1 ]
Chen, Teng [1 ]
Zheng, Yu [2 ]
Li, Yibin [1 ]
机构
[1] Shandong Univ, Sch Control Sci & Engn, Jinan 250100, Peoples R China
[2] Tencent Robot X Lab, Shenzhen 518100, Peoples R China
基金
中国国家自然科学基金;
关键词
Quadruped robot; reinforcement learning; trajectory generator; robotic manipulation; MODEL;
D O I
10.1109/TASE.2023.3310945
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Quadruped locomotion is challenging for many learning-based algorithms. This is because it requires tedious manual tuning to cope with different types of terrains and is difficult to deploy in reality due to the sim-to-real gap between the training and the testing scenarios. This paper proposes a quadruped robot learning system for agile locomotion which does not require any pre-training and works well in various terrains. We introduce a hierarchical framework that uses reinforcement learning as the high-level policy to adjust the low-level trajectory generator for a better adaptability to various terrains. We compact the observation and the action spaces of reinforcement learning to deploy the proposed framework on a host computer interfaced with the robot. Besides, we design an omnidirectional trajectory generator guided by robot posture, which generates omnidirectional foot trajectories to interact with the environment. Experimental results and the supplementary video demonstrate that our hierarchical framework only trained in simulation can be easily deployed in the real world, and also has the advantages of fast convergence and good terrain adaptability. Note to Practitioners-This paper presents a hierarchical framework for quadruped robots. It combines a high-level reinforcement learning controller with a posture-guided trajectory generator to adaptively generate omnidirectional motions. Our method is easy to train as it converges fast and does not need to adjust a dozen or so of rewards. The quadruped robot can be deployed in a real environment directly after being trained in simulation. With the trained hierarchical framework deployed on a remote host computer, the robot works well in a variety of real-world environments unseen in the simulation.
引用
收藏
页码:5367 / 5378
页数:12
相关论文
共 50 条
  • [1] A Hierarchical Framework for Quadruped Locomotion Based on Reinforcement Learning
    Tan, Wenhao
    Fang, Xing
    Zhang, Wei
    Song, Ran
    Chen, Teng
    Zheng, Yu
    Li, Yibin
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 8462 - 8468
  • [2] Hierarchical Reinforcement Learning for Quadruped Locomotion
    Jain, Deepali
    Iscen, Atil
    Caluwaerts, Ken
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 7551 - 7557
  • [3] Reinforcement Learning for Quadruped Locomotion
    Zhao, Kangqiao
    Lin, Feng
    Seah, Hock Soon
    ADVANCES IN COMPUTER GRAPHICS, CGI 2021, 2021, 13002 : 167 - 177
  • [4] Learning Smooth and Omnidirectional Locomotion for Quadruped Robots
    Wu, Jiaxi
    Wang, Chen'an
    Zhang, Dianmin
    Zhong, Shanlin
    Wang, Boxing
    Qiao, Hong
    2021 6TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2021), 2021, : 633 - 638
  • [5] Continual Reinforcement Learning for Quadruped Robot Locomotion
    Gai, Sibo
    Lyu, Shangke
    Zhang, Hongyin
    Wang, Donglin
    ENTROPY, 2024, 26 (01)
  • [6] CARL: Controllable Agent with Reinforcement Learning for Quadruped Locomotion
    Luo, Ying-Sheng
    Soeseno, Jonathan Hans
    Chen, Trista Pei-Chun
    Chen, Wei-Chao
    ACM TRANSACTIONS ON GRAPHICS, 2020, 39 (04):
  • [7] Omnidirectional Locomotion in a Quadruped Robot: A CPG-based Approach
    Matos, Vitor
    Santos, Cristina P.
    IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010, : 3392 - 3397
  • [8] Hierarchical reinforcement learning for biped locomotion
    Sugimoto, Norikazu
    Hyon, Sang-Ho
    Morimoto, Jun
    NEUROSCIENCE RESEARCH, 2009, 65 : S183 - S183
  • [9] CPG modulation for navigation and omnidirectional quadruped locomotion
    Santos, Cristina P.
    Matos, Vitor
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2012, 60 (06) : 912 - 927
  • [10] Towards Fault-tolerant Quadruped Locomotion with Reinforcement Learning
    Liu, Dikai
    Yin, Jianxiong
    See, Simon
    2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 1438 - 1441