A Hierarchical Framework for Quadruped Omnidirectional Locomotion Based on Reinforcement Learning

被引:2
|
作者
Tan, Wenhao [1 ]
Fang, Xing [1 ]
Zhang, Wei [1 ]
Song, Ran [1 ]
Chen, Teng [1 ]
Zheng, Yu [2 ]
Li, Yibin [1 ]
机构
[1] Shandong Univ, Sch Control Sci & Engn, Jinan 250100, Peoples R China
[2] Tencent Robot X Lab, Shenzhen 518100, Peoples R China
基金
中国国家自然科学基金;
关键词
Quadruped robot; reinforcement learning; trajectory generator; robotic manipulation; MODEL;
D O I
10.1109/TASE.2023.3310945
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Quadruped locomotion is challenging for many learning-based algorithms. This is because it requires tedious manual tuning to cope with different types of terrains and is difficult to deploy in reality due to the sim-to-real gap between the training and the testing scenarios. This paper proposes a quadruped robot learning system for agile locomotion which does not require any pre-training and works well in various terrains. We introduce a hierarchical framework that uses reinforcement learning as the high-level policy to adjust the low-level trajectory generator for a better adaptability to various terrains. We compact the observation and the action spaces of reinforcement learning to deploy the proposed framework on a host computer interfaced with the robot. Besides, we design an omnidirectional trajectory generator guided by robot posture, which generates omnidirectional foot trajectories to interact with the environment. Experimental results and the supplementary video demonstrate that our hierarchical framework only trained in simulation can be easily deployed in the real world, and also has the advantages of fast convergence and good terrain adaptability. Note to Practitioners-This paper presents a hierarchical framework for quadruped robots. It combines a high-level reinforcement learning controller with a posture-guided trajectory generator to adaptively generate omnidirectional motions. Our method is easy to train as it converges fast and does not need to adjust a dozen or so of rewards. The quadruped robot can be deployed in a real environment directly after being trained in simulation. With the trained hierarchical framework deployed on a remote host computer, the robot works well in a variety of real-world environments unseen in the simulation.
引用
收藏
页码:5367 / 5378
页数:12
相关论文
共 50 条
  • [41] A Hierarchical Reinforcement Learning Framework for Stock Selection and Portfolio
    Zha, Lijun
    Dai, Le
    Xu, Tong
    Wu, Di
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [42] A strategy for push recovery in quadruped Robot based on reinforcement Learning
    Chen, Yang-zhen
    Hou, Wen-Qi
    Wang, Jian
    Wang, Jian-Wen
    Ma, Hong-xu
    2015 34TH CHINESE CONTROL CONFERENCE (CCC), 2015, : 3145 - 3151
  • [43] SYNLOCO-VE: Synthesizing central pattern generator with reinforcement learning and velocity estimator for quadruped locomotion
    Zhang, Xinyu
    Xiao, Zhiyuan
    Zhou, Xiang
    Zhang, Qingrui
    OPTIMAL CONTROL APPLICATIONS & METHODS, 2025, 46 (02): : 493 - 511
  • [44] CPG-Based Hierarchical Locomotion Control for Modular Quadrupedal Robots Using Deep Reinforcement Learning
    Wang, Jiayu
    Hu, Chuxiong
    Zhu, Yu
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04) : 7193 - 7200
  • [45] Learning to Interrupt: A Hierarchical Deep Reinforcement Learning Framework for Efficient Exploration
    Li, Tingguang
    Pan, Jin
    Zhu, Delong
    Meng, Max Q. -H.
    2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2018, : 648 - 653
  • [46] Learning-Based Model Predictive Control for Quadruped Locomotion on Slippery Ground
    Zhang, Zhitong
    An, Honglei
    Wei, Qing
    Ma, Hongxu
    2022 4TH INTERNATIONAL CONFERENCE ON CONTROL AND ROBOTICS, ICCR, 2022, : 47 - 52
  • [47] Hybrid Bipedal Locomotion Based on Reinforcement Learning and Heuristics
    Wang, Zhicheng
    Wei, Wandi
    Xie, Anhuan
    Zhang, Yifeng
    Wu, Jun
    Zhu, Qiuguo
    MICROMACHINES, 2022, 13 (10)
  • [48] Hierarchical Deep Reinforcement Learning-Based Propofol Infusion Assistant Framework in Anesthesia
    Yun, Won Joon
    Shin, MyungJae
    Mohaisen, David
    Lee, Kangwook
    Kim, Joongheon
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (02) : 2510 - 2521
  • [49] Hierarchical framework for interpretable and specialized deep reinforcement learning-based predictive maintenance
    Abbas, Ammar N.
    Chasparis, Georgios C.
    Kelleher, John D.
    DATA & KNOWLEDGE ENGINEERING, 2024, 149
  • [50] Learning Agile, Robust Locomotion Skills for Quadruped Robot
    Zhang, Zhitong
    Wei, Qing
    Chang, Xu
    Lang, Ling
    Ma, Hongxu
    An, Honglei
    2022 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2022), 2022, : 1011 - 1016