A Hierarchical Framework for Quadruped Omnidirectional Locomotion Based on Reinforcement Learning

被引：2

作者：

Tan, Wenhao ^{[1
]}

Fang, Xing ^{[1
]}

Zhang, Wei ^{[1
]}

Song, Ran ^{[1
]}

Chen, Teng ^{[1
]}

Zheng, Yu ^{[2
]}

Li, Yibin ^{[1
]}

机构：

[1] Shandong Univ, Sch Control Sci & Engn, Jinan 250100, Peoples R China

[2] Tencent Robot X Lab, Shenzhen 518100, Peoples R China

来源：

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING | 2024年 / 21卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Quadruped robot; reinforcement learning; trajectory generator; robotic manipulation; MODEL;

D O I：

10.1109/TASE.2023.3310945

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Quadruped locomotion is challenging for many learning-based algorithms. This is because it requires tedious manual tuning to cope with different types of terrains and is difficult to deploy in reality due to the sim-to-real gap between the training and the testing scenarios. This paper proposes a quadruped robot learning system for agile locomotion which does not require any pre-training and works well in various terrains. We introduce a hierarchical framework that uses reinforcement learning as the high-level policy to adjust the low-level trajectory generator for a better adaptability to various terrains. We compact the observation and the action spaces of reinforcement learning to deploy the proposed framework on a host computer interfaced with the robot. Besides, we design an omnidirectional trajectory generator guided by robot posture, which generates omnidirectional foot trajectories to interact with the environment. Experimental results and the supplementary video demonstrate that our hierarchical framework only trained in simulation can be easily deployed in the real world, and also has the advantages of fast convergence and good terrain adaptability. Note to Practitioners-This paper presents a hierarchical framework for quadruped robots. It combines a high-level reinforcement learning controller with a posture-guided trajectory generator to adaptively generate omnidirectional motions. Our method is easy to train as it converges fast and does not need to adjust a dozen or so of rewards. The quadruped robot can be deployed in a real environment directly after being trained in simulation. With the trained hierarchical framework deployed on a remote host computer, the robot works well in a variety of real-world environments unseen in the simulation.

引用

页码：5367 / 5378

页数：12

共 50 条

[1] A Hierarchical Framework for Quadruped Locomotion Based on Reinforcement Learning
Tan, Wenhao
Fang, Xing
Zhang, Wei
Song, Ran
Chen, Teng
Zheng, Yu
Li, Yibin
2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 8462 - 8468
[2] Hierarchical Reinforcement Learning for Quadruped Locomotion
Jain, Deepali
Iscen, Atil
Caluwaerts, Ken
2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 7551 - 7557
[3] Reinforcement Learning for Quadruped Locomotion
Zhao, Kangqiao
Lin, Feng
Seah, Hock Soon
ADVANCES IN COMPUTER GRAPHICS, CGI 2021, 2021, 13002 : 167 - 177
[4] Learning Smooth and Omnidirectional Locomotion for Quadruped Robots
Wu, Jiaxi
Wang, Chen'an
Zhang, Dianmin
Zhong, Shanlin
Wang, Boxing
Qiao, Hong
2021 6TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2021), 2021, : 633 - 638
[5] Continual Reinforcement Learning for Quadruped Robot Locomotion
Gai, Sibo
Lyu, Shangke
Zhang, Hongyin
Wang, Donglin
ENTROPY, 2024, 26 (01)
[6] CARL: Controllable Agent with Reinforcement Learning for Quadruped Locomotion
Luo, Ying-Sheng
Soeseno, Jonathan Hans
Chen, Trista Pei-Chun
Chen, Wei-Chao
ACM TRANSACTIONS ON GRAPHICS, 2020, 39 (04):
[7] Omnidirectional Locomotion in a Quadruped Robot: A CPG-based Approach
Matos, Vitor
Santos, Cristina P.
IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010, : 3392 - 3397
[8] Hierarchical reinforcement learning for biped locomotion
Sugimoto, Norikazu
Hyon, Sang-Ho
Morimoto, Jun
NEUROSCIENCE RESEARCH, 2009, 65 : S183 - S183
[9] CPG modulation for navigation and omnidirectional quadruped locomotion
Santos, Cristina P.
Matos, Vitor
ROBOTICS AND AUTONOMOUS SYSTEMS, 2012, 60 (06) : 912 - 927
[10] Towards Fault-tolerant Quadruped Locomotion with Reinforcement Learning
Liu, Dikai
Yin, Jianxiong
See, Simon
2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 1438 - 1441

← 1 2 3 4 5 →