Real-Time Trajectory Adaptation for Quadrupedal Locomotion using Deep Reinforcement Learning

Cited by: 11

Authors:

Gangapurwala, Siddhant [1]

Geisert, Mathieu [1]

Orsolino, Romeo [1]

Fallon, Maurice [1]

Havoutis, Ioannis [1]

Affiliations:

[1] University of Oxford, Oxford Robotics Institute, Dynamic Robot Systems (DRS) Group, Oxford, England

Source:

2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021) | 2021

Funding:

European Union Horizon 2020; UK Engineering and Physical Sciences Research Council (EPSRC);

Keywords:

DOI:

10.1109/ICRA48506.2021.9561639

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

We present a control architecture for real-time adaptation and tracking of trajectories generated using a terrain-aware trajectory optimization solver. This approach enables us to circumvent the computationally exhaustive task of online trajectory optimization, and further introduces a control solution robust to systems modeled with approximated dynamics. We train a policy using deep reinforcement learning (RL) to introduce additive deviations to a reference trajectory in order to generate a feedback-based trajectory tracking system for a quadrupedal robot. We train this policy across a multitude of simulated terrains and ensure its generality by introducing training methods that avoid overfitting and convergence towards local optima. Additionally, in order to capture terrain information, we include a latent representation of the height maps in the observation space of the RL environment as a form of exteroceptive feedback. We test the performance of our trained policy by tracking the corrected set points using a model-based whole-body controller and compare it with the tracking behavior obtained without the corrective feedback in several simulation environments. We show that introducing the corrective feedback increases the success rate from 72.7% to 92.4% for tracking precomputed dynamic long-horizon trajectories on flat terrain, and from 47.5% to 80.3% on a complex modular uneven terrain. We also show successful transfer of our training approach to the real physical system and further present cogent arguments in support of our framework.
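
The additive-correction idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the vector layout, the clipping bound `max_deviation`, and the placeholder policy are all assumptions made for illustration. The sketch only shows the general pattern of concatenating proprioceptive state with a terrain latent to form an observation, querying a policy for a deviation, and adding that deviation to a reference set point before handing it to a tracking controller.

```python
from typing import Callable, List, Sequence

Vector = Sequence[float]


def build_observation(proprioception: Vector, terrain_latent: Vector) -> List[float]:
    """Concatenate robot state with a latent encoding of the height map.

    Both inputs are hypothetical; the abstract only states that a latent
    representation of the height maps is part of the observation space.
    """
    return list(proprioception) + list(terrain_latent)


def corrected_set_point(
    reference: Vector,
    observation: Vector,
    policy: Callable[[Vector], Vector],
    max_deviation: float = 0.1,
) -> List[float]:
    """Return reference + policy deviation, element by element.

    The clipping to +/- max_deviation is an assumption added here to keep
    the correction bounded; the abstract does not specify such a limit.
    """
    deviation = policy(observation)
    if len(deviation) != len(reference):
        raise ValueError("policy output and reference must have equal length")
    clipped = [max(-max_deviation, min(max_deviation, d)) for d in deviation]
    return [r + d for r, d in zip(reference, clipped)]


def placeholder_policy(observation: Vector) -> List[float]:
    """Stand-in for a trained RL policy; returns a fixed deviation."""
    return [0.02, -0.5, 0.0]


if __name__ == "__main__":
    obs = build_observation([0.0, 0.1, 0.2], [0.3, 0.4])
    target = corrected_set_point([0.0, 0.0, 0.5], obs, placeholder_policy)
    print(target)
```

In a setup like the one the abstract describes, the corrected set point would then be tracked by a model-based whole-body controller; that controller is outside the scope of this sketch.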


Pages: 5973-5979

Page count: 7
