Real-Time Trajectory Adaptation for Quadrupedal Locomotion using Deep Reinforcement Learning

被引:11
|
作者
Gangapurwala, Siddhant [1 ]
Geisert, Mathieu [1 ]
Orsolino, Romeo [1 ]
Fallon, Maurice [1 ]
Havoutis, Ioannis [1 ]
机构
[1] Univ Oxford, Oxford Robot Inst, Dynam Robots Syst DRS Grp, Oxford, England
基金
欧盟地平线“2020”; 英国工程与自然科学研究理事会;
关键词
D O I
10.1109/ICRA48506.2021.9561639
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a control architecture for real-time adaptation and tracking of trajectories generated using a terrain-aware trajectory optimization solver. This approach enables us to circumvent the computationally exhaustive task of online trajectory optimization, and further introduces a control solution robust to systems modeled with approximated dynamics. We train a policy using deep reinforcement learning (RL) to introduce additive deviations to a reference trajectory in order to generate a feedback-based trajectory tracking system for a quadrupedal robot. We train this policy across a multitude of simulated terrains and ensure its generality by introducing training methods that avoid overfilling and convergence towards local optima. Additionally, in order to capture terrain information, we include a latent representation of the height maps in the observation space of the RL environment as a form of exteroceptive feedback. We test the performance of our trained policy by tracking the corrected set points using a model-based whole-body controller and compare it with the tracking behavior obtained without the corrective feedback in several simulation environments, and show that introducing the corrective feedback results in increase of the success rate from 72.7% to 92.4% for tracking precomputed dynamic long horizon trajectories on flat terrain and from 47.5% to 803% on a complex modular uneven terrain. We also show successful transfer of our training approach to the real physical system and further present cogent arguments in support of our framework.
引用
收藏
页码:5973 / 5979
页数:7
相关论文
共 50 条
  • [1] Deep Reinforcement Learning for Real-Time Trajectory Planning in UAV Networks
    Li, Kai
    Ni, Wei
    Tovar, Eduardo
    Guizani, Mohsen
    2020 16TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC, 2020, : 958 - 963
  • [2] Reinforcement Learning With Evolutionary Trajectory Generator: A General Approach for Quadrupedal Locomotion
    Shi, Haojie
    Zhou, Bo
    Zeng, Hongsheng
    Wang, Fan
    Dong, Yueqiang
    Li, Jiangyong
    Wang, Kang
    Tian, Hao
    Meng, Max Q-H
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) : 3085 - 3092
  • [3] Real-time adaptive entry trajectory generation with modular policy and deep reinforcement learning
    Peng, Gaoxiang
    Wang, Bo
    Liu, Lei
    Fan, Huijin
    Cheng, Zhongtao
    AEROSPACE SCIENCE AND TECHNOLOGY, 2023, 142
  • [4] Deep reinforcement learning based trajectory real-time planning for hypersonic gliding vehicles
    Li, Jianfeng
    Song, Shenmin
    Shi, Xiaoping
    Proceedings of the Institution of Mechanical Engineers, Part G: Journal of Aerospace Engineering, 2024, 238 (16) : 1665 - 1682
  • [5] Real-time security margin control using deep reinforcement learning
    Hagmar, Hannes
    Eriksson, Robert
    Tuan, Le Anh
    ENERGY AND AI, 2023, 13
  • [6] Real-Time Energy Management of a Microgrid Using Deep Reinforcement Learning
    Ji, Ying
    Wang, Jianhui
    Xu, Jiacan
    Fang, Xiaoke
    Zhang, Huaguang
    ENERGIES, 2019, 12 (12)
  • [7] Real-time model calibration with deep reinforcement learning
    Tian, Yuan
    Chao, Manuel Arias
    Kulkarni, Chetan
    Goebel, Kai
    Fink, Olga
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2022, 165
  • [8] CPG-Based Hierarchical Locomotion Control for Modular Quadrupedal Robots Using Deep Reinforcement Learning
    Wang, Jiayu
    Hu, Chuxiong
    Zhu, Yu
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04) : 7193 - 7200
  • [9] A Real-Time Planning and Control Framework for Robust and Dynamic Quadrupedal Locomotion
    Jun Li
    Haibo Gao
    Yuhui Wan
    Haitao Yu
    Chengxu Zhou
    Journal of Bionic Engineering, 2023, 20 : 1449 - 1466
  • [10] DreamWaQ: Learning Robust Quadrupedal Locomotion With Implicit Terrain Imagination via Deep Reinforcement Learning
    Nahrendra, I. Made Aswin
    Yu, Byeongho
    Myung, Hyun
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 5078 - 5084