Robust quadruped jumping via deep reinforcement learning

Cited by: 4
Authors
Bellegarda, Guillaume [1]
Nguyen, Chuong [2]
Nguyen, Quan [2]
Affiliations
[1] Ecole Polytech Fed Lausanne EPFL, CH-1015 Lausanne, VD, Switzerland
[2] Univ Southern Calif, Los Angeles, CA 90007 USA
Keywords
Quadruped jumping; Reinforcement learning; Trajectory optimization; Agile robots; CHEETAH
DOI
10.1016/j.robot.2024.104799
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we consider the general task of jumping varying distances and heights with a quadrupedal robot in noisy environments, such as taking off from uneven terrain and with variable robot dynamics parameters. To jump accurately in such conditions, we propose a framework using deep reinforcement learning that leverages and augments the complex solution of nonlinear trajectory optimization for quadrupedal jumping. While the standalone optimization limits jumping to take-off from flat ground and requires accurate assumptions about the robot dynamics, our proposed approach improves robustness, allowing jumps off significantly uneven terrain with variable robot dynamical parameters and environmental conditions. Compared with walking and running, realizing aggressive jumping on hardware requires accounting for the motors' torque-speed relationship as well as the robot's total power limits. By incorporating these constraints into our learning framework, we successfully deploy our policy sim-to-real without further tuning, fully exploiting the available onboard power supply and motors. We demonstrate robustness to environmental noise in the form of foot disturbances of up to 6 cm in height, or 33% of the robot's nominal standing height, while jumping twice the body length in distance.
Pages: 10