Learning Push Recovery Behaviors for Humanoid Walking Using Deep Reinforcement Learning

被引:0
|
作者
Dicksiano C. Melo
Marcos R. O. A. Maximo
Adilson Marques da Cunha
机构
[1] Aeronautics Institute of Technology,Autonomous Computational Systems Lab (LAB
来源
关键词
Deep reinforcement learning; Robotics; Proximal policy optimization;
D O I
暂无
中图分类号
学科分类号
摘要
The development of a robust and versatile biped walking engine might be considered one of the hardest problems in Mobile Robotics. Even well-developed cities contains obstacles that make the navigation of these agents without a human assistance infeasible. Therefore, it is primordial that they be able to restore dynamically their own balance when subject to certain types of external disturbances. Thereby, this article contributes with a implementation of a Push Recovery controller that improves the walking engine’s performance used by a simulated humanoid agent from RoboCup 3D Soccer Simulation League environment. This work applies Proximal Policy Optimization in order to learn a movement policy in this simulator. Our learned policy was able to surpass the baselines with statistical significance. Finally, we propose two approaches based on Transfer Learning and Imitation Learning to achieve a final policy which performs well across an wide range disturbance directions.
引用
收藏
相关论文
共 50 条
  • [1] Learning Push Recovery Behaviors for Humanoid Walking Using Deep Reinforcement Learning
    Melo, Dicksiano C.
    Maximo, Marcos R. O. A.
    da Cunha, Adilson Marques
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2022, 106 (01)
  • [2] Push Recovery Control for Humanoid Robot using Reinforcement Learning
    Seo, Donghyeon
    Kim, Harin
    Kim, Donghan
    [J]. 2019 THIRD IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING (IRC 2019), 2019, : 488 - 492
  • [3] Development of Push-Recovery control system for humanoid robots using deep reinforcement learning
    Aslan, Emrah
    Arserim, Muhammet Ali
    Ucar, Aysegul
    [J]. AIN SHAMS ENGINEERING JOURNAL, 2023, 14 (10)
  • [4] Deep Reinforcement Learning for Humanoid Robot Behaviors
    Muzio, Alexandre F. V.
    Maximo, Marcos R. O. A.
    Yoneyama, Takashi
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2022, 105 (01)
  • [5] Deep Reinforcement Learning for Humanoid Robot Behaviors
    Alexandre F. V. Muzio
    Marcos R. O. A. Maximo
    Takashi Yoneyama
    [J]. Journal of Intelligent & Robotic Systems, 2022, 105
  • [6] Push Recovery Strategies through Deep Reinforcement Learning
    Melo, Dicksiano Carvalho
    Maximo, Marcos R. O. A.
    da Cunha, Adilson Marques
    [J]. 2020 XVIII LATIN AMERICAN ROBOTICS SYMPOSIUM, 2020 XII BRAZILIAN SYMPOSIUM ON ROBOTICS AND 2020 XI WORKSHOP OF ROBOTICS IN EDUCATION (LARS-SBR-WRE 2020), 2020, : 240 - 245
  • [7] Learning to Move an Object by the Humanoid Robots by Using Deep Reinforcement Learning
    Aslan, Simge Nur
    Tasci, Burak
    Ucar, Aysegul
    Guzelis, Cuneyt
    [J]. INTELLIGENT ENVIRONMENTS 2021, 2021, 29 : 143 - 155
  • [8] A Reinforcement Learning Method for Humanoid Robot Walking
    Liu, Yunda
    Bi, Sheng
    Dong, Min
    Zhang, Yingjie
    Huang, Jialing
    Zhang, Jiawei
    [J]. 2018 IEEE 8TH ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (IEEE-CYBER), 2018, : 623 - 628
  • [9] Learning Capture Points for Humanoid Push Recovery
    Rebula, John
    Canas, Fabian
    Pratt, Jerry
    Goswami, Ambarish
    [J]. HUMANOIDS: 2007 7TH IEEE-RAS INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS, 2007, : 65 - +
  • [10] Reactive Stepping for Humanoid Robots using Reinforcement Learning: Application to Standing Push Recovery on the Exoskeleton Atalante
    Duburcq, Alexis
    Schramm, Fabian
    Boeris, Guilhem
    Bredeche, Nicolas
    Chevaleyre, Yann
    [J]. 2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 9302 - 9309