Learning Push Recovery Behaviors for Humanoid Walking Using Deep Reinforcement Learning

被引：0

作者：

Dicksiano C. Melo

Marcos R. O. A. Maximo

Adilson Marques da Cunha

机构：

[1] Aeronautics Institute of Technology,Autonomous Computational Systems Lab (LAB

来源：

Journal of Intelligent & Robotic Systems | 2022年 / 106卷

关键词：

Deep reinforcement learning; Robotics; Proximal policy optimization;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

The development of a robust and versatile biped walking engine might be considered one of the hardest problems in Mobile Robotics. Even well-developed cities contains obstacles that make the navigation of these agents without a human assistance infeasible. Therefore, it is primordial that they be able to restore dynamically their own balance when subject to certain types of external disturbances. Thereby, this article contributes with a implementation of a Push Recovery controller that improves the walking engine’s performance used by a simulated humanoid agent from RoboCup 3D Soccer Simulation League environment. This work applies Proximal Policy Optimization in order to learn a movement policy in this simulator. Our learned policy was able to surpass the baselines with statistical significance. Finally, we propose two approaches based on Transfer Learning and Imitation Learning to achieve a final policy which performs well across an wide range disturbance directions.

引用

共 50 条

[1] Learning Push Recovery Behaviors for Humanoid Walking Using Deep Reinforcement Learning
Melo, Dicksiano C.
Maximo, Marcos R. O. A.
da Cunha, Adilson Marques
[J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2022, 106 (01)
[2] Push Recovery Control for Humanoid Robot using Reinforcement Learning
Seo, Donghyeon
Kim, Harin
Kim, Donghan
[J]. 2019 THIRD IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING (IRC 2019), 2019, : 488 - 492
[3] Development of Push-Recovery control system for humanoid robots using deep reinforcement learning
Aslan, Emrah
Arserim, Muhammet Ali
Ucar, Aysegul
[J]. AIN SHAMS ENGINEERING JOURNAL, 2023, 14 (10)
[4] Deep Reinforcement Learning for Humanoid Robot Behaviors
Muzio, Alexandre F. V.
Maximo, Marcos R. O. A.
Yoneyama, Takashi
[J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2022, 105 (01)
[5] Deep Reinforcement Learning for Humanoid Robot Behaviors
Alexandre F. V. Muzio
Marcos R. O. A. Maximo
Takashi Yoneyama
[J]. Journal of Intelligent & Robotic Systems, 2022, 105
[6] Push Recovery Strategies through Deep Reinforcement Learning
Melo, Dicksiano Carvalho
Maximo, Marcos R. O. A.
da Cunha, Adilson Marques
[J]. 2020 XVIII LATIN AMERICAN ROBOTICS SYMPOSIUM, 2020 XII BRAZILIAN SYMPOSIUM ON ROBOTICS AND 2020 XI WORKSHOP OF ROBOTICS IN EDUCATION (LARS-SBR-WRE 2020), 2020, : 240 - 245
[7] Learning to Move an Object by the Humanoid Robots by Using Deep Reinforcement Learning
Aslan, Simge Nur
Tasci, Burak
Ucar, Aysegul
Guzelis, Cuneyt
[J]. INTELLIGENT ENVIRONMENTS 2021, 2021, 29 : 143 - 155
[8] A Reinforcement Learning Method for Humanoid Robot Walking
Liu, Yunda
Bi, Sheng
Dong, Min
Zhang, Yingjie
Huang, Jialing
Zhang, Jiawei
[J]. 2018 IEEE 8TH ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (IEEE-CYBER), 2018, : 623 - 628
[9] Learning Capture Points for Humanoid Push Recovery
Rebula, John
Canas, Fabian
Pratt, Jerry
Goswami, Ambarish
[J]. HUMANOIDS: 2007 7TH IEEE-RAS INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS, 2007, : 65 - +
[10] Reactive Stepping for Humanoid Robots using Reinforcement Learning: Application to Standing Push Recovery on the Exoskeleton Atalante
Duburcq, Alexis
Schramm, Fabian
Boeris, Guilhem
Bredeche, Nicolas
Chevaleyre, Yann
[J]. 2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 9302 - 9309

← 1 2 3 4 5 →