Development of Push-Recovery control system for humanoid robots using deep reinforcement learning

Cited by: 7
Authors
Aslan, Emrah [1]
Arserim, Muhammet Ali [2]
Ucar, Aysegul [3]
Affiliations
[1] Dicle Univ, Silvan Vocat Sch, Diyarbakir, Turkiye
[2] Dicle Univ, Engn Fac, Diyarbakir, Turkiye
[3] Firat Univ, Engn Fac, Elazig, Turkiye
Keywords
Deep reinforcement learning; Deep Q Network (DQN); Double Deep Q Network (DDQN); Humanoid robot; Robotis OP2; Push-recovery; GENERATION
DOI
10.1016/j.asej.2023.102167
Chinese Library Classification
T [Industrial Technology];
Subject classification code
08;
Abstract
This paper addresses the push-recovery problem of bipedal humanoid robots subjected to external forces and pushes. Because humanoid robots are structurally unstable, balance is their most important problem. Our purpose is to design and implement a completely independent push-recovery control system that can imitate human actions. An active balance controller is presented that keeps humanoid robots balanced while standing or walking and prevents the loss of balance that external forces may cause. Push-recovery controllers consist of three strategies: the ankle strategy, the hip strategy, and the step strategy. These strategies are the biomechanical responses that people show when their balance is disturbed. Both simulation and real-world tests were performed. The simulation tests were carried out with 3D models in the Webots environment, and the real-world tests were performed on the Robotis-OP2 humanoid robot. Gyroscope, accelerometer, and motor data from the robot's sensors were recorded while an external pushing force was applied to the robot. The robot was balanced using the recorded data and the ankle strategy. To make the robot completely autonomous, the Deep Q Network (DQN) and Double Deep Q Network (DDQN) methods from Deep Reinforcement Learning (DRL) were applied. The DDQN algorithm yielded 21.03% more successful results than the DQN algorithm. The results of the real-environment tests were consistent with the simulation results. (c) 2023 THE AUTHORS. Published by Elsevier BV on behalf of Faculty of Engineering, Ain Shams University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
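The abstract compares DQN and DDQN without stating how the two differ. As a minimal sketch of the textbook distinction (not the authors' implementation), the two methods compute the bootstrapped learning target differently: DQN lets the target network both choose and score the next action, while DDQN lets the online network choose and the target network score. The function names, the discount value, and the Q-values below are hypothetical and chosen only for illustration.

```python
from typing import Sequence


def dqn_target(reward: float, gamma: float,
               q_target_next: Sequence[float], done: bool) -> float:
    """DQN target: the target network selects and evaluates the next action."""
    if done:
        return reward  # no bootstrapping from a terminal state
    return reward + gamma * max(q_target_next)


def ddqn_target(reward: float, gamma: float,
                q_online_next: Sequence[float],
                q_target_next: Sequence[float], done: bool) -> float:
    """DDQN target: the online network selects, the target network evaluates."""
    if done:
        return reward
    # Index of the action the online network considers best in the next state.
    best_action = max(range(len(q_online_next)), key=lambda a: q_online_next[a])
    return reward + gamma * q_target_next[best_action]


# Hypothetical next-state Q-values for three discrete actions.
q_online_next = [1.0, 2.0, 0.5]
q_target_next = [1.5, 1.0, 3.0]

print(dqn_target(1.0, 0.5, q_target_next, done=False))
print(ddqn_target(1.0, 0.5, q_online_next, q_target_next, done=False))
```

With these made-up values the DQN target is larger than the DDQN target, because DQN takes the maximum of the same estimates it evaluates. Decoupling selection from evaluation is the standard motivation for DDQN, since it reduces overestimation of action values.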
Pages: 11
Related papers
50 in total
  • [31] Robust and accurate feature selection for humanoid push recovery and classification: deep learning approach
    Semwal, Vijay Bhaskar
    Mondal, Kaushik
    Nandi, G. C.
    [J]. NEURAL COMPUTING & APPLICATIONS, 2017, 28 (03): 565 - 574
  • [32] On Training Flexible Robots using Deep Reinforcement Learning
    Dwiel, Zach
    Candadai, Madhavun
    Phielipp, Mariano
    [J]. 2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019: 4666 - 4671
  • [33] Humanoid push recovery using sensory reweighting
    Maalouf, Noel
    Elhajj, Imad H.
    Shammas, Elie
    Asmar, Daniel
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2017, 94 : 208 - 218
  • [34] Online Learning of Low Dimensional Strategies for High-Level Push Recovery in Bipedal Humanoid Robots
    Yi, Seung-Joon
    Zhang, Byoung-Tak
    Hong, Dennis
    Lee, Daniel D.
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2013: 1649 - 1655
  • [35] Optimization of ankle exoskeleton control parameters for human upright standing push-recovery
    Pang M.
    Zhan J.
    Tang B.
    Xiang K.
    [J]. Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2023, 51 (02): 95 - 101
  • [36] Deep-reinforcement-learning-based gait pattern controller on an uneven terrain for humanoid robots
    Kuo, Ping-Huan
    Pao, Chieh-Hsiu
    Chang, En-Yi
    Yau, Her-Terng
    [J]. INTERNATIONAL JOURNAL OF OPTOMECHATRONICS, 2023, 17 (01)
  • [37] Erratum to: Robust and accurate feature selection for humanoid push recovery and classification: deep learning approach
    Vijay Bhaskar Semwal
    Kaushik Mondal
    G. C. Nandi
    [J]. Neural Computing and Applications, 2017, 28 : 1907 - 1907
  • [38] Deep Reinforcement Learning for Humanoid Robot Behaviors
    Muzio, Alexandre F. V.
    Maximo, Marcos R. O. A.
    Yoneyama, Takashi
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2022, 105 (01)
  • [39] Deep Reinforcement Learning for Humanoid Robot Behaviors
    Alexandre F. V. Muzio
    Marcos R. O. A. Maximo
    Takashi Yoneyama
    [J]. Journal of Intelligent & Robotic Systems, 2022, 105
  • [40] Deep Reinforcement Learning for Humanoid Robot Dribbling
    Muzio, Alexandre F. V.
    Maximo, Marcos R. O. A.
    Yoneyama, Takashi
    [J]. 2020 XVIII LATIN AMERICAN ROBOTICS SYMPOSIUM, 2020 XII BRAZILIAN SYMPOSIUM ON ROBOTICS AND 2020 XI WORKSHOP OF ROBOTICS IN EDUCATION (LARS-SBR-WRE 2020), 2020: 246 - 251