Deep Reinforcement Learning for Humanoid Robot Behaviors

Cited by: 16
Authors
Muzio, Alexandre F. V. [1]
Maximo, Marcos R. O. A. [1]
Yoneyama, Takashi [2]
Affiliations
[1] Aeronaut Inst Technol, Autonomous Computat Syst Lab LAB SCA, Comp Sci Div, Praca Marechal Eduardo Gomes 50, BR-12228900 Sao Jose Dos Campos, SP, Brazil
[2] Aeronaut Inst Technol, Elect Engn Div, Praca Marechal Eduardo Gomes 50, BR-12228900 Sao Jose Dos Campos, SP, Brazil
Keywords
Deep reinforcement learning; Robot soccer; Humanoid robots; Robotics;
DOI
10.1007/s10846-022-01619-y
Chinese Library Classification (CLC) number
TP18 [Theory of Artificial Intelligence];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
RoboCup 3D Soccer Simulation is a robot soccer competition based on a high-fidelity simulator with autonomous humanoid agents, making it an interesting testbed for robotics and artificial intelligence. Due to the recent success of Deep Reinforcement Learning (DRL) in continuous control tasks, many teams have been using this technique to develop motions in Soccer 3D. This article focuses on learning two humanoid robot behaviors: completing a racing track as fast as possible and dribbling against a single opponent. Our approach uses a hierarchical controller in which a model-free policy learns to interact with a model-based walking algorithm. Then, we use DRL algorithms for an agent to learn how to perform these behaviors. Finally, the learned dribble policy was evaluated in the Soccer 3D environment. Simulated experiments show that the DRL agent wins against the hand-coded behavior used by the ITAndroids robotics team in 68.2% of dribble attempts.
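The hierarchical structure mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not specify the interface between the two layers, so everything below is an assumption made for illustration: the high-level policy is assumed to emit planar velocity commands, `ToyWalkEngine` is a hypothetical kinematic stand-in for the model-based walking algorithm, and `placeholder_policy` is a hand-written rule standing in for a trained DRL policy. None of these names come from the paper.

```python
import math
from dataclasses import dataclass


@dataclass
class WalkCommand:
    """Assumed high-level action: planar velocity request for the walk engine."""
    vx: float      # forward velocity (m/s)
    vy: float      # lateral velocity (m/s)
    vtheta: float  # turning rate (rad/s)


class ToyWalkEngine:
    """Hypothetical stand-in for a model-based walking algorithm.

    It only integrates clamped velocity commands into a planar pose;
    a real walk engine would generate joint trajectories instead.
    """

    def __init__(self, dt=0.02, max_speed=0.5, max_turn=1.0):
        self.dt, self.max_speed, self.max_turn = dt, max_speed, max_turn
        self.x = self.y = self.theta = 0.0

    def step(self, cmd):
        # Clamp the request to what the (assumed) walk engine can track.
        vx = max(-self.max_speed, min(self.max_speed, cmd.vx))
        vy = max(-self.max_speed, min(self.max_speed, cmd.vy))
        vt = max(-self.max_turn, min(self.max_turn, cmd.vtheta))
        # Rotate body-frame velocities into the world frame and integrate.
        self.x += (vx * math.cos(self.theta) - vy * math.sin(self.theta)) * self.dt
        self.y += (vx * math.sin(self.theta) + vy * math.cos(self.theta)) * self.dt
        self.theta += vt * self.dt
        return self.x, self.y, self.theta


def placeholder_policy(obs):
    """Hand-written rule standing in for a learned model-free policy."""
    _dist, bearing = obs
    # Walk forward when facing the target, and turn toward it.
    return WalkCommand(vx=0.5 * math.cos(bearing), vy=0.0, vtheta=2.0 * bearing)


def run_episode(target=(2.0, 1.0), steps=500):
    """Run the two-layer loop and return the final distance to the target."""
    engine = ToyWalkEngine()
    x, y, theta = 0.0, 0.0, 0.0
    for _ in range(steps):
        dx, dy = target[0] - x, target[1] - y
        dist = math.hypot(dx, dy)
        bearing = math.atan2(dy, dx) - theta
        bearing = math.atan2(math.sin(bearing), math.cos(bearing))  # wrap to [-pi, pi]
        # High level picks a command; low level turns it into motion.
        x, y, theta = engine.step(placeholder_policy((dist, bearing)))
    return math.hypot(target[0] - x, target[1] - y)


if __name__ == "__main__":
    print(f"final distance to target: {run_episode():.3f} m")
```

In a setup of this kind, the learning algorithm would only need to replace `placeholder_policy`, while the walking layer stays fixed. Whether the paper's controller is split exactly this way is not stated in the abstract.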
Pages: 16
Related papers
50 in total
  • [1] Deep Reinforcement Learning for Humanoid Robot Behaviors
    Alexandre F. V. Muzio
    Marcos R. O. A. Maximo
    Takashi Yoneyama
    Journal of Intelligent & Robotic Systems, 2022, 105
  • [2] Deep Reinforcement Learning for Humanoid Robot Behaviors
    Muzio, Alexandre F. V.
    Maximo, Marcos R. O. A.
    Yoneyama, Takashi
    Journal of Intelligent and Robotic Systems: Theory and Applications, 2022, 105 (01)
  • [3] Deep Reinforcement Learning for Humanoid Robot Dribbling
    Muzio, Alexandre F. V.
    Maximo, Marcos R. O. A.
    Yoneyama, Takashi
    2020 XVIII LATIN AMERICAN ROBOTICS SYMPOSIUM, 2020 XII BRAZILIAN SYMPOSIUM ON ROBOTICS AND 2020 XI WORKSHOP OF ROBOTICS IN EDUCATION (LARS-SBR-WRE 2020), 2020, : 246 - 251
  • [4] Deep Reinforcement Learning for a Humanoid Robot Soccer Player
    Isaac Jesus da Silva
    Danilo Hernani Perico
    Thiago Pedro Donadon Homem
    Reinaldo Augusto da Costa Bianchi
    Journal of Intelligent & Robotic Systems, 2021, 102
  • [5] Deep Reinforcement Learning for a Humanoid Robot Soccer Player
    da Silva, Isaac Jesus
    Perico, Danilo Hernani
    Donadon Homem, Thiago Pedro
    da Costa Bianchi, Reinaldo Augusto
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2021, 102 (03)
  • [6] Learning Push Recovery Behaviors for Humanoid Walking Using Deep Reinforcement Learning
    Melo, Dicksiano C.
    Maximo, Marcos R. O. A.
    da Cunha, Adilson Marques
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2022, 106 (01)
  • [7] Learning Push Recovery Behaviors for Humanoid Walking Using Deep Reinforcement Learning
    Dicksiano C. Melo
    Marcos R. O. A. Maximo
    Adilson Marques da Cunha
    Journal of Intelligent & Robotic Systems, 2022, 106
  • [8] Kinesthetic Learning of Behaviors in a Humanoid Robot
    Cho, Sumin
    Jo, Sungho
    2011 11TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2011, : 1108 - 1112
  • [9] Generalized Model Learning for Reinforcement Learning on a Humanoid Robot
    Hester, Todd
    Quinlan, Michael
    Stone, Peter
    2010 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2010, : 2369 - 2374
  • [10] Humanoid robot control based on reinforcement learning
    Iida, S. (iida@ics.nitech.ac.jp), IEEE Robotics and Automation Society; Nagoya University, Japan; City of Nagoya, Japan; Nagoya City Science Museum; Chubu Science and Technology Center (Institute of Electrical and Electronics Engineers Inc.):