Deep Reinforcement Learning for Humanoid Robot Dribbling

被引:4
|
作者
Muzio, Alexandre F., V [1 ]
Maximo, Marcos R. O. A. [1 ]
Yoneyama, Takashi [2 ]
机构
[1] Aeronaut Inst Technol, Comp Sci Div, Autonomous Computat Syst Lab LAB SCA, Praca Marechal Eduardo Gomes 50, BR-12228900 Sao Jose Dos Campos, SP, Brazil
[2] Aeronaut Inst Technol, Elect Engn Div, Praca Marechal Eduardo Gomes 50, BR-12228900 Sao Jose Dos Campos, SP, Brazil
来源
2020 XVIII LATIN AMERICAN ROBOTICS SYMPOSIUM, 2020 XII BRAZILIAN SYMPOSIUM ON ROBOTICS AND 2020 XI WORKSHOP OF ROBOTICS IN EDUCATION (LARS-SBR-WRE 2020) | 2020年
关键词
D O I
10.1109/lars/sbr/wre51543.2020.9307084
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Humanoid robot soccer is a very traditional competitive task that aims to push the boundaries of state-of-the-art robotics. One of the many challenges of playing soccer is walking and running while not losing balance. Deep Reinforcement Learning (DRL) has been used to solve complex continuous control problems such as those in robotics. In this work, we focused on learning humanoid robot behavior to dribble a ball against a single opponent. Instead of learning how to control joint commands directly, we adopt an approach where the learning agent interacts with a predefined walking engine. Using DRL model-free algorithms (namely, Deep Deterministic Policy Gradients, Trust Region Policy Optimization, and Proximal Policy Optimization), we effectively learn a high level policy that allows a humanoid robot to fulfill this task. Finally, the learned dribble policy was evaluated on a simulated Nao robot from the RoboCup 3D Soccer Simulation League. According to our results, the learned agent was able to surpass the handcoded behavior effectively used by the ITAndroids robotics team in the RoboCup competition.
引用
收藏
页码:246 / 251
页数:6
相关论文
共 50 条
  • [1] Deep Reinforcement Learning for Humanoid Robot Behaviors
    Muzio, Alexandre F. V.
    Maximo, Marcos R. O. A.
    Yoneyama, Takashi
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2022, 105 (01)
  • [2] Deep Reinforcement Learning for Humanoid Robot Behaviors
    Alexandre F. V. Muzio
    Marcos R. O. A. Maximo
    Takashi Yoneyama
    Journal of Intelligent & Robotic Systems, 2022, 105
  • [3] Deep Reinforcement Learning for Humanoid Robot Behaviors
    Muzio, Alexandre F. V.
    Maximo, Marcos R. O. A.
    Yoneyama, Takashi
    Journal of Intelligent and Robotic Systems: Theory and Applications, 2022, 105 (01):
  • [4] Deep Reinforcement Learning for a Humanoid Robot Soccer Player
    Isaac Jesus da Silva
    Danilo Hernani Perico
    Thiago Pedro Donadon Homem
    Reinaldo Augusto da Costa Bianchi
    Journal of Intelligent & Robotic Systems, 2021, 102
  • [5] Deep Reinforcement Learning for a Humanoid Robot Soccer Player
    da Silva, Isaac Jesus
    Perico, Danilo Hernani
    Donadon Homem, Thiago Pedro
    da Costa Bianchi, Reinaldo Augusto
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2021, 102 (03)
  • [6] Ball Dribbling for Humanoid Biped Robots: A Reinforcement Learning and Fuzzy Control Approach
    Leottau, Leonardo
    Celemin, Carlos
    Ruiz-del-Solar, Javier
    ROBOCUP 2014: ROBOT WORLD CUP XVIII, 2015, 8992 : 549 - 561
  • [7] Generalized Model Learning for Reinforcement Learning on a Humanoid Robot
    Hester, Todd
    Quinlan, Michael
    Stone, Peter
    2010 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2010, : 2369 - 2374
  • [8] Humanoid robot control based on reinforcement learning
    Iida, S. (iida@ics.nitech.ac.jp), IEEE Robotics and Automation Society; Nagoya University, Japan; City of Nagoya, Japan; Nagoya City Science Museum; Chubu Science and Technology Center (Institute of Electrical and Electronics Engineers Inc.):
  • [9] Humanoid robot control based on reinforcement learning
    Iida, S
    Kuwayama, K
    Kanoh, M
    Kato, S
    Kunitachi, T
    Itoh, H
    PROCEEDINGS OF THE 2004 INTERNATIONAL SYMPOSIUM ON MICRO-NANOMECHATRONICS AND HUMAN SCIENCE, 2004, : 353 - 358
  • [10] A Reinforcement Learning Method for Humanoid Robot Walking
    Liu, Yunda
    Bi, Sheng
    Dong, Min
    Zhang, Yingjie
    Huang, Jialing
    Zhang, Jiawei
    2018 IEEE 8TH ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (IEEE-CYBER), 2018, : 623 - 628