A Reinforcement Learning Approach to Score Goals in RoboCup 3D Soccer Simulation for Nao Humanoid Robot

被引:0
|
作者
Fahami, Mohammad Amin [1 ]
Roshanzamir, Mohamad [1 ]
Izadi, Navid Hoseini [1 ]
机构
[1] Isfahan Univ Technol, Dept Elect & Comp Engn, Esfahan, Iran
关键词
Reinforcement Learning; Nao Humanoid Robot; RoboCup 3D Soccer simulation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning is one of the best methods to train autonomous robots. Using this method, a robot can learn to make optimal decisions without detailed programming and hard coded instructions. So, this method is useful for learning complex robotic behaviors. For example, in RoboCup competitions this method will be very useful in learning different behaviors. We propose a method for training a robot to score a goal from anywhere on the field by one or more kicks. Using reinforcement learning, Nao robot will learn the optimal policy to kick towards desired points correctly. Learning process is done in two phases. In the first phase, Nao learns to kick such that the ball goes more distance with minimum divergence from the desired path. In the second phase, the robot learns an optimal policy to score a goal by one or more kicks. Using this method, our robot performance increased significantly compared with kicking towards predetermined points in the goal.
引用
收藏
页码:450 / 454
页数:5
相关论文
共 50 条
  • [31] Real-time Active Vision for a Humanoid Soccer Robot using Deep Reinforcement Learning
    Khatibi, Soheil
    Teimouri, Meisam
    Rezaei, Mahdi
    ICAART: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2021, : 742 - 751
  • [32] The Walking Skill of Apollo3D-The Champion Team in the RoboCup2013 3D Soccer Simulation Competition
    Liu, Juan
    Liang, Zhiwei
    Shen, Ping
    Hao, Yue
    Zhao, Hecheng
    ROBOCUP 2013: ROBOT WORLD CUP XVII, 2014, 8371 : 104 - 113
  • [33] Walking Algorithm of Apollo3D-The World Champion in the RoboCup2013 3D Soccer Simulation Competition
    Zhao Hecheng
    Liang Zhiwei
    2014 33RD CHINESE CONTROL CONFERENCE (CCC), 2014, : 8331 - 8334
  • [34] Multi-robot Collaboration Based on Markov Decision Process in Robocup3D Soccer Simulation Game
    Cui Xuanyu
    Liang Zhiwei
    Yang Yongyi
    Shen Ping
    Wang Jiawen
    Liu Haoran
    Fan Kai
    2015 27TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2015, : 4345 - 4349
  • [35] Logfile player and Analyzer for RoboCup 3D simulation
    Planthaber, Steffen
    Visser, Ubbo
    ROBOCUP 2006: ROBOT SOCCER WORLD CUP X, 2007, 4434 : 426 - +
  • [36] Efficient Reinforcement Learning for 3D LiDAR Navigation of Mobile Robot
    Zhai, Yu
    Liu, Zhe
    Miao, Yanzi
    Wang, Hesheng
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 3755 - 3760
  • [37] Walking motion design of humanoid robots in RoboCup3D simulation platform
    Liang, Zhiwei
    Shen, Ping
    Li, Xuejun
    International Journal of Modelling and Simulation, 2015, 35 (01): : 34 - 41
  • [38] 3D Polygonal Mapping for Humanoid Robot Navigation
    Roychoudhury, Arindam
    Missura, Marcell
    Bennewitz, Maren
    2022 IEEE-RAS 21ST INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS), 2022, : 171 - 177
  • [39] 3D linear visual servoing for humanoid robot
    Namba, K
    Maru, N
    IECON-2002: PROCEEDINGS OF THE 2002 28TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, VOLS 1-4, 2002, : 2225 - 2230
  • [40] Assistance via IoT networking cameras and evidence theory for 3D object instance recognition: Application for the NAO humanoid robot
    Didier, Coquin
    Boukezzoula, Reda
    Benoit, Alexandre
    Nguyen, Thanh Long
    INTERNET OF THINGS, 2020, 9