Visual Navigation for Biped Humanoid Robots Using Deep Reinforcement Learning

被引:60
|
作者
Lobos-Tsunekawa, Kenzo [1 ,2 ]
Leiva, Francisco [1 ,2 ]
Ruiz-del-Solar, Javier [1 ,2 ]
机构
[1] Univ Chile, Dept Elect Engn, Santiago 8320000, Chile
[2] Univ Chile, Adv Min Technol Ctr, Santiago 8320000, Chile
来源
关键词
Visual-based navigation; deep learning in robotics and automation and humanoid robots;
D O I
10.1109/LRA.2018.2851148
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In this letter, we propose a map-less visual navigation system for biped humanoid robots, which extracts information from color images to derive motion commands using deep reinforcement learning (DRL). Themap-less visual navigation policy is trained using the Deep Deterministic Policy Gradients (DDPG) algorithm, which corresponds to an actor-critic DRL algorithm. The algorithm is implemented using two separate networks, one for the actor and one for the critic, but with similar structures. In addition to convolutional and fully connected layers, Long Short-Term Memory (LSTM) layers are included to address the limited observability present in the problem. As a proof of concept, we consider the case of robotic soccer using humanoid NAO V5 robots, which have reduced computational capabilities, and low-cost Red - Green - Blue (RGB) cameras as main sensors. The use of DRL allowed to obtain a complex and high performant policy from scratch, without any prior knowledge of the domain, or the dynamics involved. The visual navigation policy is trained in a robotic simulator and then successfully transferred to a physical robot, where it is able to run in 20 ms, allowing its use in real-time applications.
引用
收藏
页码:3247 / 3254
页数:8
相关论文
共 50 条
  • [1] Deep Reinforcement Learning For Visual Navigation of Wheeled Mobile Robots
    Nwaonumah, Ezebuugo
    Samanta, Biswanath
    [J]. IEEE SOUTHEASTCON 2020, 2020,
  • [2] Learning to Move an Object by the Humanoid Robots by Using Deep Reinforcement Learning
    Aslan, Simge Nur
    Tasci, Burak
    Ucar, Aysegul
    Guzelis, Cuneyt
    [J]. INTELLIGENT ENVIRONMENTS 2021, 2021, 29 : 143 - 155
  • [3] Ball Dribbling for Humanoid Biped Robots: A Reinforcement Learning and Fuzzy Control Approach
    Leottau, Leonardo
    Celemin, Carlos
    Ruiz-del-Solar, Javier
    [J]. ROBOCUP 2014: ROBOT WORLD CUP XVIII, 2015, 8992 : 549 - 561
  • [4] Autonomous Visual Navigation using Deep Reinforcement Learning: An Overview
    Ejaz, Muhammad Mudassir
    Tang, Tong Boon
    Lu, Cheng-Kai
    [J]. 2019 17TH IEEE STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED), 2019, : 294 - 299
  • [5] VISUAL NAVIGATION OF WHEELED MOBILE ROBOTS USING DEEP REINFORCEMENT LEARNING: SIMULATION TO REAL-TIME IMPLEMENTATION
    Nwaonumah, Ezebuugo
    Samanta, Biswanath
    [J]. PROCEEDINGS OF THE ASME DYNAMIC SYSTEMS AND CONTROL CONFERENCE, DSCC2020, VOL 1, 2020,
  • [6] Reinforcement learning with imitative behaviors for humanoid robots navigation: synchronous planning and control
    Wang, Xiaoying
    Zhang, Tong
    [J]. AUTONOMOUS ROBOTS, 2024, 48 (02)
  • [7] Autonomous UAV Visual Navigation Using an Improved Deep Reinforcement Learning
    Samma, Hussein
    El-Ferik, Sami
    [J]. IEEE ACCESS, 2024, 12 : 79967 - 79977
  • [8] Mapless Navigation for Autonomous Robots: A Deep Reinforcement Learning Approach
    Zhang, Pengpeng
    Wei, Changyun
    Cai, Boliang
    Ouyang, Yongping
    [J]. 2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 3141 - 3146
  • [9] Development of Push-Recovery control system for humanoid robots using deep reinforcement learning
    Aslan, Emrah
    Arserim, Muhammet Ali
    Ucar, Aysegul
    [J]. AIN SHAMS ENGINEERING JOURNAL, 2023, 14 (10)
  • [10] Deep Reinforcement Learning for Visual Semantic Navigation with Memory
    de Andrade Santos, Iury Batista
    Romero, Roseli A. F.
    [J]. 2020 XVIII LATIN AMERICAN ROBOTICS SYMPOSIUM, 2020 XII BRAZILIAN SYMPOSIUM ON ROBOTICS AND 2020 XI WORKSHOP OF ROBOTICS IN EDUCATION (LARS-SBR-WRE 2020), 2020, : 114 - 119