Visual Navigation for Biped Humanoid Robots Using Deep Reinforcement Learning

Cited by: 60

Authors:

Lobos-Tsunekawa, Kenzo [1,2]

Leiva, Francisco [1,2]

Ruiz-del-Solar, Javier [1,2]

Affiliations:

[1] Univ Chile, Dept Elect Engn, Santiago 8320000, Chile

[2] Univ Chile, Adv Min Technol Ctr, Santiago 8320000, Chile

Source:

IEEE ROBOTICS AND AUTOMATION LETTERS | 2018 / Vol. 3 / No. 4

Keywords:

Visual-based navigation; deep learning in robotics and automation; humanoid robots

DOI:

10.1109/LRA.2018.2851148

Chinese Library Classification (CLC) number:

TP24 [Robotics]

Discipline classification code:

080202; 1405

Abstract:

In this letter, we propose a map-less visual navigation system for biped humanoid robots, which extracts information from color images to derive motion commands using deep reinforcement learning (DRL). The map-less visual navigation policy is trained using the Deep Deterministic Policy Gradients (DDPG) algorithm, an actor-critic DRL algorithm. The algorithm is implemented using two separate networks with similar structures, one for the actor and one for the critic. In addition to convolutional and fully connected layers, Long Short-Term Memory (LSTM) layers are included to address the limited observability present in the problem. As a proof of concept, we consider the case of robotic soccer using humanoid NAO V5 robots, which have reduced computational capabilities and low-cost Red-Green-Blue (RGB) cameras as their main sensors. The use of DRL made it possible to obtain a complex, high-performing policy from scratch, without any prior knowledge of the domain or the dynamics involved. The visual navigation policy is trained in a robotic simulator and then successfully transferred to a physical robot, where it runs in 20 ms, allowing its use in real-time applications.
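
As a rough illustration of the actor-critic training scheme named in the abstract, the sketch below shows two standard ingredients of DDPG: the critic's bootstrapped regression target and the soft (Polyak) update of the target networks. It is a generic sketch, not the authors' implementation. The function names `td_target` and `soft_update` and the default values of `gamma` and `tau` are illustrative assumptions that do not come from the letter, and flat lists of floats stand in for network weights.

```python
def td_target(reward, next_q, done, gamma=0.99):
    """Critic regression target: y = r + gamma * Q'(s', mu'(s')).

    next_q is the target critic's value for the next state and the
    target actor's action; the bootstrap term is dropped at episode end.
    gamma is an illustrative default, not a value from the letter.
    """
    return reward + (0.0 if done else gamma * next_q)


def soft_update(target_params, online_params, tau=0.001):
    """Polyak averaging: theta_target <- tau * theta + (1 - tau) * theta_target.

    Parameters are flat lists of floats here (a stand-in for real weights).
    tau is an illustrative default, not a value from the letter.
    """
    return [tau * p + (1.0 - tau) * t
            for t, p in zip(target_params, online_params)]


if __name__ == "__main__":
    # One non-terminal transition and one slow target-network step.
    print(td_target(reward=1.0, next_q=2.0, done=False, gamma=0.5))
    print(soft_update([0.0, 1.0], [1.0, 1.0], tau=0.5))
```

In a full DDPG loop, the critic is regressed toward `td_target`, the actor is updated along the critic's gradient with respect to the action, and `soft_update` is applied to both target networks after each step.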

Pages: 3247-3254

Number of pages: 8
