Parallel Reinforcement Learning Simulation for Visual Quadrotor Navigation

被引：2

作者：

Saunders, Jack ^{[1
]}

Saeedi, Sajad ^{[2
]}

Li, Wenbin ^{[1
]}

机构：

[1] Univ Bath, Dept Comp Sci, Bath, Avon, England

[2] Toronto Metropolitan Univ, Dept Mech & Ind Engn, Toronto, ON, Canada

来源：

2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA | 2023年

关键词：

ENVIRONMENT;

D O I：

10.1109/ICRA48891.2023.10160675

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Reinforcement learning (RL) is an agent-based approach for teaching robots to navigate within the physical world. Gathering data for RL is known to be a laborious task, and real-world experiments can be risky. Simulators facilitate the collection of training data in a quicker and more cost-effective manner. However, RL frequently requires a significant number of simulation steps for an agent to become skilful at simple tasks. This is a prevalent issue within the field of RL-based visual quadrotor navigation where state dimensions are typically very large and dynamic models are complex. Furthermore, rendering images and obtaining physical properties of the agent can be computationally expensive. To solve this, we present a simulation framework, built on AirSim, which provides efficient parallel training. Building on this framework, Ape-X is modified to incorporate parallel training of AirSim environments to make use of numerous networked computers. Through experiments we were able to achieve a reduction in training time from 3.9 hours to 11 minutes, for a toy problem, using the aforementioned framework and a total of 74 agents and two networked computers. Further details including a github repo and videos about our project, PRL4AirSim, can be found at https://sites.google.com/view/prl4airsim/home

引用

下载

页码：1357 / 1363

页数：7

共 50 条

[1] Waypoint Navigation of Quadrotor using Deep Reinforcement Learning
Himanshu, K. Harikumar
Pushpangathan, Jinraj, V
IFAC PAPERSONLINE, 2022, 55 (22): : 281 - 286
[2] Quadrotor navigation in dynamic environments with deep reinforcement learning
Fang, Jinbao
Sun, Qiyu
Chen, Yukun
Tang, Yang
ASSEMBLY AUTOMATION, 2021, 41 (03) : 254 - 262
[3] Autonomous Navigation and Control of a Quadrotor Using Deep Reinforcement Learning
Mokhtar, Mohamed
El-Badawy, Ayman
2023 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS, ICUAS, 2023, : 1045 - 1052
[4] A Navigation Scheme for a Random Maze using Reinforcement Learning with Quadrotor Vision
Yu, Xinglin
Wu, Yuhu
Sun, Xi-Ming
2019 18TH EUROPEAN CONTROL CONFERENCE (ECC), 2019, : 518 - 523
[5] Deep Reinforcement Learning for Visual Semantic Navigation with Memory
de Andrade Santos, Iury Batista
Romero, Roseli A. F.
2020 XVIII LATIN AMERICAN ROBOTICS SYMPOSIUM, 2020 XII BRAZILIAN SYMPOSIUM ON ROBOTICS AND 2020 XI WORKSHOP OF ROBOTICS IN EDUCATION (LARS-SBR-WRE 2020), 2020, : 114 - 119
[6] Visual Navigation via Reinforcement Learning and Relational Reasoning
Zhou, Kang
Guo, Chi
Zhang, Huyin
2021 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, INTERNET OF PEOPLE, AND SMART CITY INNOVATIONS (SMARTWORLD/SCALCOM/UIC/ATC/IOP/SCI 2021), 2021, : 131 - 138
[7] Visual navigation of a quadrotor aerial vehicle
Courbon, Jonathan
Mezouar, Youcef
Guenard, Nicolas
Martinet, Philippe
2009 IEEE-RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2009, : 5315 - +
[8] Control of a Quadrotor With Reinforcement Learning
Hwangbo, Jemin
Sa, Inkyu
Siegwart, Roland
Hutter, Marco
IEEE ROBOTICS AND AUTOMATION LETTERS, 2017, 2 (04): : 2096 - 2103
[9] Visual Navigation Using Inverse Reinforcement Learning and an Extreme Learning Machine
Fang, Qiang
Zhang, Wenzhuo
Wang, Xitong
ELECTRONICS, 2021, 10 (16)
[10] Quadrotor Autonomous Navigation in Semi-Known Environments Based on Deep Reinforcement Learning
Ou, Jiajun
Guo, Xiao
Lou, Wenjie
Zhu, Ming
REMOTE SENSING, 2021, 13 (21)

← 1 2 3 4 5 →