Parallel Reinforcement Learning Simulation for Visual Quadrotor Navigation

被引:2
|
作者
Saunders, Jack [1 ]
Saeedi, Sajad [2 ]
Li, Wenbin [1 ]
机构
[1] Univ Bath, Dept Comp Sci, Bath, Avon, England
[2] Toronto Metropolitan Univ, Dept Mech & Ind Engn, Toronto, ON, Canada
关键词
ENVIRONMENT;
D O I
10.1109/ICRA48891.2023.10160675
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning (RL) is an agent-based approach for teaching robots to navigate within the physical world. Gathering data for RL is known to be a laborious task, and real-world experiments can be risky. Simulators facilitate the collection of training data in a quicker and more cost-effective manner. However, RL frequently requires a significant number of simulation steps for an agent to become skilful at simple tasks. This is a prevalent issue within the field of RL-based visual quadrotor navigation where state dimensions are typically very large and dynamic models are complex. Furthermore, rendering images and obtaining physical properties of the agent can be computationally expensive. To solve this, we present a simulation framework, built on AirSim, which provides efficient parallel training. Building on this framework, Ape-X is modified to incorporate parallel training of AirSim environments to make use of numerous networked computers. Through experiments we were able to achieve a reduction in training time from 3.9 hours to 11 minutes, for a toy problem, using the aforementioned framework and a total of 74 agents and two networked computers. Further details including a github repo and videos about our project, PRL4AirSim, can be found at https://sites.google.com/view/prl4airsim/home
引用
下载
收藏
页码:1357 / 1363
页数:7
相关论文
共 50 条
  • [1] Waypoint Navigation of Quadrotor using Deep Reinforcement Learning
    Himanshu, K. Harikumar
    Pushpangathan, Jinraj, V
    IFAC PAPERSONLINE, 2022, 55 (22): : 281 - 286
  • [2] Quadrotor navigation in dynamic environments with deep reinforcement learning
    Fang, Jinbao
    Sun, Qiyu
    Chen, Yukun
    Tang, Yang
    ASSEMBLY AUTOMATION, 2021, 41 (03) : 254 - 262
  • [3] Autonomous Navigation and Control of a Quadrotor Using Deep Reinforcement Learning
    Mokhtar, Mohamed
    El-Badawy, Ayman
    2023 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS, ICUAS, 2023, : 1045 - 1052
  • [4] A Navigation Scheme for a Random Maze using Reinforcement Learning with Quadrotor Vision
    Yu, Xinglin
    Wu, Yuhu
    Sun, Xi-Ming
    2019 18TH EUROPEAN CONTROL CONFERENCE (ECC), 2019, : 518 - 523
  • [5] Deep Reinforcement Learning for Visual Semantic Navigation with Memory
    de Andrade Santos, Iury Batista
    Romero, Roseli A. F.
    2020 XVIII LATIN AMERICAN ROBOTICS SYMPOSIUM, 2020 XII BRAZILIAN SYMPOSIUM ON ROBOTICS AND 2020 XI WORKSHOP OF ROBOTICS IN EDUCATION (LARS-SBR-WRE 2020), 2020, : 114 - 119
  • [6] Visual Navigation via Reinforcement Learning and Relational Reasoning
    Zhou, Kang
    Guo, Chi
    Zhang, Huyin
    2021 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, INTERNET OF PEOPLE, AND SMART CITY INNOVATIONS (SMARTWORLD/SCALCOM/UIC/ATC/IOP/SCI 2021), 2021, : 131 - 138
  • [7] Visual navigation of a quadrotor aerial vehicle
    Courbon, Jonathan
    Mezouar, Youcef
    Guenard, Nicolas
    Martinet, Philippe
    2009 IEEE-RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2009, : 5315 - +
  • [8] Control of a Quadrotor With Reinforcement Learning
    Hwangbo, Jemin
    Sa, Inkyu
    Siegwart, Roland
    Hutter, Marco
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2017, 2 (04): : 2096 - 2103
  • [9] Visual Navigation Using Inverse Reinforcement Learning and an Extreme Learning Machine
    Fang, Qiang
    Zhang, Wenzhuo
    Wang, Xitong
    ELECTRONICS, 2021, 10 (16)
  • [10] Quadrotor Autonomous Navigation in Semi-Known Environments Based on Deep Reinforcement Learning
    Ou, Jiajun
    Guo, Xiao
    Lou, Wenjie
    Zhu, Ming
    REMOTE SENSING, 2021, 13 (21)