Deep-reinforcement-learning-based UAV autonomous navigation and collision avoidance in unknown environments

被引:0
|
作者
Fei WANG [1 ]
Xiaoping ZHU [1 ]
Zhou ZHOU [2 ]
Yang TANG [1 ]
机构
[1] School of Astronautics, Northwestern Polytechnical University
[2] School of Aeronautics, Northwestern Polytechnical University
关键词
D O I
暂无
中图分类号
V279 [无人驾驶飞机]; V249.3 [导航];
学科分类号
081105 ; 1111 ;
摘要
In some military application scenarios, Unmanned Aerial Vehicles(UAVs) need to perform missions with the assistance of on-board cameras when radar is not available and communication is interrupted, which brings challenges for UAV autonomous navigation and collision avoidance. In this paper, an improved deep-reinforcement-learning algorithm, Deep Q-Network with a Faster R-CNN model and a Data Deposit Mechanism(FRDDM-DQN), is proposed. A Faster R-CNN model(FR) is introduced and optimized to obtain the ability to extract obstacle information from images, and a new replay memory Data Deposit Mechanism(DDM) is designed to train an agent with a better performance. During training, a two-part training approach is used to reduce the time spent on training as well as retraining when the scenario changes. In order to verify the performance of the proposed method, a series of experiments, including training experiments, test experiments, and typical episodes experiments, is conducted in a 3D simulation environment. Experimental results show that the agent trained by the proposed FRDDM-DQN has the ability to navigate autonomously and avoid collisions, and performs better compared to the FRDQN, FR-DDQN, FR-Dueling DQN, YOLO-based YDDM-DQN, and original FR outputbased FR-ODQN.
引用
收藏
页码:237 / 257
页数:21
相关论文
共 50 条
  • [1] Deep-reinforcement-learning-based UAV autonomous navigation and collision avoidance in unknown environments
    Wang, Fei
    Zhu, Xiaoping
    Zhou, Zhou
    Tang, Yang
    CHINESE JOURNAL OF AERONAUTICS, 2024, 37 (03) : 237 - 257
  • [2] Deep-Reinforcement-Learning-Based Collision Avoidance in UAV Environment
    Ouahouah, Sihem
    Bagaa, Miloud
    Prados-Garzon, Jonathan
    Taleb, Tarik
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (06) : 4015 - 4030
  • [3] Deep-Reinforcement-Learning-Based Autonomous UAV Navigation With Sparse Rewards
    Wang, Chao
    Wang, Jian
    Wang, Jingjing
    Zhang, Xudong
    IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (07): : 6180 - 6190
  • [4] Holistic Deep-Reinforcement-Learning-based Training for Autonomous Navigation in Crowded Environments
    Kaestner, Linh
    Meusel, Marvin
    Bhuiyan, Teham
    Lambrecht, Jens
    2023 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS, AIM, 2023, : 1302 - 1308
  • [5] Autonomous Navigation for Exploration of Unknown Environments and Collision Avoidance in Mobile Robots Using Reinforcement Learning
    Cardona, G. A.
    Bravo, C.
    Quesada, W.
    Ruiz, D.
    Obeng, M.
    Wu, X.
    Calderon, J. M.
    2019 IEEE SOUTHEASTCON, 2019,
  • [6] Deep-Reinforcement-Learning-Based Autonomous Establishment of Local Positioning Systems in Unknown Indoor Environments
    Wu, Zhen
    Yao, Zheng
    Lu, Mingquan
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (15) : 13626 - 13637
  • [7] Deep-Reinforcement-Learning-Based Collision Avoidance of Autonomous Driving System for Vulnerable Road User Safety
    Chen, Haochong
    Cao, Xincheng
    Guvenc, Levent
    Aksun-Guvenc, Bilin
    ELECTRONICS, 2024, 13 (10)
  • [8] Autonomous obstacle avoidance of UAV based on deep reinforcement learning
    Yang, Songyue
    Yu, Guizhen
    Meng, Zhijun
    Wang, Zhangyu
    Li, Han
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (04) : 3323 - 3335
  • [9] Deep-Reinforcement-Learning-Based Semantic Navigation of Mobile Robots in Dynamic Environments
    Kaestner, Linh
    Marx, Cornelius
    Lambrecht, Jens
    2020 IEEE 16TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2020, : 1110 - 1115
  • [10] Autonomous navigation of UAV in multi-obstacle environments based on a Deep Reinforcement Learning approach
    Zhang, Sitong
    Li, Yibing
    Dong, Qianhui
    Applied Soft Computing, 2022, 115