UAV autonomous obstacle avoidance via causal reinforcement learning

Authors
Sun, Tao [1]
Gu, Jiaojiao [1]
Mou, Junjie [1]
Affiliation
[1] Naval Aeronautical University, Yantai 264001, People's Republic of China
Keywords
Unmanned aerial vehicles (UAVs); Obstacle avoidance; Navigation; Causal inference; Reinforcement learning; Scale estimation
DOI
10.1016/j.displa.2025.102966
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Unmanned aerial vehicles (UAVs) play an increasingly important role in everyday life, and demand is growing for UAVs that can perform obstacle avoidance and navigation autonomously. Traditional UAV navigation methods typically divide the problem into three stages: perception, mapping, and path planning. This staged approach significantly increases processing delay, so the UAV loses its agility advantage. In this paper, we propose an end-to-end navigation strategy based on causal reinforcement learning that learns directly from data, bypassing the explicit mapping and planning steps and thereby improving responsiveness. To address the problem that a continuous action space prevents the agent from learning effectively from past actions, we introduce an Actor-Critic method with a fixed horizontal plane and a discretized action space. This makes sampling from the experience replay buffer more efficient and stabilizes optimization, which improves the success rate of the reinforcement learning algorithm in UAV obstacle avoidance and navigation tasks. Furthermore, to overcome the limited generalization capability of end-to-end methods, we incorporate causal inference into the reinforcement learning training process. This mitigates overfitting caused by insufficient interaction with the environment during training, and so increases the success rate of UAVs performing obstacle avoidance and navigation in unfamiliar environments. We evaluate the effect of causal inference on generalization with two quantitative metrics: the number of steps to convergence in the training environment, and the navigation success rate for random targets in the testing environment. The results demonstrate that causal inference can effectively reduce overfitting of the policy network to the training environment.
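The abstract does not give the details of the discretized action space, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a hypothetical action set of (forward speed, yaw rate) pairs at a fixed altitude, and a categorical policy that samples one action index from actor logits. Every name and numeric value here (`SPEEDS`, `YAW_RATES`, `sample_action`, `step_pose`, the time step) is an illustrative assumption.

```python
import math
import random

# Hypothetical discretization: these values are assumptions, not from the paper.
SPEEDS = [0.5, 1.0, 2.0]          # forward speed, m/s
YAW_RATES = [-30.0, 0.0, 30.0]    # yaw rate, deg/s

# Each discrete action is a (speed, yaw_rate) pair; altitude is held fixed,
# so motion stays in one horizontal plane.
ACTIONS = [(v, w) for v in SPEEDS for w in YAW_RATES]


def softmax(logits):
    """Convert actor logits into a probability distribution over actions."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample_action(logits, rng):
    """Sample one discrete action index from the categorical policy."""
    probs = softmax(logits)
    index = rng.choices(range(len(ACTIONS)), weights=probs, k=1)[0]
    return index, ACTIONS[index]


def step_pose(x, y, heading_deg, action, dt=0.1):
    """Advance a planar pose by one time step under the chosen action."""
    speed, yaw_rate = action
    heading_deg = (heading_deg + yaw_rate * dt) % 360.0
    rad = math.radians(heading_deg)
    return (x + speed * math.cos(rad) * dt,
            y + speed * math.sin(rad) * dt,
            heading_deg)
```

Because the policy outputs an index into a finite table, a replay buffer in this kind of setup only needs to store that integer per transition, and the critic can score every action in one pass. This is a common motivation for discretization in general; the paper's own reasoning may differ.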
Pages: 10