High-Speed Three-Dimensional Aerial Vehicle Evasion Based on a Multi-Stage Dueling Deep Q-Network

被引:0
|
作者
Yang, Yefeng [1 ,2 ]
Huang, Tao [1 ,2 ]
Wang, Xinxin [1 ]
Wen, Chih-Yung [2 ]
Huang, Xianlin [1 ]
机构
[1] Harbin Inst Technol, Ctr Control Theory & Guidance Technol, Harbin 150001, Peoples R China
[2] Hong Kong Polytech Univ, Dept Aeronaut & Aviat Engn, Hong Kong, Peoples R China
关键词
aerial vehicle evasion; deep reinforcement learning; dueling deep Q-network; multi-stage training; DIFFERENTIAL GAME; GUIDANCE LAW; PURSUERS; MANEUVER; EQUATION; EVADERS;
D O I
10.3390/aerospace9110673
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
This paper proposes a multi-stage dueling deep Q-network (MS-DDQN) algorithm to address the high-speed aerial vehicle evasion problem. High-speed aerial vehicle pursuit and evasion are an ongoing game attracting significant research attention in the field of autonomous aerial vehicle decision making. However, traditional maneuvering methods are usually not applicable in high-speed scenarios. Independent of the aerial vehicle model, the implemented MS-DDQN-based method searches for an approximate optimal maneuvering policy by iteratively interacting with the environment. Furthermore, the multi-stage learning mechanism was introduced to improve the training data quality. Simulation experiments were conducted to compare the proposed method with several typical evasion maneuvering policies and to reveal the effectiveness and robustness of the proposed MS-DDQN algorithm.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Guidance law of interceptors against a high-speed maneuvering target based on deep Q-Network
    Wu, Ming-yu
    He, Xian-jun
    Qiu, Zhi-ming
    Chen, Zhi-hua
    TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2022, 44 (07) : 1373 - 1387
  • [2] Resource Optimization for Multi-Unmanned Aerial Vehicle Formation Communication Based on an Improved Deep Q-Network
    Li, Jie
    Li, Sai
    Xue, Chenyan
    SENSORS, 2023, 23 (05)
  • [3] A Stealth-Distance Dynamic Weight Deep Q-Network Algorithm for Three-Dimensional Path Planning of Unmanned Aerial Helicopter
    Wang, Zeyang
    Huang, Jun
    Yi, Mingxu
    AEROSPACE, 2023, 10 (08)
  • [4] New three-dimensional inverse method for high-speed vehicle design
    Lee, JW
    Mason, WH
    JOURNAL OF SPACECRAFT AND ROCKETS, 1998, 35 (04) : 473 - 479
  • [5] Multi-Target Optimization Strategy for Unmanned Aerial Vehicle Formation in Forest Fire Monitoring Based on Deep Q-Network Algorithm
    Liu, Wenjia
    Lyu, Sung-Ki
    Liu, Tao
    Wu, Yu-Ting
    Qin, Zhen
    DRONES, 2024, 8 (05)
  • [6] Deep Q-network based multi-layer safety lane changing strategy for vehicle platoon
    Zhang, Jinqi
    Yan, Maode
    Zuo, Lei
    IET INTELLIGENT TRANSPORT SYSTEMS, 2024, 18 (04) : 645 - 656
  • [7] Three-Dimensional Path Planning for Unmanned Helicopter Using Memory-Enhanced Dueling Deep Q Network
    Yao, Jiangyi
    Li, Xiongwei
    Zhang, Yang
    Ji, Jingyu
    Wang, Yanchao
    Zhang, Danyang
    Liu, Yicen
    AEROSPACE, 2022, 9 (08)
  • [8] High-speed target multi-stage interception scheme based on game theory
    Wang, Xin
    Yan, Jie
    Meng, Tingwei
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2022, 43 (09):
  • [9] Multiple Unmanned Aerial Vehicle Autonomous Path Planning Algorithm Based on Whale-Inspired Deep Q-Network
    Wang, Wenshan
    Zhang, Guoyin
    Da, Qingan
    Lu, Dan
    Zhao, Yingnan
    Li, Sizhao
    Lang, Dapeng
    DRONES, 2023, 7 (09)
  • [10] Three-dimensional surface structure reconstruction of reflective objects using multi-stage deep learning
    Li, Wenguo
    Yan, Yuyang
    Lin, Hongjun
    Feng, Zeqian
    OPTICAL REVIEW, 2025, 32 (01) : 63 - 75