High-Speed Three-Dimensional Aerial Vehicle Evasion Based on a Multi-Stage Dueling Deep Q-Network

Cited by: 0
Authors
Yang, Yefeng [1,2]
Huang, Tao [1,2]
Wang, Xinxin [1]
Wen, Chih-Yung [2]
Huang, Xianlin [1]
Affiliations
[1] Harbin Inst Technol, Ctr Control Theory & Guidance Technol, Harbin 150001, Peoples R China
[2] Hong Kong Polytech Univ, Dept Aeronaut & Aviat Engn, Hong Kong, Peoples R China
Keywords
aerial vehicle evasion; deep reinforcement learning; dueling deep Q-network; multi-stage training; differential game; guidance law; pursuers; maneuver; equation; evaders
DOI
10.3390/aerospace9110673
Chinese Library Classification number
V [Aeronautics and Astronautics]
Discipline classification code
08; 0825
Abstract
This paper proposes a multi-stage dueling deep Q-network (MS-DDQN) algorithm to address the high-speed aerial vehicle evasion problem. High-speed aerial vehicle pursuit and evasion is an ongoing game that attracts significant research attention in autonomous aerial vehicle decision making. However, traditional maneuvering methods are usually not applicable in high-speed scenarios. The MS-DDQN-based method does not depend on the aerial vehicle model; it searches for an approximately optimal maneuvering policy by iteratively interacting with the environment. Furthermore, a multi-stage learning mechanism is introduced to improve the quality of the training data. Simulation experiments compare the proposed method with several typical evasion maneuvering policies and demonstrate the effectiveness and robustness of the proposed MS-DDQN algorithm.
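For readers unfamiliar with the term, the following is a minimal sketch of the standard dueling Q-value aggregation, in which a state value V(s) and per-action advantages A(s, a) are combined as Q(s, a) = V(s) + A(s, a) - mean(A). It is not taken from the paper: this record does not describe the network architecture, the action set, or the multi-stage training schedule, so the function names `dueling_q_values` and `epsilon_greedy` and all numeric inputs are hypothetical and chosen only for illustration.

```python
import random


def dueling_q_values(state_value, advantages):
    """Combine V(s) and A(s, a) into Q(s, a).

    Subtracting the mean advantage is the usual way to make the
    value/advantage split identifiable (assumed here, not from the paper).
    """
    mean_adv = sum(advantages) / len(advantages)
    return [state_value + a - mean_adv for a in advantages]


def epsilon_greedy(q_values, epsilon, rng=random):
    """Pick a random action with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=q_values.__getitem__)


# Hypothetical outputs of a value head and an advantage head for one state
# with three discrete maneuver actions.
q = dueling_q_values(2.0, [0.5, -0.5, 1.5])
action = epsilon_greedy(q, epsilon=0.0)  # epsilon=0 means purely greedy
```

With mean-advantage subtraction, the average of the resulting Q-values equals the state value, which is why the two heads can be learned separately.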
Pages: 17