Guidance law of interceptors against a high-speed maneuvering target based on deep Q-Network

被引:9
|
作者
Wu, Ming-yu [1 ]
He, Xian-jun [1 ]
Qiu, Zhi-ming [2 ]
Chen, Zhi-hua [1 ]
机构
[1] Nanjing Univ Sci & Technol, Natl Key Lab Transient Phys, Nanjing 210094, Peoples R China
[2] Naval Res Acad, Shanghai, Peoples R China
关键词
High-speed maneuvering target; guidance law; convergence of LOS rate; deep reinforcement learning; deep Q-Network; prioritized experience replay; PROPORTIONAL-NAVIGATION;
D O I
10.1177/01423312211052742
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a novel guidance law for intercepting a high-speed maneuvering target based on deep reinforcement learning, which mainly includes the interceptor-target relative motion model and value function approximation model based on deep Q-Network (DQN) with prioritized experience replay. First, a method called prioritized experience replay is applied to extract more efficient samples and reduce the training time. Second, to cope with the discrete action space of DQN, a normal acceleration is introduced to the state space, and the normal acceleration rate is chosen as the action. Then, the continuous normal acceleration command is obtained using numerical integral method. Third, to make the line-of-sight (LOS) rate converge rapidly, the reward function whose absolute value tends to zero has been constructed. Finally, compared with proportional navigation guidance (PNG) and the Q-Learning-based guidance law (QLG), the simulation experiments are implemented to intercept high-speed maneuvering targets at different acceleration policies. Simulation results demonstrate that the proposed DQN-based guidance law (DQNG) can obtain continuous acceleration command, make the LOS rate converge to zero rapidly, and hit the maneuvering targets using only the LOS rate. It also confirms that DQNG can realize the parallel-like approach and improve the interception performance of the interceptor to high-speed maneuvering targets. The proposed DQNG also has the advantages of avoiding the complicated formula derivation of traditional guidance law and eliminates the acceleration buffeting.
引用
收藏
页码:1373 / 1387
页数:15
相关论文
共 50 条
  • [41] Distributed observer-based fixed-time cooperative guidance law against maneuvering target
    He, Zhichuan
    Fan, Shipeng
    Wang, Jiang
    Wang, Peng
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2023, 34 (01) : 27 - 53
  • [42] High-speed maneuvering target detection approach based on joint RFT and keystone transform
    TIAN Jing
    CUI Wei
    SHEN Qing
    WEI ZiXiang
    WU SiLiang
    Science China(Information Sciences), 2013, 56 (06) : 93 - 105
  • [43] High-speed maneuvering target detection approach based on joint RFT and keystone transform
    Jing Tian
    Wei Cui
    Qing Shen
    ZiXiang Wei
    SiLiang Wu
    Science China Information Sciences, 2013, 56 : 1 - 13
  • [44] Combined Proportional Navigation Law for Intercepting High-speed Maneuvering Targets Based on MPSP
    Shen, Lianjie
    Wang, Yongzhou
    Hao, Feng
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 8049 - 8054
  • [45] The X-Layer Optimization in CRN Using Deep Q-Network for Secure High Speed Communication
    Islam, Chowdhury Sajadul
    Mollah, Md. Sarwar Hossain
    2019 11TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND ELECTRICAL ENGINEERING (ICITEE 2019), 2019,
  • [46] Supervised contrastive deep Q-Network for imbalanced radar automatic target recognition
    Liu, Guanliang
    Chen, Wenchao
    Chen, Bo
    Feng, Bo
    Wang, Penghui
    Liu, Hongwei
    Pattern Recognition, 2025, 161
  • [47] A New Feature Selection Algorithm Based on Deep Q-Network
    Li, Xinqian
    Yao, Jie
    Ren, Jia
    Wang, Liqiang
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 7100 - 7105
  • [48] Deep Q-network learning-based active speed management under autonomous driving environments
    Kang, Kawon
    Park, Nuri
    Park, Juneyoung
    Abdel-Aty, Mohamed
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2024,
  • [49] Dynamic constrained evolutionary optimization based on deep Q-network
    Liang, Zhengping
    Yang, Ruitai
    Wang, Jigang
    Liu, Ling
    Ma, Xiaoliang
    Zhu, Zexuan
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
  • [50] Attention-Based Deep Q-Network in Complex Systems
    Ni, Kun
    Yu, Danning
    Liu, Yunlong
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT IV, 2019, 1142 : 323 - 332