Guidance law of interceptors against a high-speed maneuvering target based on deep Q-Network

Cited by: 9
Authors
Wu, Ming-yu [1]
He, Xian-jun [1]
Qiu, Zhi-ming [2]
Chen, Zhi-hua [1]
Affiliations
[1] Nanjing Univ Sci & Technol, Natl Key Lab Transient Phys, Nanjing 210094, Peoples R China
[2] Naval Res Acad, Shanghai, Peoples R China
Keywords
High-speed maneuvering target; guidance law; convergence of LOS rate; deep reinforcement learning; deep Q-Network; prioritized experience replay; PROPORTIONAL-NAVIGATION;
DOI
10.1177/01423312211052742
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
This paper proposes a novel guidance law for intercepting a high-speed maneuvering target based on deep reinforcement learning. The approach comprises an interceptor-target relative motion model and a value function approximation model based on a deep Q-Network (DQN) with prioritized experience replay. First, prioritized experience replay is applied to extract more efficient samples and reduce the training time. Second, to cope with the discrete action space of DQN, the normal acceleration is introduced into the state space and the normal acceleration rate is chosen as the action; the continuous normal acceleration command is then obtained by numerical integration. Third, to make the line-of-sight (LOS) rate converge rapidly, a reward function whose absolute value tends to zero is constructed. Finally, simulation experiments that intercept high-speed maneuvering targets under different acceleration policies compare the proposed law with proportional navigation guidance (PNG) and the Q-Learning-based guidance law (QLG). The simulation results demonstrate that the proposed DQN-based guidance law (DQNG) can obtain a continuous acceleration command, make the LOS rate converge to zero rapidly, and hit the maneuvering targets using only the LOS rate. They also confirm that DQNG can realize a parallel-like approach and improve the interception performance of the interceptor against high-speed maneuvering targets. DQNG also avoids the complicated formula derivation of traditional guidance laws and eliminates acceleration buffeting.
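The three mechanisms named in the abstract can be illustrated with a minimal sketch. Nothing below is taken from the paper itself: the action values, time step, acceleration limit, the Euler integration scheme, the reward form (negative absolute LOS rate), and the proportional variant of prioritized experience replay are all assumptions chosen for illustration, and every function name is hypothetical.

```python
import numpy as np

# Hypothetical discrete action set: normal acceleration rates in m/s^3.
ACTIONS = np.array([-50.0, 0.0, 50.0])
DT = 0.01       # assumed integration step in seconds
A_MAX = 200.0   # assumed normal acceleration limit in m/s^2


def integrate_command(a_prev, action_index, dt=DT, a_max=A_MAX):
    """Euler-integrate the chosen acceleration rate into an acceleration command.

    Because the command changes by at most max|ACTIONS| * dt per step,
    a discrete action set still yields a smooth command history.
    """
    a_next = a_prev + ACTIONS[action_index] * dt
    return float(np.clip(a_next, -a_max, a_max))


def reward(los_rate, scale=1.0):
    """Assumed reward: non-positive, with magnitude shrinking as the LOS rate does."""
    return -scale * abs(los_rate)


def per_probabilities(td_errors, alpha=0.6, eps=1e-6):
    """Proportional prioritization: P(i) = p_i^alpha / sum_k p_k^alpha,
    with p_i = |TD error_i| + eps (assumed variant, not confirmed by the abstract)."""
    priorities = (np.abs(td_errors) + eps) ** alpha
    return priorities / priorities.sum()


if __name__ == "__main__":
    a_cmd = 0.0
    for idx in (2, 2, 1, 0):          # an arbitrary action sequence
        a_cmd = integrate_command(a_cmd, idx)
        print(a_cmd)
    print(reward(-0.2))
    print(per_probabilities(np.array([1.0, 0.0, 2.0])))
```

In this sketch the network would see the current normal acceleration as part of its state, which is what lets a rate-valued action produce a continuous command; transitions with larger temporal-difference error are drawn more often from the replay buffer.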
Pages: 1373-1387
Number of pages: 15
Related papers
50 in total
  • [1] Integrated guidance and control of interceptors with impact angle constraint against a high-speed maneuvering target
    Hu, Guanjie
    Guo, Jianguo
    Zhou, Jun
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART G-JOURNAL OF AEROSPACE ENGINEERING, 2019, 233 (14) : 5192 - 5204
  • [2] A New Guidance Algorithm Against High-Speed Maneuvering Target
    Song, Kyoung-Rok
    Kim, Tae-Hun
    Lee, Chang-Hun
    Tahk, Min-Jea
    INTERNATIONAL JOURNAL OF AERONAUTICAL AND SPACE SCIENCES, 2021, 22 (05) : 1170 - 1182
  • [3] A New Guidance Algorithm Against High-Speed Maneuvering Target
    Kyoung-Rok Song
    Tae-Hun Kim
    Chang-Hun Lee
    Min-Jea Tahk
    International Journal of Aeronautical and Space Sciences, 2021, 22 : 1170 - 1182
  • [4] Performance analysis of differential geometric guidance law against high-speed target with arbitrarily maneuvering acceleration
    Li, Ke-Bo
    Su, Wen-Shan
    Chen, Lei
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART G-JOURNAL OF AEROSPACE ENGINEERING, 2019, 233 (10) : 3547 - 3563
  • [5] Capture zones and differential game guidance law for high-speed maneuvering target interception
    Mao B.
    Li J.
    Zhang R.
    Zhang P.
    Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology, 2021, 43 (03) : 165 - 174
  • [6] Design of an Optimal Predictable Midcourse Guidance Law to Intercept the High-speed Maneuvering Target
    Yu, Chenfei
    Zhou, Dezhao
    Hu, Leili
    Zhang, Hao
    2018 IEEE CSAA GUIDANCE, NAVIGATION AND CONTROL CONFERENCE (CGNCC), 2018,
  • [7] ADVANCED MISSILE GUIDANCE-SYSTEM AGAINST A VERY HIGH-SPEED MANEUVERING TARGET
    KURODA, T
    IMADO, F
    AIAA GUIDANCE, NAVIGATION AND CONTROL CONFERENCE, PTS 1 AND 2: A COLLECTION OF TECHNICAL PAPERS, 1989, : 176 - 180
  • [8] Design of fuzzy logic guidance law against high-speed target
    Lin, CL
    Chen, YY
    JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2000, 23 (01) : 17 - 25
  • [9] Optimal guidance law for intercepting high-speed maneuvering targets
    Dun X.
    Li J.
    Cai J.
    Li, Junlong (45514362@qq.com), 2018, National University of Defense Technology (40): : 176 - 182
  • [10] Memory-extraction-based DRL cooperative guidance against the maneuvering target protected by interceptors
    Sun, Hao
    Yan, Shi
    Liang, Yan
    Ma, Chaoxiong
    Zhang, Tao
    Pei, Liuyu
    Aerospace Science and Technology, 155