Impact time control guidance law with time-varying velocity based on deep reinforcement learning

被引:5
|
作者
Yang, Zhuoqiao [1 ]
Liu, Xiangdong [1 ]
Liu, Haikuo [2 ]
机构
[1] Beijing Inst Technol, Sch Automat, Beijing 100081, Peoples R China
[2] Beijing Inst Technol, Sch Mechatron Engn, Beijing 100081, Peoples R China
关键词
Time-varying velocity; Deep reinforcement learning; Impact time control guidance; Missile guidance;
D O I
10.1016/j.ast.2023.108603
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
This paper investigates the problem of impact-time-control guidance law with the time-varying velocity caused by gravity and aerodynamic drag. Using the deep reinforcement learning (DRL) algorithm, we propose a novel impact time control guidance (ITCG) law in which a DRL agent is trained from scratch without using any prior knowledge. Different from the traditional ITCG law, the proposed method doesn't rely on the time-to-go estimation, which is difficult to derive and inaccurate with the time-varying velocity. Further, a prioritized experience replay method and a novel action exploration method are introduced in the DRL algorithm to improve learning efficiency. Additionally, the agent action is shaped to provide smooth guidance command, which avoids the problem that the guidance command generated by the intelligent algorithm may not be continuous. Numerical simulations are conducted to support the validity of the proposed algorithm.(c) 2023 Elsevier Masson SAS. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Adaptive Deep Learning based Time-Varying Volume Compression
    Pan, Yu
    Zhu, Feiyu
    Gao, Tian
    Yu, Hongfeng
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 1187 - 1194
  • [22] Nonlinear Impact Angle Guidance Law Canceling Autopilot Lag for Time-Varying Thrust
    Cho, Sungjin
    Transactions of the Korean Institute of Electrical Engineers, 2024, 73 (10): : 1699 - 1704
  • [23] Nonsingular Terminal Sliding Mode Guidance Law with Time-Varying Impact Angle Constraints
    Li Xiao-bing
    Cai Yuan-li
    2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 1168 - 1173
  • [24] Analytic Solution for Nonlinear Impact-Angle Guidance Law with Time-Varying Thrust
    Cho, Sungjin
    MATHEMATICS, 2022, 10 (21)
  • [25] Learning control for bilinear parametric systems with time-varying delays and time-varying control gains
    Sun, Yunping
    Wei, Feng
    Li, Jinxu
    Li, Yongbo
    2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 834 - 839
  • [26] Time-Varying Sliding Mode Based Fixed-Time Missile Guidance and Attitude Control With Impact Angle Constraints
    Wei, Shenghui
    Song, Shenmin
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2025, 35 (03) : 972 - 990
  • [27] Field-of-View Constrained Impact Time Control Guidance via Time-Varying Sliding Mode Control
    Ma, Shuai
    Wang, Xugang
    Wang, Zhongyuan
    AEROSPACE, 2021, 8 (09)
  • [28] Evolutionary reinforcement learning system with time-varying parameters
    Umesako, K
    Obayashi, M
    Kobayashi, K
    ELECTRICAL ENGINEERING IN JAPAN, 2006, 156 (01) : 54 - 60
  • [29] Scheduling of Time-Varying Workloads Using Reinforcement Learning
    Mondal, Shanka Subhra
    Sheoran, Nikhil
    Mitra, Subrata
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 9000 - 9008
  • [30] 3D Trajectory Design of UAV Based on Deep Reinforcement Learning in Time-varying Scenes
    Li, Qingya
    Guo, Li
    Dong, Chao
    Mu, Xidong
    2021 THE 7TH INTERNATIONAL CONFERENCE ON COMMUNICATION AND INFORMATION PROCESSING, ICCIP 2021, 2021, : 56 - 62