Impact time control guidance law with time-varying velocity based on deep reinforcement learning

被引：5

作者：

Yang, Zhuoqiao ^{[1
]}

Liu, Xiangdong ^{[1
]}

Liu, Haikuo ^{[2
]}

机构：

[1] Beijing Inst Technol, Sch Automat, Beijing 100081, Peoples R China

[2] Beijing Inst Technol, Sch Mechatron Engn, Beijing 100081, Peoples R China

来源：

AEROSPACE SCIENCE AND TECHNOLOGY | 2023年 / 142卷

关键词：

Time-varying velocity; Deep reinforcement learning; Impact time control guidance; Missile guidance;

D O I：

10.1016/j.ast.2023.108603

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

This paper investigates the problem of impact-time-control guidance law with the time-varying velocity caused by gravity and aerodynamic drag. Using the deep reinforcement learning (DRL) algorithm, we propose a novel impact time control guidance (ITCG) law in which a DRL agent is trained from scratch without using any prior knowledge. Different from the traditional ITCG law, the proposed method doesn't rely on the time-to-go estimation, which is difficult to derive and inaccurate with the time-varying velocity. Further, a prioritized experience replay method and a novel action exploration method are introduced in the DRL algorithm to improve learning efficiency. Additionally, the agent action is shaped to provide smooth guidance command, which avoids the problem that the guidance command generated by the intelligent algorithm may not be continuous. Numerical simulations are conducted to support the validity of the proposed algorithm.(c) 2023 Elsevier Masson SAS. All rights reserved.

引用

页数：11

共 50 条

[21] Adaptive Deep Learning based Time-Varying Volume Compression
Pan, Yu
Zhu, Feiyu
Gao, Tian
Yu, Hongfeng
2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 1187 - 1194
[22] Nonlinear Impact Angle Guidance Law Canceling Autopilot Lag for Time-Varying Thrust
Cho, Sungjin
Transactions of the Korean Institute of Electrical Engineers, 2024, 73 (10): : 1699 - 1704
[23] Nonsingular Terminal Sliding Mode Guidance Law with Time-Varying Impact Angle Constraints
Li Xiao-bing
Cai Yuan-li
2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 1168 - 1173
[24] Analytic Solution for Nonlinear Impact-Angle Guidance Law with Time-Varying Thrust
Cho, Sungjin
MATHEMATICS, 2022, 10 (21)
[25] Learning control for bilinear parametric systems with time-varying delays and time-varying control gains
Sun, Yunping
Wei, Feng
Li, Jinxu
Li, Yongbo
2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 834 - 839
[26] Time-Varying Sliding Mode Based Fixed-Time Missile Guidance and Attitude Control With Impact Angle Constraints
Wei, Shenghui
Song, Shenmin
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2025, 35 (03) : 972 - 990
[27] Field-of-View Constrained Impact Time Control Guidance via Time-Varying Sliding Mode Control
Ma, Shuai
Wang, Xugang
Wang, Zhongyuan
AEROSPACE, 2021, 8 (09)
[28] Evolutionary reinforcement learning system with time-varying parameters
Umesako, K
Obayashi, M
Kobayashi, K
ELECTRICAL ENGINEERING IN JAPAN, 2006, 156 (01) : 54 - 60
[29] Scheduling of Time-Varying Workloads Using Reinforcement Learning
Mondal, Shanka Subhra
Sheoran, Nikhil
Mitra, Subrata
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 9000 - 9008
[30] 3D Trajectory Design of UAV Based on Deep Reinforcement Learning in Time-varying Scenes
Li, Qingya
Guo, Li
Dong, Chao
Mu, Xidong
2021 THE 7TH INTERNATIONAL CONFERENCE ON COMMUNICATION AND INFORMATION PROCESSING, ICCIP 2021, 2021, : 56 - 62

← 1 2 3 4 5 →