Online Trajectory Planning Method for Midcourse Guidance Phase Based on Deep Reinforcement Learning

被引：3

作者：

Li, Wanli ^{[1
]}

Li, Jiong ^{[2
]}

Li, Ningbo ^{[3
]}

Shao, Lei ^{[2
]}

Li, Mingjie ^{[1
]}

机构：

[1] AF Engn Univ, Grad Coll, Xian 710051, Peoples R China

[2] AF Engn Univ, Air Def & Missile Def Coll, Xian 710051, Peoples R China

[3] China Aerodynam Res & Dev Ctr, Mianyang 621000, Peoples R China

来源：

AEROSPACE | 2023年 / 10卷 / 05期

基金：

中国国家自然科学基金;

关键词：

midcourse guidance; online trajectory planning; Markov decision process (MDP); deep deterministic policy gradient (DDPG); course learning (CL); OPTIMIZATION;

D O I：

10.3390/aerospace10050441

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

Concerned with the problem of interceptor midcourse guidance trajectory online planning satisfying multiple constraints, an online midcourse guidance trajectory planning method based on deep reinforcement learning (DRL) is proposed. The Markov decision process (MDP) corresponding to the background of a trajectory planning problem is designed, and the key reward function is composed of the final reward and the negative step feedback reward, which lays the foundation for the interceptor training trajectory planning method in the interactive data of a simulation environment; at the same time, concerned with the problems of unstable learning and training efficiency, a trajectory planning training strategy combined with course learning (CL) and deep deterministic policy gradient (DDPG) is proposed to realize the progressive progression of trajectory planning learning and training from satisfying simple objectives to complex objectives, and improve the convergence of the algorithm. The simulation results show that our method can not only generate the optimal trajectory with good results, but its trajectory generation speed is also more than 10 times faster than the hp pseudo spectral convex method (PSC), and can also resist the error influence mainly caused by random wind interference, which has certain application value and good research prospects.

引用

页数：17

共 50 条

[31] Improved Robot Path Planning Method Based on Deep Reinforcement Learning
Han, Huiyan
Wang, Jiaqi
Kuang, Liqun
Han, Xie
Xue, Hongxin
SENSORS, 2023, 23 (12)
[32] Path planning of manipulator based on deep reinforcement learning and screw method
Wang Y.
Wang Y.-H.
Yin Z.-Z.
Wan P.
Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2023, 40 (03): : 516 - 524
[33] A path planning method based on deep reinforcement learning for crowd evacuation
Meng X.
Liu H.
Li W.
Journal of Ambient Intelligence and Humanized Computing, 2024, 15 (6) : 2925 - 2939
[34] Emergency communication network planning method based on deep reinforcement learning
Yin C.
Yang R.
Zhu W.
Zou X.
Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2020, 42 (09): : 2091 - 2097
[35] Online Multimodal Transportation Planning using Deep Reinforcement Learning
Farahani, Amirreza
Genga, Laura
Dijkman, Remco
2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 1691 - 1698
[36] Deep Reinforcement Learning With Optimized Reward Functions for Robotic Trajectory Planning
Xie, Jiexin
Shao, Zhenzhou
Li, Yue
Guan, Yong
Tan, Jindong
IEEE ACCESS, 2019, 7 : 105669 - 105679
[37] Trajectory Planning for Automated Parking Systems Using Deep Reinforcement Learning
Du, Zhuo
Miao, Qiheng
Zong, Changfu
INTERNATIONAL JOURNAL OF AUTOMOTIVE TECHNOLOGY, 2020, 21 (04) : 881 - 887
[38] Trajectory Planning for Automated Parking Systems Using Deep Reinforcement Learning
Zhuo Du
Qiheng Miao
Changfu Zong
International Journal of Automotive Technology, 2020, 21 : 881 - 887
[39] Online longitudinal trajectory planning for connected and autonomous vehicles in mixed traffic flow with deep reinforcement learning approach
Cheng, Yanqiu
Hu, Xianbiao
Chen, Kuanmin
Yu, Xinlian
Luo, Yulong
JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 27 (03) : 396 - 410
[40] Automatic Ultrasound Guidance Based on Deep Reinforcement Learning
Jarosik, Piotr
Lewandowski, Marcin
2019 IEEE INTERNATIONAL ULTRASONICS SYMPOSIUM (IUS), 2019, : 475 - 478

← 1 2 3 4 5 →