Online Trajectory Planning Method for Midcourse Guidance Phase Based on Deep Reinforcement Learning

被引:3
|
作者
Li, Wanli [1 ]
Li, Jiong [2 ]
Li, Ningbo [3 ]
Shao, Lei [2 ]
Li, Mingjie [1 ]
机构
[1] AF Engn Univ, Grad Coll, Xian 710051, Peoples R China
[2] AF Engn Univ, Air Def & Missile Def Coll, Xian 710051, Peoples R China
[3] China Aerodynam Res & Dev Ctr, Mianyang 621000, Peoples R China
基金
中国国家自然科学基金;
关键词
midcourse guidance; online trajectory planning; Markov decision process (MDP); deep deterministic policy gradient (DDPG); course learning (CL); OPTIMIZATION;
D O I
10.3390/aerospace10050441
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Concerned with the problem of interceptor midcourse guidance trajectory online planning satisfying multiple constraints, an online midcourse guidance trajectory planning method based on deep reinforcement learning (DRL) is proposed. The Markov decision process (MDP) corresponding to the background of a trajectory planning problem is designed, and the key reward function is composed of the final reward and the negative step feedback reward, which lays the foundation for the interceptor training trajectory planning method in the interactive data of a simulation environment; at the same time, concerned with the problems of unstable learning and training efficiency, a trajectory planning training strategy combined with course learning (CL) and deep deterministic policy gradient (DDPG) is proposed to realize the progressive progression of trajectory planning learning and training from satisfying simple objectives to complex objectives, and improve the convergence of the algorithm. The simulation results show that our method can not only generate the optimal trajectory with good results, but its trajectory generation speed is also more than 10 times faster than the hp pseudo spectral convex method (PSC), and can also resist the error influence mainly caused by random wind interference, which has certain application value and good research prospects.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Deep reinforcement learning-based reactive trajectory planning method for UAVs
    Cao, Lijia
    Wang, Lin
    Liu, Yang
    Xu, Weihong
    Geng, Chuang
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART G-JOURNAL OF AEROSPACE ENGINEERING, 2024, 238 (10) : 1018 - 1037
  • [2] On-line Rapid Planning Method for Interceptor Missile Midcourse Guidance Trajectory
    Li W.
    Li J.
    Li M.
    Binggong Xuebao/Acta Armamentarii, 2021, 42 (12): : 2617 - 2625
  • [3] Fast Trajectory Generation Method for Midcourse Guidance Based on Convex Optimization
    Zhang, Jinlin
    Li, Jiong
    Zhou, Chijun
    Lei, Humin
    Li, Wanli
    INTERNATIONAL JOURNAL OF AEROSPACE ENGINEERING, 2022, 2022
  • [4] Fast Trajectory Generation Method for Midcourse Guidance Based on Convex Optimization
    Zhang, Jinlin
    Li, Jiong
    Zhou, Chijun
    Lei, Humin
    Li, Wanli
    INTERNATIONAL JOURNAL OF AEROSPACE ENGINEERING, 2022, 2022
  • [5] A Deep Reinforcement Learning Based UAV Trajectory Planning Method For Integrated Sensing And Communications Networks
    Lin, Heyun
    Zhang, Zhihai
    Wei, Longkun
    Zhou, Zihao
    Zheng, Tian
    2023 IEEE 98TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-FALL, 2023,
  • [6] A Trajectory Planning Method for Capture Operation of Space Robotic Arm Based on Deep Reinforcement Learning
    Song, Bing Yang
    Li, Jin Quan
    Liu, Xiao Yu
    Wang, Guo Lei
    JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING, 2024, 24 (09)
  • [7] Online Trajectory Planning with Reinforcement Learning for Pedestrian Avoidance
    Feher, Arpad
    Aradi, Szilard
    Becsi, Tamas
    ELECTRONICS, 2022, 11 (15)
  • [8] Time optimal trajectory planning of excavator based on deep reinforcement learning
    Zhang Y.-Y.
    Sun Z.-Y.
    Sun Q.-L.
    Wang Y.
    Kongzhi yu Juece/Control and Decision, 2024, 39 (05): : 1433 - 1440
  • [9] Deep Reinforcement Learning Based Trajectory Planning Under Uncertain Constraints
    Chen, Lienhung
    Jiang, Zhongliang
    Cheng, Long
    Knoll, Alois C.
    Zhou, Mingchuan
    FRONTIERS IN NEUROROBOTICS, 2022, 16
  • [10] Optimizing pedestrian simulation based on expert trajectory guidance and deep reinforcement learning
    Senlin Mu
    Xiao Huang
    Moyang Wang
    Di Zhang
    Dong Xu
    Xiang Li
    GeoInformatica, 2023, 27 : 709 - 736