Online Trajectory Planning Method for Midcourse Guidance Phase Based on Deep Reinforcement Learning

被引:3
|
作者
Li, Wanli [1 ]
Li, Jiong [2 ]
Li, Ningbo [3 ]
Shao, Lei [2 ]
Li, Mingjie [1 ]
机构
[1] AF Engn Univ, Grad Coll, Xian 710051, Peoples R China
[2] AF Engn Univ, Air Def & Missile Def Coll, Xian 710051, Peoples R China
[3] China Aerodynam Res & Dev Ctr, Mianyang 621000, Peoples R China
基金
中国国家自然科学基金;
关键词
midcourse guidance; online trajectory planning; Markov decision process (MDP); deep deterministic policy gradient (DDPG); course learning (CL); OPTIMIZATION;
D O I
10.3390/aerospace10050441
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Concerned with the problem of interceptor midcourse guidance trajectory online planning satisfying multiple constraints, an online midcourse guidance trajectory planning method based on deep reinforcement learning (DRL) is proposed. The Markov decision process (MDP) corresponding to the background of a trajectory planning problem is designed, and the key reward function is composed of the final reward and the negative step feedback reward, which lays the foundation for the interceptor training trajectory planning method in the interactive data of a simulation environment; at the same time, concerned with the problems of unstable learning and training efficiency, a trajectory planning training strategy combined with course learning (CL) and deep deterministic policy gradient (DDPG) is proposed to realize the progressive progression of trajectory planning learning and training from satisfying simple objectives to complex objectives, and improve the convergence of the algorithm. The simulation results show that our method can not only generate the optimal trajectory with good results, but its trajectory generation speed is also more than 10 times faster than the hp pseudo spectral convex method (PSC), and can also resist the error influence mainly caused by random wind interference, which has certain application value and good research prospects.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Improved Robot Path Planning Method Based on Deep Reinforcement Learning
    Han, Huiyan
    Wang, Jiaqi
    Kuang, Liqun
    Han, Xie
    Xue, Hongxin
    SENSORS, 2023, 23 (12)
  • [32] Path planning of manipulator based on deep reinforcement learning and screw method
    Wang Y.
    Wang Y.-H.
    Yin Z.-Z.
    Wan P.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2023, 40 (03): : 516 - 524
  • [33] A path planning method based on deep reinforcement learning for crowd evacuation
    Meng X.
    Liu H.
    Li W.
    Journal of Ambient Intelligence and Humanized Computing, 2024, 15 (6) : 2925 - 2939
  • [34] Emergency communication network planning method based on deep reinforcement learning
    Yin C.
    Yang R.
    Zhu W.
    Zou X.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2020, 42 (09): : 2091 - 2097
  • [35] Online Multimodal Transportation Planning using Deep Reinforcement Learning
    Farahani, Amirreza
    Genga, Laura
    Dijkman, Remco
    2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 1691 - 1698
  • [36] Deep Reinforcement Learning With Optimized Reward Functions for Robotic Trajectory Planning
    Xie, Jiexin
    Shao, Zhenzhou
    Li, Yue
    Guan, Yong
    Tan, Jindong
    IEEE ACCESS, 2019, 7 : 105669 - 105679
  • [37] Trajectory Planning for Automated Parking Systems Using Deep Reinforcement Learning
    Du, Zhuo
    Miao, Qiheng
    Zong, Changfu
    INTERNATIONAL JOURNAL OF AUTOMOTIVE TECHNOLOGY, 2020, 21 (04) : 881 - 887
  • [38] Trajectory Planning for Automated Parking Systems Using Deep Reinforcement Learning
    Zhuo Du
    Qiheng Miao
    Changfu Zong
    International Journal of Automotive Technology, 2020, 21 : 881 - 887
  • [39] Online longitudinal trajectory planning for connected and autonomous vehicles in mixed traffic flow with deep reinforcement learning approach
    Cheng, Yanqiu
    Hu, Xianbiao
    Chen, Kuanmin
    Yu, Xinlian
    Luo, Yulong
    JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 27 (03) : 396 - 410
  • [40] Automatic Ultrasound Guidance Based on Deep Reinforcement Learning
    Jarosik, Piotr
    Lewandowski, Marcin
    2019 IEEE INTERNATIONAL ULTRASONICS SYMPOSIUM (IUS), 2019, : 475 - 478