Online Trajectory Planning Method for Midcourse Guidance Phase Based on Deep Reinforcement Learning

被引:3
|
作者
Li, Wanli [1 ]
Li, Jiong [2 ]
Li, Ningbo [3 ]
Shao, Lei [2 ]
Li, Mingjie [1 ]
机构
[1] AF Engn Univ, Grad Coll, Xian 710051, Peoples R China
[2] AF Engn Univ, Air Def & Missile Def Coll, Xian 710051, Peoples R China
[3] China Aerodynam Res & Dev Ctr, Mianyang 621000, Peoples R China
基金
中国国家自然科学基金;
关键词
midcourse guidance; online trajectory planning; Markov decision process (MDP); deep deterministic policy gradient (DDPG); course learning (CL); OPTIMIZATION;
D O I
10.3390/aerospace10050441
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Concerned with the problem of interceptor midcourse guidance trajectory online planning satisfying multiple constraints, an online midcourse guidance trajectory planning method based on deep reinforcement learning (DRL) is proposed. The Markov decision process (MDP) corresponding to the background of a trajectory planning problem is designed, and the key reward function is composed of the final reward and the negative step feedback reward, which lays the foundation for the interceptor training trajectory planning method in the interactive data of a simulation environment; at the same time, concerned with the problems of unstable learning and training efficiency, a trajectory planning training strategy combined with course learning (CL) and deep deterministic policy gradient (DDPG) is proposed to realize the progressive progression of trajectory planning learning and training from satisfying simple objectives to complex objectives, and improve the convergence of the algorithm. The simulation results show that our method can not only generate the optimal trajectory with good results, but its trajectory generation speed is also more than 10 times faster than the hp pseudo spectral convex method (PSC), and can also resist the error influence mainly caused by random wind interference, which has certain application value and good research prospects.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Reentry trajectory optimization based on Deep Reinforcement Learning
    Gao, Jiashi
    Shi, Xinming
    Cheng, Zhongtao
    Xiong, Jizhang
    Liu, Lei
    Wang, Yongji
    Yang, Ye
    PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 2588 - 2592
  • [42] Anti-collision Trajectory Planning for Satellite Formation Reconstruction Based on Deep Reinforcement Learning
    Li, Hongbo
    Zong, Qun
    Zhang, Xiuyun
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 4672 - 4677
  • [43] Deep reinforcement learning-based rehabilitation robot trajectory planning with optimized reward functions
    Wang, Xusheng
    Xie, Jiexin
    Guo, Shijie
    Li, Yue
    Sun, Pengfei
    Gan, Zhongxue
    ADVANCES IN MECHANICAL ENGINEERING, 2021, 13 (12)
  • [44] Robust Adversarial Attacks Detection Based on Explainable Deep Reinforcement Learning for UAV Guidance and Planning
    Hickling T.
    Aouf N.
    Spencer P.
    IEEE Transactions on Intelligent Vehicles, 2023, 8 (10): : 4381 - 4394
  • [45] Deep reinforcement learning trajectory planning for robotic manipulator based on simulation-efficient training
    Zhao, Bin
    Wu, Yao
    Wu, Chengdong
    Sun, Ruohuai
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [46] Deep reinforcement learning based trajectory real-time planning for hypersonic gliding vehicles
    Li, Jianfeng
    Song, Shenmin
    Shi, Xiaoping
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART G-JOURNAL OF AEROSPACE ENGINEERING, 2024, 238 (16) : 1665 - 1682
  • [47] An Online Deep Reinforcement Learning Based Parameter Identification Method for HVDC System
    Hu, Jianxiong
    Wang, Qi
    Tang, Yi
    2021 IEEE POWER & ENERGY SOCIETY GENERAL MEETING (PESGM), 2021,
  • [48] Mobile Robot Path Planning Method Based on Deep Reinforcement Learning Algorithm
    Meng, Haitao
    Zhang, Hengrui
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2022, 31 (15)
  • [49] A novel method based on deep reinforcement learning for machining process route planning
    Zhang, Hang
    Wang, Wenhu
    Zhang, Shusheng
    Zhang, Yajun
    Zhou, Jingtao
    Wang, Zhen
    Huang, Bo
    Huang, Rui
    ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2024, 86
  • [50] Deep reinforcement learning based planning method in state space for lunar rovers
    Gao, Ai
    Lu, Siyao
    Xu, Rui
    Li, Zhaoyu
    Wang, Bang
    Zhu, Shengying
    Gao, Yuhui
    Pan, Bo
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 127