Online Trajectory Planning Method for Midcourse Guidance Phase Based on Deep Reinforcement Learning

Cited by: 3
Authors
Li, Wanli [1]
Li, Jiong [2]
Li, Ningbo [3]
Shao, Lei [2]
Li, Mingjie [1]
Affiliations
[1] AF Engn Univ, Grad Coll, Xian 710051, Peoples R China
[2] AF Engn Univ, Air Def & Missile Def Coll, Xian 710051, Peoples R China
[3] China Aerodynam Res & Dev Ctr, Mianyang 621000, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
midcourse guidance; online trajectory planning; Markov decision process (MDP); deep deterministic policy gradient (DDPG); course learning (CL); OPTIMIZATION;
DOI
10.3390/aerospace10050441
Chinese Library Classification number
V [Aeronautics, Astronautics]
Discipline classification code
08; 0825
Abstract
To address the online planning of interceptor midcourse guidance trajectories under multiple constraints, an online trajectory planning method based on deep reinforcement learning (DRL) is proposed. A Markov decision process (MDP) is designed for the trajectory planning problem. Its key reward function combines a final reward with a negative per-step feedback reward, which lays the foundation for training the interceptor's trajectory planner on interaction data from a simulation environment. To address unstable learning and low training efficiency, a training strategy that combines course learning (CL) with the deep deterministic policy gradient (DDPG) algorithm is proposed, so that training progresses from simple objectives to complex ones and the convergence of the algorithm improves. Simulation results show that the method generates optimal trajectories with good results, that its trajectory generation is more than 10 times faster than the hp pseudospectral convex method (PSC), and that it resists errors caused mainly by random wind interference, which indicates practical application value and good research prospects.
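The reward structure and the simple-to-complex training progression described in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the paper's actual reward terms, constraint set, or stage-advancement rule, so every name and number below (`Stage`, `CURRICULUM`, `STEP_PENALTY`, `FINAL_BONUS`, `FINAL_FAILURE`, `next_stage`, the tolerances, the 0.9 threshold) is a hypothetical placeholder, and the DDPG agent itself is not shown.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stage:
    """One hypothetical curriculum stage, defined by a terminal tolerance."""
    miss_tolerance_m: float  # allowed terminal position error, in meters


# Assumed curriculum: the terminal objective tightens from easy to hard.
CURRICULUM = (Stage(5000.0), Stage(1000.0), Stage(200.0))

STEP_PENALTY = -0.01    # assumed negative feedback applied at every step
FINAL_BONUS = 10.0      # assumed final reward when the objective is met
FINAL_FAILURE = -10.0   # assumed final reward when the objective is missed


def reward(done: bool, miss_distance_m: float, stage: Stage) -> float:
    """Return the step penalty mid-episode and the final reward at the end."""
    if not done:
        return STEP_PENALTY
    if miss_distance_m <= stage.miss_tolerance_m:
        return FINAL_BONUS
    return FINAL_FAILURE


def next_stage(index: int, success_rate: float, threshold: float = 0.9) -> int:
    """Advance to the next stage once the recent success rate is high enough."""
    if success_rate >= threshold and index < len(CURRICULUM) - 1:
        return index + 1
    return index
```

In this sketch the same terminal miss distance can earn the bonus in an early stage and the failure reward in a later one, which is one simple way to move training from simple to complex objectives.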
Pages: 17
Related papers
50 in total
  • [21] Online midcourse guidance method for boost phase interception via adaptive convex programming
    Yang, Biao
    Jing, Wuxing
    Gao, Changsheng
    AEROSPACE SCIENCE AND TECHNOLOGY, 2021, 118
  • [23] Cooperative trajectory planning method in later part of midcourse based on velocity estimation
    Cui Z.
    Wei M.
    Li Y.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2023, 45 (09): 2912-2921
  • [24] Autonomous Trajectory Planning Method for Stratospheric Airship Regional Station-Keeping Based on Deep Reinforcement Learning
    Liu, Sitong
    Zhou, Shuyu
    Miao, Jinggang
    Shang, Hai
    Cui, Yuxuan
    Lu, Ying
    AEROSPACE, 2024, 11 (09)
  • [25] Intelligent Vehicle Decision-Making and Trajectory Planning Method Based on Deep Reinforcement Learning in the Frenet Space
    Wang, Jiawei
    Chu, Liang
    Zhang, Yao
    Mao, Yabin
    Guo, Chong
    SENSORS, 2023, 23 (24)
  • [26] Online mobile learning resource recommendation method based on deep reinforcement learning
    Li, Pingyang
    Zhang, Juan
    INTERNATIONAL JOURNAL OF INNOVATION AND SUSTAINABLE DEVELOPMENT, 2025, 19 (01)
  • [27] Trajectory planning for airborne radar in extended target tracking based on deep reinforcement learning
    Zhang, Hongyun
    Chen, Hui
    Zhang, Wenxu
    Zhang, Xindi
    DIGITAL SIGNAL PROCESSING, 2024, 153
  • [28] Trajectory Planning of UAV in Wireless Powered IoT System Based on Deep Reinforcement Learning
    Zhang, Jidong
    Yu, Yu
    Wang, Zhigang
    Ao, Shaopeng
    Tang, Jie
    Zhang, Xiuyin
    Wong, Kai-Kit
    2020 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC), 2020: 645-650
  • [29] A Deep Reinforcement Learning Approach for Federated Learning Optimization with UAV Trajectory Planning
    Zhang, Chunyu
    Liu, Yiming
    Zhang, Zhi
    2023 IEEE 34TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS, PIMRC, 2023
  • [30] An Improved Multimodal Trajectory Prediction Method Based on Deep Inverse Reinforcement Learning
    Chen, Ting
    Guo, Changxin
    Li, Hao
    Gao, Tao
    Chen, Lei
    Tu, Huizhao
    Yang, Jiangtian
    ELECTRONICS, 2022, 11 (24)