Optimal Multi-impulse Linear Rendezvous via Reinforcement Learning

Cited by: 11
Authors
Xu, Longwei [1]
Zhang, Gang [1]
Qiu, Shi [1]
Cao, Xibin [1]
Affiliation
[1] Harbin Inst Technol, Res Ctr Satellite Technol, Harbin 150001, Peoples R China
Source
Keywords
OPTIMAL LOW-THRUST; MANEUVERS; NETWORKS; VICINITY; GAME; GO;
DOI
10.34133/space.0047
Chinese Library Classification
V [Aeronautics, Astronautics];
Discipline classification code
08; 0825;
Abstract
A reinforcement learning-based approach is proposed for designing multi-impulse rendezvous trajectories under linear relative motion. For relative motion in elliptical orbits, the relative state is propagated directly with the state transition matrix. The rendezvous problem is formulated as a Markov decision process that reflects the fuel consumption, the transfer time, the relative state, and the dynamical model. An actor-critic algorithm is used to train a policy that generates the rendezvous maneuvers. Results from numerical optimization (e.g., differential evolution) serve as the expert data set to accelerate training. Once the policy network is deployed, multi-impulse rendezvous trajectories can be obtained on board. The approach is also applied to generate feasible solutions with many impulses (e.g., 20 impulses), which can be used as initial values for further optimization. Numerical examples with random initial states show that the proposed method is much faster than the evolutionary algorithm, at the cost of slightly worse performance indexes.
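As a rough illustration of the state-transition-matrix propagation and Markov-decision-process framing summarized in the abstract, the sketch below applies one impulse, coasts, and returns a fuel-based reward. It makes simplifying assumptions that are not taken from the paper: it uses the Clohessy-Wiltshire matrix for a circular reference orbit (the paper treats elliptical orbits), the reward is just the negative impulse magnitude, and the names `cw_stm` and `step` are hypothetical.

```python
import math

def cw_stm(n, t):
    """6x6 Clohessy-Wiltshire state transition matrix.

    Assumption: circular reference orbit with mean motion n (rad/s).
    State order: [x, y, z, vx, vy, vz] with x radial, y along-track,
    z cross-track.
    """
    s, c = math.sin(n * t), math.cos(n * t)
    return [
        [4 - 3 * c,        0, 0,      s / n,            2 * (1 - c) / n,         0],
        [6 * (s - n * t),  1, 0,      -2 * (1 - c) / n, (4 * s - 3 * n * t) / n, 0],
        [0,                0, c,      0,                0,                       s / n],
        [3 * n * s,        0, 0,      c,                2 * s,                   0],
        [-6 * n * (1 - c), 0, 0,      -2 * s,           4 * c - 3,               0],
        [0,                0, -n * s, 0,                0,                       c],
    ]

def step(state, dv, n, dt):
    """One illustrative transition: impulse dv, then coast for dt.

    Returns (next_state, reward). The reward is the negative impulse
    magnitude, a stand-in for the fuel term only.
    """
    # An impulse changes velocity instantaneously; position is unchanged.
    kicked = list(state[:3]) + [v + d for v, d in zip(state[3:], dv)]
    phi = cw_stm(n, dt)
    nxt = [sum(phi[i][j] * kicked[j] for j in range(6)) for i in range(6)]
    reward = -math.sqrt(sum(d * d for d in dv))
    return nxt, reward
```

A policy network in this setting would map the current relative state (and remaining time) to `dv` and possibly `dt`; how the paper encodes the state, action, and reward terms is not specified in this record.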
Pages: 12
Related papers
50 in total
  • [1] Reinforcement learning-based multi-impulse rendezvous approach for satellite constellation reconfiguration
    Xu, Longwei
    Zhang, Gang
    Qiu, Shi
    Cao, Xibin
    [J]. ACTA ASTRONAUTICA, 2024, 224 : 325 - 337
  • [2] Optimal multi-impulse rendezvous based on T-H equations
    Ji, Xiaoqin
    Xiao, Lihong
    Chen, Wenhui
    [J]. Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2014, 40 (07): : 905 - 909
  • [3] OPTIMAL MULTI-IMPULSE TRAJECTORIES
    MINKOFF, M
    LION, PM
    [J]. ASTRONAUTICA ACTA, 1969, 14 (04): : 359 - &
  • [4] Fuel optimal multi-impulse orbit rendezvous between neighboring orbits: A numerical approach
    No, TS
    [J]. ASTRODYNAMICS 1997, 1998, 97 : 707 - 718
  • [5] Optimal Multi-impulse Elliptic Rendezvous Using Stagnation Escaping Whale Optimization Algorithm
    Shim, Eun-Song
    Kim, Hae-Dong
    Lee, Seonho
    [J]. INTERNATIONAL JOURNAL OF AERONAUTICAL AND SPACE SCIENCES, 2024, 25 (03) : 1092 - 1104
  • [6] Regularizing fuel-optimal multi-impulse trajectories
    Kenta Oshima
    [J]. Astrodynamics, 2024, 8 : 97 - 119
  • [7] LINEARIZED THEORY OF OPTIMAL MULTI-IMPULSE ORBITAL TRANSFERS
    KUZMAK, GE
    LAVRENKO, NI
    [J]. JOURNAL OF THE ASTRONAUTICAL SCIENCES, 1964, 11 (03): : 88 - &
  • [8] Regularizing fuel-optimal multi-impulse trajectories
    Oshima, Kenta
    [J]. ASTRODYNAMICS, 2024, 8 (01) : 97 - 119
  • [9] APPROXIMATE MULTI-IMPULSE, LARGE AMPLITUDE OPTIMUM RENDEZVOUS BETWEEN NEIGHBOURING ELLIPTICAL ORBITS
    NGUYEN, VN
    [J]. RECHERCHE AEROSPATIALE, 1971, (03): : 171 - &
  • [10] OPTIMAL MULTI-IMPULSE ORBIT TRANSFER USING NONLINEAR RELATIVE MOTION DYNAMICS
    Huang, Weijun
    [J]. KYLE T. ALFRIEND ASTRODYNAMICS SYMPOSIUM, 2011, 139 : 237 - 256