Spacecraft Proximity Maneuvering and Rendezvous With Collision Avoidance Based on Reinforcement Learning

被引:16
|
作者
Qu, Qingyu [1 ]
Liu, Kexin [2 ]
Wang, Wei [2 ]
Lu, Jinhu [2 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
[2] Beihang Univ, Sch Automat Sci & Elect Engn, State Key Lab Software Dev Environm, Beijing 100191, Peoples R China
基金
美国国家科学基金会;
关键词
Space vehicles; Heuristic algorithms; Aerodynamics; Collision avoidance; Oscillators; Orbits; Mathematical models; Aerospace control; autonomous spacecraft rendezvous (ASR); collision avoidance; deep reinforcement learning (DRL); SLIDING MODE CONTROL;
D O I
10.1109/TAES.2022.3180271
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
The rapid development of the aerospace industry puts forward the urgent need for the evolution of autonomous spacecraft rendezvous technology, which has gained significant attention recently due to increased applications in various space missions. This article studies the relative position tracking problem of the autonomous spacecraft rendezvous under the requirement of collision avoidance. An exploration-adaptive deep deterministic policy gradient (DDPG) algorithm is proposed to train a definite control strategy for this mission. Similar to the DDPG algorithm, four neural networks are used in this method, where two of them are used to generate the deterministic policy, whereas the other two are used to score the obtained policy. Differently, adaptive noise is introduced to reduce the possibility of oscillations and divergences and to cut down the unnecessary computation by weakening the exploration of stabilization problems. In addition, in order to effectively and quickly adapt to some other similar scenarios, a metalearning-based idea is introduced by fine-tuning the prior strategy. Finally, two numerical simulations show that the trained control strategy can effectively avoid the oscillation phenomenon caused by the artificial potential function. Benefiting from this, the trained control strategy based on deep reinforcement learning technology can decrease the energy consumption by 16.44% during the close proximity phase, compared with the traditional artificial potential function method. Besides, after introducing the metalearning-based idea, a strategy available for some other perturbed scenarios can be trained in a relatively short period of time, which illustrates its adaptability.
引用
收藏
页码:5823 / 5834
页数:12
相关论文
共 50 条
  • [1] Learning Reference Governor for Constrained Spacecraft Rendezvous and Proximity Maneuvering
    Ikeya, Kosuke
    Liu, Kaiwen
    Girard, Anouck
    Kolmanovsky, Ilya
    [J]. JOURNAL OF SPACECRAFT AND ROCKETS, 2023, 60 (04) : 1127 - 1141
  • [2] A Novel Reinforcement Learning Collision Avoidance Algorithm for USVs Based on Maneuvering Characteristics and COLREGs
    Fan, Yunsheng
    Sun, Zhe
    Wang, Guofeng
    [J]. SENSORS, 2022, 22 (06)
  • [3] Model Predictive Control approach for guidance of spacecraft rendezvous and proximity maneuvering
    Di Cairano, S.
    Park, H.
    Kolmanovsky, I.
    [J]. INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2012, 22 (12) : 1398 - 1427
  • [4] Model predictive control for close-proximity maneuvering of spacecraft with adaptive convexification of collision avoidance constraints
    Wang, Xun
    Li, Yanyan
    Zhang, Xueyang
    Zhang, Rui
    Yang, Daoning
    [J]. ADVANCES IN SPACE RESEARCH, 2023, 71 (01) : 477 - 491
  • [5] Autonomous spacecraft collision avoidance with a variable number of space debris based on safe reinforcement learning
    Mu, Chaoxu
    Liu, Shuo
    Lu, Ming
    Liu, Zhaoyang
    Cui, Lei
    Wang, Ke
    [J]. AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 149
  • [6] Sliding mode control for autonomous spacecraft rendezvous with collision avoidance
    Li, Qi
    Yuan, Jianping
    Wang, Huan
    [J]. ACTA ASTRONAUTICA, 2018, 151 : 743 - 751
  • [7] Optimal Sliding Mode Control for Spacecraft Rendezvous with Collision Avoidance
    Feng, Licheng
    Ni, Qing
    Bai, Yuzhu
    Chen, Xiaoqian
    Zhao, Yong
    [J]. 2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 2661 - 2668
  • [8] Optimal control of spacecraft for close proximity with collision avoidance
    Ni, Qing
    Feng, Licheng
    Huang, Yiyong
    [J]. 2016 3RD INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE), 2016, : 1019 - 1023
  • [9] A Collision Avoidance Method Based on Deep Reinforcement Learning
    Feng, Shumin
    Sebastian, Bijo
    Ben-Tzvi, Pinhas
    [J]. ROBOTICS, 2021, 10 (02)
  • [10] Distributed cooperative control with collision avoidance for spacecraft swarm reconfiguration via reinforcement learning
    Sun, Jun
    Meng, Yizhen
    Huang, Jing
    Liu, Fucheng
    Li, Shuang
    [J]. ACTA ASTRONAUTICA, 2023, 205 : 95 - 109