Distributed cooperative control with collision avoidance for spacecraft swarm reconfiguration via reinforcement learning

Cited by: 3
Authors
Sun, Jun [1, 2]
Meng, Yizhen [1, 2]
Huang, Jing [1, 2]
Liu, Fucheng [1, 2]
Li, Shuang [3]
Affiliations
[1] Shanghai Aerosp Control Technol Inst, Shanghai 201109, Peoples R China
[2] Shanghai Key Lab Aerosp Intelligent Control Techno, Shanghai 201109, Peoples R China
[3] Nanjing Univ Aeronaut & Astronaut, Coll Astronaut, Nanjing 21106, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Spacecraft swarm reconfiguration; Distributed cooperative control; Reinforcement learning; Back-stepping control; Soft and hard constraints; TRACKING CONTROL; SYSTEMS
DOI
10.1016/j.actaastro.2023.01.017
Chinese Library Classification number
V [Aeronautics, Astronautics]
Discipline classification code
08; 0825
Abstract
This article investigates an adaptive, distributed, cooperative control strategy for spacecraft swarm reconfiguration, which involves assembling the spacecraft at close distance to one another while avoiding collisions and keeping them far from an obstacle. The key idea is to transform these opposing indices into equivalent ones by using soft and hard constraints. The proposed control strategy is inspired by the actor-critic framework of reinforcement learning (RL) algorithms: the soft constraint is designed by using a critic neural network (NN) for assembling and avoiding obstacles, while collisions among the spacecraft are prevented by the hard constraint established in an artificial potential field (APF). Building on this equivalent transformation, the adaptive, distributed, cooperative controller is devised by using an actor NN of the RL algorithm, an APF, and backstepping control. The actor NNs are used to estimate the input signals of the desired control and the undesired effects due to disturbance from the APF, and the expected control performance is then obtained by minimizing the output of the critic NN. The computational burden incurred by the NNs is significantly reduced by cutting the number of parameters that they need to learn. Lyapunov stability theory is used to guarantee that all signals in the closed-loop system are uniformly ultimately bounded, which ensures its stability. Simulations of a spacecraft swarm demonstrate the effectiveness of the proposed control strategy.
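The abstract does not give the form of the APF used for the hard collision-avoidance constraint. As a rough illustration only, the sketch below uses the classical Khatib-style repulsive potential, U = 0.5 * eta * (1/d - 1/d0)^2 for d < d0 and zero otherwise, which is an assumption and not the authors' formulation. The names `repulsive_force` and `total_repulsion`, the influence radius `d0`, and the gain `eta` are hypothetical.

```python
import math


def repulsive_force(p_i, p_j, d0=10.0, eta=1.0):
    """Repulsive force on agent i due to agent j (negative gradient of U).

    Assumed potential (classical Khatib form, not taken from the paper):
        U(d) = 0.5 * eta * (1/d - 1/d0)**2  if d < d0, else 0
    where d is the distance between the two agents.
    """
    diff = [a - b for a, b in zip(p_i, p_j)]
    d = math.sqrt(sum(c * c for c in diff))
    # Outside the influence radius the field vanishes; at d == 0 the
    # direction is undefined, so this sketch simply returns zero there.
    if d >= d0 or d == 0.0:
        return [0.0] * len(diff)
    magnitude = eta * (1.0 / d - 1.0 / d0) / (d * d)
    # Unit vector diff / d points from j toward i, i.e. away from the neighbor.
    return [magnitude * c / d for c in diff]


def total_repulsion(positions, i, d0=10.0, eta=1.0):
    """Sum of the repulsive forces on agent i from every other agent."""
    total = [0.0] * len(positions[i])
    for j, p_j in enumerate(positions):
        if j == i:
            continue
        force = repulsive_force(positions[i], p_j, d0, eta)
        total = [t + c for t, c in zip(total, force)]
    return total
```

In a distributed setting each spacecraft would evaluate `total_repulsion` over its own neighbors only; how the paper combines this term with the actor NN and backstepping controller is not specified in the abstract.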
Pages: 95-109
Page count: 15
Related papers
50 in total
  • [21] Cooperative Collision Avoidance for Multi-Vehicle Systems Using Reinforcement Learning
    Wang, Qichen
    Phillips, Chris
    [J]. 2013 18TH INTERNATIONAL CONFERENCE ON METHODS AND MODELS IN AUTOMATION AND ROBOTICS (MMAR), 2013, : 98 - 102
  • [22] Research on cooperative collision avoidance problem of multiple UAV based on Reinforcement Learning
    Fang Bin
    Feng XiaoFeng
    Xu Shuo
    [J]. 2017 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION (ICICTA 2017), 2017, : 103 - 109
  • [23] Multi-robot Target Encirclement Control with Collision Avoidance via Deep Reinforcement Learning
    Ma, Junchong
    Lu, Huimin
    Xiao, Junhao
    Zeng, Zhiwen
    Zheng, Zhiqiang
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2020, 99 (02) : 371 - 386
  • [24] Multi-robot Target Encirclement Control with Collision Avoidance via Deep Reinforcement Learning
    Junchong Ma
    Huimin Lu
    Junhao Xiao
    Zhiwen Zeng
    Zhiqiang Zheng
    [J]. Journal of Intelligent & Robotic Systems, 2020, 99 : 371 - 386
  • [25] Multiple-Spacecraft Reconfiguration Through Collision Avoidance, Bouncing, and Stalemate
    Y. Kim
    M. Mesbahi
    F. Y. Hadaegh
    [J]. Journal of Optimization Theory and Applications, 2004, 122 : 323 - 343
  • [26] Optimal reconfiguration with collision avoidance for a granular spacecraft using laser pressure
    Zhang, Kunpeng
    Zhang, Yao
    [J]. ACTA ASTRONAUTICA, 2019, 160 : 163 - 174
  • [27] Multiple Spacecraft Formation Reconfiguration Planning with Nonconvex Collision Avoidance Constraints
    Zhou, Ding
    Hu, Yuting
    Li, Shunli
    [J]. 2016 IEEE CHINESE GUIDANCE, NAVIGATION AND CONTROL CONFERENCE (CGNCC), 2016, : 643 - 647
  • [28] Multiple-spacecraft reconfiguration through collision avoidance, bouncing, and stalemate
    Kim, Y
    Mesbahi, M
    Hadaegh, FY
    [J]. JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2004, 122 (02) : 323 - 343
  • [29] Multigoal Visual Navigation With Collision Avoidance via Deep Reinforcement Learning
    Xiao, Wendong
    Yuan, Liang
    He, Li
    Ran, Teng
    Zhang, Jianbo
    Cui, Jianping
    [J]. IEEE Transactions on Instrumentation and Measurement, 2022, 71
  • [30] DISTRIBUTED SPACECRAFT PATH PLANNING AND COLLISION AVOIDANCE VIA RECIPROCAL VELOCITY OBSTACLE APPROACH
    Channumsin, Sittiporn
    Radice, Gianmarco
    Ceriotti, Matteo
    [J]. ASTRODYNAMICS 2017, PTS I-IV, 2018, 162 : 2635 - 2649