Reinforcement learning robust optimal control for spacecraft attitude stabilization

被引:0
|
作者
Xiao B. [1 ]
Zhang H. [1 ]
机构
[1] School of Automation, Northwestern Polytechnical University, Xi'an
关键词
adaptive dynamic programming; attitude control; external distur-bance; reinforcement learning; robustness; spacecraft;
D O I
10.7527/S1000-6893.2023.28890
中图分类号
学科分类号
摘要
The problem of optimal attitude stabilization control of rigid spacecraft despite external disturbances is in-vestigated. An online reinforcement learning-based intelligent and robust control approach is presented via the adap-tive dynamic programming technique. In this approach,a critic-only neural network is developed to learn the optimal control policy of the spacecraft attitude system with external disturbance. A new estimation law is synthesized to esti-mate the weights of that network online. The learned controller can achieve near-optimal control performance. Then, a robust control effort is designed and added into the learned controller to formulate an intelligent and robust controller. It is proven that the closed-loop attitude system obtained from the proposed controller is uniformly ultimately bounded and that the weight estimation error of the Critic NN is convergent by Lyapunov theory. Comparison with the traditional actor-critical neural network-based control schemes shows that with less computation complexity and great robustness to external disturbances,the proposed control approach is less dependent of the persistent excitation condition. Simu-lation results verify the superior control performance of the proposed approach. © 2024 Chinese Society of Astronautics. All rights reserved.
引用
收藏
相关论文
共 36 条
  • [1] HUANG W, RONG W, LIU D H, Et al., Design and realization of recovery system of Changé-5 reentry spacecraft, Space Science & Technology, 1, pp. 133-142, (2021)
  • [2] HUANG X Y, LI M D, WANG X L, Et al., The Tian-wen-1 guidance,navigation,and control for Mars entry, descent,and landing[J], Space Science & Technology, 2021, 4, pp. 1-13, (2021)
  • [3] LI J F, WANG Y B, LIU Z Y, Et al., A new recursive composite adaptive controller for robot manipulators[J], Space Science & Technology, 2021, 1, pp. 77-83
  • [4] MA G F, ZHU Q H, WANG P Y, Et al., Adaptive prescribed performance attitude tracking control for spacecraft via terminal sliding-mode technique, Acta Aeronautica et Astronautica Sinica, 39, 6, (2018)
  • [5] SCHAUB H, AKELLA M R, JUNKINS J L., Adaptive control of nonlinear attitude motions realizing linear closed loop dynamics, Journal of Guidance,Control, and Dynamics, 24, 1, pp. 95-100, (2001)
  • [6] XIAO B, CAO L, RAN D C., Attitude exponential stabilization control of rigid bodies via disturbance observer[J], IEEE Transactions on Systems,Man,and Cybernetics: Systems, 51, 5, pp. 2751-2759, (2021)
  • [7] CRASSIDIS J L, MARKLEY F L., Sliding mode control using modified Rodrigues parameters[J], Journal of Guidance,Control,and Dynamics, 19, 6, pp. 1381-1383, (1996)
  • [8] ZHU Q H, DONG R Q, MA G F., Dynamical sliding mode for flexible spacecraft attitude control, Control Theory & Applications, 35, 10, pp. 1430-1435, (2018)
  • [9] KRSTIC M, TSIOTRAS P., Inverse optimal stabilization of a rigid spacecraft[J], IEEE Transactions on Automatic Control, 44, 5, pp. 1042-1049, (1999)
  • [10] SHARMA R, TEWARI A., Optimal nonlinear tracking of spacecraft attitude maneuvers[J], IEEE Transactions on Control Systems Technology, 12, 5, pp. 677-682, (2004)