Data-driven identifier–actor–critic learning for cooperative spacecraft attitude tracking with orientation constraints

被引:0
|
作者
Xia, Kewei [1 ]
Wang, Jianan [2 ]
Zou, Yao [3 ]
Gao, Hongbo [4 ]
Ding, Zhengtao [5 ]
机构
[1] Advanced Research Institute of Multidisciplinary Sciences, Beijing Institute of Technology, Beijing,100081, China
[2] School of Aerospace Engineering, Beijing Institute of Technology, Beijing,100081, China
[3] School of Intelligence Science and Technology, University of Science and Technology Beijing, Beijing,100083, China
[4] Department of Automation, University of Science and Technology of China, Hefei,230026, China
[5] Department of Electrical and Electronic Engineering, University of Manchester, Manchester,M13 9PL, United Kingdom
基金
中国国家自然科学基金;
关键词
Adversarial machine learning - Federated learning - Quadratic programming - Reinforcement learning - Robust control;
D O I
10.1016/j.automatica.2024.112035
中图分类号
学科分类号
摘要
This paper investigates the cooperative attitude tracking issue of a cluster of spacecraft subject to orientation constraints. In particular, all the involved spacecraft cooperatively adjust their attitudes to track a time-varying reference via local information exchange while constraining them inside a mandatory orientation zone as well as outside forbidden orientation zones. A dynamic identifier is first exploited to compensate for the dynamics uncertainty. Next, by integrating the sliding mode with the dynamic identifier, a distributed actor–critic reinforcement learning (RL) control algorithm is designed. Moreover, a data-driven online learning algorithm is proposed for the update of the learning weights, which effectively relieves the typical persistent excitation (PE) to the finite excitation (FE). To overcome the orientation constraint dilemmas, a robust control barrier function (CBF) based quadratic programming optimization is designed. It is shown that the attitude tracking errors are ultimately driven to a small tunable neighborhood of origin without violating the underlying orientation constraints. Finally, simulation results validate and highlight the proposed theoretical results. © 2024 Elsevier Ltd
引用
收藏
相关论文
共 50 条
  • [1] Data-Driven Model-Free Tracking Reinforcement Learning Control with VRFT-based Adaptive Actor-Critic
    Radac, Mircea-Bogdan
    Precup, Radu-Emil
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (09):
  • [2] Synthesis of Data-Driven LightGBM Controller for Spacecraft Attitude Control
    Mahayana, Dimitri
    [J]. IEEE ACCESS, 2023, 11 : 70238 - 70247
  • [3] DATA-DRIVEN IDENTIFICATION AND CONTROL OF FLEXIBLE SPACECRAFT ATTITUDE DYNAMICS
    Chen H.
    Chen T.
    [J]. Lixue Xuebao/Chinese Journal of Theoretical and Applied Mechanics, 2024, 56 (02): : 433 - 445
  • [4] Data-Driven Learning and Control with Multiple Critic Networks
    He, Haibo
    Ni, Zhen
    Zhao, Dongbin
    [J]. PROCEEDINGS OF THE 10TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA 2012), 2012, : 523 - 527
  • [5] Data-driven iterative learning cooperative trajectory tracking control for multiple autonomous underwater vehicles with input saturation constraints
    Wu, Chengxi
    Karimi, Hamid Reza
    Shan, Liang
    Dai, Yuewei
    [J]. JOURNAL OF FIELD ROBOTICS, 2024, 41 (07) : 2475 - 2487
  • [6] Cooperative Differential Game-based Optimal Tracking Control of Modular Reconfigurable Robot with Actor-Critic-Identifier Structure
    Liang, Jingkai
    An, Tianjiao
    Dong, Bo
    Ma, Bing
    Gao, Yuhang
    Jiang, Hucheng
    [J]. Proceedings - 2023 China Automation Congress, CAC 2023, 2023, : 4267 - 4272
  • [7] Optimized Data-Driven Prescribed Performance Attitude Control for Actuator Saturated Spacecraft
    Yang, Haoyang
    Hu, Qinglei
    Dong, Hongyang
    Zhao, Xiaowei
    Li, Dongyu
    [J]. IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2023, 28 (03) : 1616 - 1626
  • [8] A Data-Driven Approach to Spacecraft Attitude Control Using Support Vector Regression (SVR)
    Mahayana, Dimitri
    [J]. IEEE Access, 2024, 12 : 177896 - 177910
  • [9] A Human-Centered Data-Driven Planner-Actor-Critic Architecture via Logic Programming
    Lyu, Daoming
    Yang, Fangkai
    Liu, Bo
    Gustafson, Steven
    [J]. ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2019, (306): : 182 - 195
  • [10] Reinforcement Learning-Based Predefined-Time Tracking Control for Nonlinear Systems Under Identifier-Critic-Actor Structure
    Wang, Jing
    Zhao, Wei
    Cao, Jinde
    Park, Ju H.
    Shen, Hao
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2024, : 6345 - 6357