Reinforcement Learning-Based Approximate Optimal Control for Attitude Reorientation Under State Constraints

被引:47
|
作者
Dong, Hongyang [1 ]
Zhao, Xiaowei [1 ]
Yang, Haoyang [2 ]
机构
[1] Univ Warwick, Sch Engn, Coventry CV4 7AL, W Midlands, England
[2] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
基金
英国工程与自然科学研究理事会;
关键词
Attitude control; Payloads; Angular velocity; Optimal control; Artificial neural networks; Cost function; Quaternions; Adaptive dynamic programming (ADP); approximate optimal control; attitude control; reinforcement learning (RL); state constraints; FEEDBACK-CONTROL; SPACECRAFT; TRACKING; STABILIZATION; PARAMETER; SYSTEMS;
D O I
10.1109/TCST.2020.3007401
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article addresses the attitude reorientation problems of rigid bodies under multiple state constraints. A novel reinforcement learning (RL)-based approximate optimal control method is proposed to make the tradeoff between control cost and performance. The novelty lies in that it guarantees constraint handling abilities on attitude forbidden zones and angular velocity limits. To achieve this, barrier functions are employed to encode the constraint information into the cost function. Then, an RL-based learning strategy is developed to approximate the optimal cost function and control policy. A simplified critic-only neural network (NN) is employed to replace the conventional actor-critic structure once adequate data are collected online. This design guarantees the uniform boundedness of reorientation errors and NN weight estimation errors subject to the satisfaction of a finite excitation condition, which is a relaxation compared with the persistent excitation condition that is typically required for this class of problems. More importantly, all underlying state constraints are strictly obeyed during the online learning process. The effectiveness and advantages of the proposed controller are verified by both numerical simulations and experimental tests based on a comprehensive hardware-in-loop testbed.
引用
收藏
页码:1664 / 1673
页数:10
相关论文
共 50 条
  • [41] Optimal control for a class of nonlinear systems with input constraints based on reinforcement learning
    Luo A.
    Xiao W.-B.
    Zhou Q.
    Lu R.-Q.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2022, 39 (01): : 154 - 164
  • [42] Deep Reinforcement Learning-Based Optimal Control of Variable Cycle Engine Performance
    Tao, Bo
    Yang, Li-Ying
    Wu, Dong-Sheng
    Li, Si-Liang
    Huang, Zhao-Xiong
    Sun, Xiao-Shu
    2022 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2022), 2022, : 1002 - 1005
  • [43] Reinforcement Learning-based Distributed Secondary Optimal Control for Multi-Microgrids
    Liu, Wei
    Wen, Zhen
    Shen, Yiping
    Zhang, Zhifang
    2017 IEEE CONFERENCE ON ENERGY INTERNET AND ENERGY SYSTEM INTEGRATION (EI2), 2017,
  • [44] Reinforcement Learning-Based Adaptive Optimal Control for Nonlinear Systems With Asymmetric Hysteresis
    Zheng, Licheng
    Liu, Zhi
    Wang, Yaonan
    Chen, C. L. Philip
    Zhang, Yun
    Wu, Zongze
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) : 15800 - 15809
  • [45] Reinforcement learning-based robust optimal tracking control for disturbed nonlinear systems
    Fan, Zhong-Xin
    Tang, Lintao
    Li, Shihua
    Liu, Rongjie
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (33): : 23987 - 23996
  • [46] A Deep Reinforcement Learning-Based Optimal Transmission Control Method for Streaming Videos
    Yang, Yawen
    Xiao, Yuxuan
    IEEE ACCESS, 2024, 12 : 53088 - 53098
  • [47] Federated deep reinforcement learning-based urban traffic signal optimal control
    Li, Mi
    Pan, Xiaolong
    Liu, Chuhui
    Li, Zirui
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [48] Reinforcement Learning-Based Optimal Stabilization for Unknown Nonlinear Systems Subject to Inputs With Uncertain Constraints
    Zhao, Bo
    Liu, Derong
    Luo, Chaomin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (10) : 4330 - 4340
  • [49] Reinforcement learning robust optimal control for spacecraft attitude stabilization
    Xiao B.
    Zhang H.
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2024, 45 (01):
  • [50] Reinforcement Learning based Approximate Optimal Control of Nonlinear Systems using Carleman Linearization
    Kar, Jishnudeep
    Bai, He
    Chakrabortty, Aranya
    2023 AMERICAN CONTROL CONFERENCE, ACC, 2023, : 3362 - 3367