Reinforcement Learning-Based Approximate Optimal Control for Attitude Reorientation Under State Constraints

被引：47

作者：

Dong, Hongyang ^{[1
]}

Zhao, Xiaowei ^{[1
]}

Yang, Haoyang ^{[2
]}

机构：

[1] Univ Warwick, Sch Engn, Coventry CV4 7AL, W Midlands, England

[2] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China

来源：

IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY | 2021年 / 29卷 / 04期

基金：

英国工程与自然科学研究理事会;

关键词：

Attitude control; Payloads; Angular velocity; Optimal control; Artificial neural networks; Cost function; Quaternions; Adaptive dynamic programming (ADP); approximate optimal control; attitude control; reinforcement learning (RL); state constraints; FEEDBACK-CONTROL; SPACECRAFT; TRACKING; STABILIZATION; PARAMETER; SYSTEMS;

D O I：

10.1109/TCST.2020.3007401

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This article addresses the attitude reorientation problems of rigid bodies under multiple state constraints. A novel reinforcement learning (RL)-based approximate optimal control method is proposed to make the tradeoff between control cost and performance. The novelty lies in that it guarantees constraint handling abilities on attitude forbidden zones and angular velocity limits. To achieve this, barrier functions are employed to encode the constraint information into the cost function. Then, an RL-based learning strategy is developed to approximate the optimal cost function and control policy. A simplified critic-only neural network (NN) is employed to replace the conventional actor-critic structure once adequate data are collected online. This design guarantees the uniform boundedness of reorientation errors and NN weight estimation errors subject to the satisfaction of a finite excitation condition, which is a relaxation compared with the persistent excitation condition that is typically required for this class of problems. More importantly, all underlying state constraints are strictly obeyed during the online learning process. The effectiveness and advantages of the proposed controller are verified by both numerical simulations and experimental tests based on a comprehensive hardware-in-loop testbed.

引用

页码：1664 / 1673

页数：10

共 50 条

[1] Deep reinforcement learning-based attitude motion control for humanoid robots with stability constraints
Shi, Qun
Ying, Wangda
Lv, Lei
Xie, Jiajun
INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2020, 47 (03): : 335 - 347
[2] Safe Reinforcement Learning-Based Robust Approximate Optimal Control for Hypersonic Flight Vehicles
Shi, Lei
Wang, Xuesong
Cheng, Yuhu
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (09) : 11401 - 11414
[3] Reinforcement learning-based satellite formation attitude control under multi-constraint
Cai, Yingkai
Low, Kay-Soon
Wang, Zhaokui
ADVANCES IN SPACE RESEARCH, 2024, 74 (11) : 5819 - 5836
[4] Reinforcement learning-based attitude control for a barbell electric sail
Ma, Xiaolei
Wen, Hao
ISA TRANSACTIONS, 2024, 147 : 252 - 264
[5] Reinforcement learning-based consensus control for MASs with intermittent constraints
Luo, Ao
Zhou, Qi
Ren, Hongru
Ma, Hui
Lu, Renquan
NEURAL NETWORKS, 2024, 172
[6] Reinforcement Learning-Based Optimal Battery Control Under Cycle-Based Degradation Cost
Kwon, Kyung-bin
Zhu, Hao
IEEE TRANSACTIONS ON SMART GRID, 2022, 13 (06) : 4909 - 4917
[7] Integral reinforcement learning-based optimal tracking control for uncertain nonlinear systems under input constraint and specified performance constraints
Chang, Ru
Liu, Zhi-Meng
Li, Xiao-Bin
Sun, Chang-Yin
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (13) : 8802 - 8824
[8] NN-based reinforcement-learning optimal sliding mode control for drag-free and attitude of spacecraft with state constraints
Jiang, Changwu
Liu, Yuan
ADVANCES IN SPACE RESEARCH, 2024, 73 (01) : 971 - 981
[9] Reinforcement learning-based optimal control of uncertain nonlinear systems
Garcia, Miguel
Dong, Wenjie
INTERNATIONAL JOURNAL OF CONTROL, 2024, 97 (12) : 2839 - 2850
[10] Spacecraft attitude reorientation control method based on potential function under complex constraints
Hua, Bing
He, Jie
Zhang, Hong
Wu, Yunhua
Chen, Zhiming
AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 144

← 1 2 3 4 5 →