Reinforcement learning based robust tracking control for unmanned helicopter with state constraints and input saturation

Cited by: 0

Authors:

Feng, Yiting [1]

Zhou, Ye [1]

Ho, Hann Woei [1]

Affiliations:

[1] Univ Sains Malaysia, Sch Aerosp Engn, Engn Campus, Nibong Tebal 14300, Pulau Pinang, Malaysia

Source:

AEROSPACE SCIENCE AND TECHNOLOGY | 2024 / Vol. 155

Keywords:

Reinforcement learning; Helicopter tracking control; Adaptive critic designs; Constrained system; Backstepping control; TIME-OPTIMAL-CONTROL; QUADROTOR; MRAC;

DOI:

10.1016/j.ast.2024.109549

Chinese Library Classification (CLC):

V [Aeronautics, Astronautics];

Discipline classification codes:

08 ; 0825 ;

Abstract:

In this paper, an online adaptive optimal control scheme using reinforcement learning (RL) methodology is developed with applications to helicopters in the presence of input saturation and state constraints. Such a control scheme can overcome the strong nonlinearity and coupling dynamics of helicopters by deploying adaptive critic designs (ACDs). Firstly, the backstepping technique is employed to divide the helicopter system into a kinematic loop and a dynamic loop. In the kinematic loop, a constrained Hamilton-Jacobi-Bellman (HJB) equation containing a barrier function is designed to satisfy state constraints. In the dynamic loop, an input-dependent non-quadratic term is incorporated into the HJB equation to solve the input-constrained optimal control problem. Then, a radial basis function (RBF) neural network (NN) is introduced to establish actor-critic networks for the implementation of adaptive optimal control. The critic network is exploited to optimize the tracking performance, while the approximated optimal control for the nominal error dynamic model is derived from the actor network. Meanwhile, a disturbance observer based on RBF NN is designed to compensate for uncertain system dynamics and external disturbances. Using the concurrent learning technique, a novel online update law of actor-critic networks is designed to relax the persistence of excitation (PE) condition. Moreover, the uniform ultimate boundedness (UUB) of parameter estimation error and the asymptotic convergence of state tracking errors are proven through the Lyapunov-based stability analysis. Finally, simulation results are presented to demonstrate that the proposed control strategy is suitable and effective for the helicopter attitude and altitude tracking control problem.
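As a rough illustration of two ingredients named in the abstract, the sketch below evaluates a Gaussian RBF feature vector and a saturated control law. The abstract does not give the paper's basis functions or its exact non-quadratic input term, so both are assumptions here: a Gaussian basis, and the tanh-based closed form that commonly accompanies an inverse-hyperbolic-tangent input cost in the input-constrained adaptive dynamic programming literature. All function names, parameters, and numbers are hypothetical.

```python
import numpy as np


def rbf_features(x, centers, width):
    """Gaussian RBF activations phi_i(x) = exp(-||x - c_i||^2 / (2 * width^2)).

    Assumed basis; the abstract only says an RBF neural network is used.
    """
    diff = centers - x  # shape (n_centers, n_states)
    return np.exp(-np.sum(diff ** 2, axis=1) / (2.0 * width ** 2))


def saturated_control(grad_v, g, r_inv, u_max):
    """Bounded control u = -u_max * tanh(R^{-1} g^T grad_V / (2 * u_max)).

    Assumed form: the minimizer associated with the non-quadratic cost
    2 * u_max * integral_0^u atanh(v / u_max)^T R dv (R diagonal).
    Each component of the result stays within [-u_max, u_max].
    """
    return -u_max * np.tanh(r_inv @ g.T @ grad_v / (2.0 * u_max))


# Hypothetical example values, chosen only to exercise the functions.
centers = np.array([[-1.0, -1.0], [0.0, 0.0], [1.0, 1.0]])
phi = rbf_features(np.array([0.0, 0.0]), centers, width=1.0)

g = np.eye(2)        # assumed input matrix
r_inv = np.eye(2)    # assumed inverse input weighting
u = saturated_control(np.array([50.0, -0.2]), g, r_inv, u_max=1.0)
```

In this sketch a large value-function gradient drives the first control component to the bound rather than past it, while the small second component passes through almost linearly.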


Pages: 12

50 items in total

[31] Variable gain gradient descent-based reinforcement learning for robust optimal tracking control of uncertain nonlinear system with input constraints
Mishra, Amardeep
Ghosh, Satadal
NONLINEAR DYNAMICS, 2022, 107 (03) : 2195 - 2214
[32] Multivariable Super Twisting Based Robust Trajectory Tracking Control for Small Unmanned Helicopter
Fang, Xing
Wu, Aiguo
Shang, Yujia
Du, Chunyan
MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
[33] A Learning-Based Fault Tolerant Tracking Control of an Unmanned Quadrotor Helicopter
Liu, Zhixiang
Yuan, Chi
Zhang, Youmin
Luo, Jun
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2016, 84 (1-4) : 145 - 162
[35] Fuzzy reinforcement learning based control of linear systems with input saturation
Liu, Kainan
Ban, Xiaojun
Xie, Shengkun
ISA TRANSACTIONS, 2025, 158 : 405 - 414
[36] Safe model-based reinforcement learning for nonlinear optimal control with state and input constraints
Kim, Yeonsoo
Kim, Jong Woo
AICHE JOURNAL, 2022, 68 (05)
[37] Adaptive fault-tolerant tracking control of flying-wing unmanned aerial vehicle with system input saturation and state constraints
Li, Zhen
Chen, Xin
Xie, Mingyang
Zhao, Zhenhua
TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2022, 44 (04) : 880 - 891
[39] Reinforcement learning based time-varying formation control for quadrotor unmanned aerial vehicles system with input saturation
Ma, Chi
Cao, Yizhe
Dong, Dianbiao
APPLIED INTELLIGENCE, 2023, 53 (23) : 28730 - 28744
[40] Robust linear parameter varying attitude control of a quadrotor unmanned aerial vehicle with state constraints and input saturation subject to wind disturbance
Soltanpour, Mohammad Reza
Hasanvand, Farshad
Hooshmand, Reza
TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2020, 42 (06) : 1083 - 1096
