Improved Reinforcement Learning Using Stability Augmentation With Application to Quadrotor Attitude Control

被引:5
|
作者
Wu, Hangxing [1 ]
Ye, Hui [1 ]
Xue, Wentao [1 ]
Yang, Xiaofei [1 ]
机构
[1] Jiangsu Univ Sci & Technol, Sch Elect & Informat, Zhenjiang 21210U, Jiangsu, Peoples R China
来源
IEEE ACCESS | 2022年 / 10卷
基金
中国国家自然科学基金;
关键词
Training; Attitude control; Rotors; Torque; Reinforcement learning; Neural networks; Stability criteria; attitude control; proximal policy optimization; quadrotor; dimension-wise clipping; stability augmentation system;
D O I
10.1109/ACCESS.2022.3185424
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning (RL) has been successfully applied to motion control, without requiring accurate models and selection of control parameters. In this paper, we propose a novel RL algorithm based on proximal policy optimization algorithm with dimension-wise clipping (PPO-DWC) for attitude control of quadrotor. Firstly, dimension-wise clipping technique is introduced to solve the zero-gradient problem of the PPO algorithm, which can quickly converge while maintaining good sampling efficiency, thus improving the control performance. Moreover, following the idea of stability augmentation system (SAS), a feedback controller is designed and integrated into the environment before training the PPO controller to avoid ineffective exploration and improve the system's convergence. The eventual controller consists of two parts: the first is the result of the actor neural network in the PPO algorithm, and the second is the output of the stability augmentation feedback controller. Both of them directly use an end-to-end style of control commands to map the system state. This control architecture is applied in the attitude control of the quadrotor. The simulation results show that the quadrotor can quickly and accurately track the command and has a small steady-state error after the training by the improved PPO algorithm. Meanwhile, compared with the traditional PID controller and basic PPO algorithm, the proposed PPO-DWC algorithm with stability augmentation framework has better performance in tracking accuracy and robustness.
引用
收藏
页码:67590 / 67604
页数:15
相关论文
共 50 条
  • [21] Altitude and Attitude Stabilization of UAV Quadrotor System using Improved Active Disturbance Rejection Control
    Aws Abdulsalam Najm
    Ibraheem Kasim Ibraheem
    Arabian Journal for Science and Engineering, 2020, 45 : 1985 - 1999
  • [22] Altitude and Attitude Stabilization of UAV Quadrotor System using Improved Active Disturbance Rejection Control
    Najm, Aws Abdulsalam
    Ibraheem, Ibraheem Kasim
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2020, 45 (03) : 1985 - 1999
  • [23] Attitude Stabilization for a Quadrotor Using Adaptive Control Algorithm
    Huang, Tianpeng
    Li, Tieshan
    Chen, C. L. Philip
    Li, Yanan
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2024, 60 (01) : 334 - 347
  • [24] Quadrotor helicopter Attitude Control using cascade PID
    Bo, Gaopeng
    Xin, Liuyong
    Hui, Zhang
    Ling, Wanglin
    PROCEEDINGS OF THE 28TH CHINESE CONTROL AND DECISION CONFERENCE (2016 CCDC), 2016, : 5158 - 5163
  • [25] Application of Reinforcement Learning in Controlling Quadrotor UAV Flight Actions
    Shen, Shang-En
    Huang, Yi-Cheng
    DRONES, 2024, 8 (11)
  • [26] Robust Attitude Control of Quadrotor using Sliding Mode
    Dey, Parthibi
    Kurode, Shailaja R.
    Ramachandran, Raghu
    2016 INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND DYNAMIC OPTIMIZATION TECHNIQUES (ICACDOT), 2016, : 268 - 272
  • [27] Neuronal Slidding Mode Output Control Application in a Attitude for Quadrotor
    Rodriguez-Mata, A. E.
    Aurelien Cabarbaye, R.
    Baray-Arana, R.
    Acosta-Cano de los Rios, P.
    Rodriguez-Rangel, H.
    2021 18TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, COMPUTING SCIENCE AND AUTOMATIC CONTROL (CCE 2021), 2021,
  • [28] Controller Design for Quadrotor UAVs using Reinforcement Learning
    Bou-Ammar, Haitham
    Voos, Holger
    Ertel, Wolfgang
    2010 IEEE INTERNATIONAL CONFERENCE ON CONTROL APPLICATIONS, 2010, : 2130 - 2135
  • [29] Constrained Reinforcement Learning Using Distributional Representation for Trustworthy Quadrotor UAV Tracking Control
    Wang, Yanran
    Boyle, David
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 : 5877 - 5894
  • [30] Improving Quadrotor Tracking Performance in Wind Disturbances Using Hybrid Reinforcement Learning Control
    Wang, Mengyun
    Liu, Yunzhuo
    Niu, Yifeng
    Yang, Yuqi
    GUIDANCE NAVIGATION AND CONTROL, 2025,