Improved Reinforcement Learning Using Stability Augmentation With Application to Quadrotor Attitude Control

被引:5
|
作者
Wu, Hangxing [1 ]
Ye, Hui [1 ]
Xue, Wentao [1 ]
Yang, Xiaofei [1 ]
机构
[1] Jiangsu Univ Sci & Technol, Sch Elect & Informat, Zhenjiang 21210U, Jiangsu, Peoples R China
来源
IEEE ACCESS | 2022年 / 10卷
基金
中国国家自然科学基金;
关键词
Training; Attitude control; Rotors; Torque; Reinforcement learning; Neural networks; Stability criteria; attitude control; proximal policy optimization; quadrotor; dimension-wise clipping; stability augmentation system;
D O I
10.1109/ACCESS.2022.3185424
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning (RL) has been successfully applied to motion control, without requiring accurate models and selection of control parameters. In this paper, we propose a novel RL algorithm based on proximal policy optimization algorithm with dimension-wise clipping (PPO-DWC) for attitude control of quadrotor. Firstly, dimension-wise clipping technique is introduced to solve the zero-gradient problem of the PPO algorithm, which can quickly converge while maintaining good sampling efficiency, thus improving the control performance. Moreover, following the idea of stability augmentation system (SAS), a feedback controller is designed and integrated into the environment before training the PPO controller to avoid ineffective exploration and improve the system's convergence. The eventual controller consists of two parts: the first is the result of the actor neural network in the PPO algorithm, and the second is the output of the stability augmentation feedback controller. Both of them directly use an end-to-end style of control commands to map the system state. This control architecture is applied in the attitude control of the quadrotor. The simulation results show that the quadrotor can quickly and accurately track the command and has a small steady-state error after the training by the improved PPO algorithm. Meanwhile, compared with the traditional PID controller and basic PPO algorithm, the proposed PPO-DWC algorithm with stability augmentation framework has better performance in tracking accuracy and robustness.
引用
收藏
页码:67590 / 67604
页数:15
相关论文
共 50 条
  • [31] Waypoint Navigation of Quadrotor using Deep Reinforcement Learning
    Himanshu, K. Harikumar
    Pushpangathan, Jinraj, V
    IFAC PAPERSONLINE, 2022, 55 (22): : 281 - 286
  • [32] Attitude Control of a Nanosatellite system using Reinforcement Learning and Neural Networks
    Yadava, Deigant
    Hosangadi, Raunak
    Krishna, Sai
    Paliwal, Pranjal
    Jain, Avi
    2018 IEEE AEROSPACE CONFERENCE, 2018,
  • [33] ADAPTIVE CONTROL BY REINFORCEMENT LEARNING FOR SPACECRAFT ATTITUDE CONTROL
    Ramadan, Mohammad
    Younes, Ahmad Bani
    SPACEFLIGHT MECHANICS 2019, VOL 168, PTS I-IV, 2019, 168 : 1805 - 1815
  • [34] Inclined Quadrotor Landing using Deep Reinforcement Learning
    Kooi, Jacob E.
    Babuska, Robert
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 2361 - 2368
  • [35] Quadrotor Attitude Control Using Special Orthogonal Matrix
    Chen, Tse-yu
    Yu, Jen-te
    2020 INTERNATIONAL AUTOMATIC CONTROL CONFERENCE (CACS), 2020,
  • [36] Offline Reinforcement Learning for Quadrotor Control: Overcoming the Ground Effect
    Sacchetto, Luca
    Korte, Mathias
    Gronauer, Sven
    Diepold, Klaus
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 7539 - 7544
  • [37] Waypoint Tracking Control for a Quadrotor based on PID and Reinforcement Learning
    Bao, Xurui
    Jing, Zhouhui
    CONTROL ENGINEERING AND APPLIED INFORMATICS, 2023, 25 (01): : 90 - 100
  • [38] Intelligent Control of a Quadrotor with Proximal Policy Optimization Reinforcement Learning
    Lopes, Guilherme Cano
    Ferreira, Murillo
    Simoes, Alexandre da Silva
    Colombini, Esther Luna
    15TH LATIN AMERICAN ROBOTICS SYMPOSIUM 6TH BRAZILIAN ROBOTICS SYMPOSIUM 9TH WORKSHOP ON ROBOTICS IN EDUCATION (LARS/SBR/WRE 2018), 2018, : 503 - 508
  • [39] Modular Reinforcement Learning for a Quadrotor UAV With Decoupled Yaw Control
    Yu, Beomyeol
    Lee, Taeyoung
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (01): : 572 - 579
  • [40] Robust Quadrotor Control through Reinforcement Learning with Disturbance Compensation
    Pi, Chen-Huan
    Ye, Wei-Yuan
    Cheng, Stone
    APPLIED SCIENCES-BASEL, 2021, 11 (07):