Improved Reinforcement Learning Using Stability Augmentation With Application to Quadrotor Attitude Control

被引：5

作者：

Wu, Hangxing ^{[1
]}

Ye, Hui ^{[1
]}

Xue, Wentao ^{[1
]}

Yang, Xiaofei ^{[1
]}

机构：

[1] Jiangsu Univ Sci & Technol, Sch Elect & Informat, Zhenjiang 21210U, Jiangsu, Peoples R China

来源：

IEEE ACCESS | 2022年 / 10卷

基金：

中国国家自然科学基金;

关键词：

Training; Attitude control; Rotors; Torque; Reinforcement learning; Neural networks; Stability criteria; attitude control; proximal policy optimization; quadrotor; dimension-wise clipping; stability augmentation system;

D O I：

10.1109/ACCESS.2022.3185424

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Reinforcement learning (RL) has been successfully applied to motion control, without requiring accurate models and selection of control parameters. In this paper, we propose a novel RL algorithm based on proximal policy optimization algorithm with dimension-wise clipping (PPO-DWC) for attitude control of quadrotor. Firstly, dimension-wise clipping technique is introduced to solve the zero-gradient problem of the PPO algorithm, which can quickly converge while maintaining good sampling efficiency, thus improving the control performance. Moreover, following the idea of stability augmentation system (SAS), a feedback controller is designed and integrated into the environment before training the PPO controller to avoid ineffective exploration and improve the system's convergence. The eventual controller consists of two parts: the first is the result of the actor neural network in the PPO algorithm, and the second is the output of the stability augmentation feedback controller. Both of them directly use an end-to-end style of control commands to map the system state. This control architecture is applied in the attitude control of the quadrotor. The simulation results show that the quadrotor can quickly and accurately track the command and has a small steady-state error after the training by the improved PPO algorithm. Meanwhile, compared with the traditional PID controller and basic PPO algorithm, the proposed PPO-DWC algorithm with stability augmentation framework has better performance in tracking accuracy and robustness.

引用

页码：67590 / 67604

页数：15

共 50 条

[31] Waypoint Navigation of Quadrotor using Deep Reinforcement Learning
Himanshu, K. Harikumar
Pushpangathan, Jinraj, V
IFAC PAPERSONLINE, 2022, 55 (22): : 281 - 286
[32] Attitude Control of a Nanosatellite system using Reinforcement Learning and Neural Networks
Yadava, Deigant
Hosangadi, Raunak
Krishna, Sai
Paliwal, Pranjal
Jain, Avi
2018 IEEE AEROSPACE CONFERENCE, 2018,
[33] ADAPTIVE CONTROL BY REINFORCEMENT LEARNING FOR SPACECRAFT ATTITUDE CONTROL
Ramadan, Mohammad
Younes, Ahmad Bani
SPACEFLIGHT MECHANICS 2019, VOL 168, PTS I-IV, 2019, 168 : 1805 - 1815
[34] Inclined Quadrotor Landing using Deep Reinforcement Learning
Kooi, Jacob E.
Babuska, Robert
2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 2361 - 2368
[35] Quadrotor Attitude Control Using Special Orthogonal Matrix
Chen, Tse-yu
Yu, Jen-te
2020 INTERNATIONAL AUTOMATIC CONTROL CONFERENCE (CACS), 2020,
[36] Offline Reinforcement Learning for Quadrotor Control: Overcoming the Ground Effect
Sacchetto, Luca
Korte, Mathias
Gronauer, Sven
Diepold, Klaus
2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 7539 - 7544
[37] Waypoint Tracking Control for a Quadrotor based on PID and Reinforcement Learning
Bao, Xurui
Jing, Zhouhui
CONTROL ENGINEERING AND APPLIED INFORMATICS, 2023, 25 (01): : 90 - 100
[38] Intelligent Control of a Quadrotor with Proximal Policy Optimization Reinforcement Learning
Lopes, Guilherme Cano
Ferreira, Murillo
Simoes, Alexandre da Silva
Colombini, Esther Luna
15TH LATIN AMERICAN ROBOTICS SYMPOSIUM 6TH BRAZILIAN ROBOTICS SYMPOSIUM 9TH WORKSHOP ON ROBOTICS IN EDUCATION (LARS/SBR/WRE 2018), 2018, : 503 - 508
[39] Modular Reinforcement Learning for a Quadrotor UAV With Decoupled Yaw Control
Yu, Beomyeol
Lee, Taeyoung
IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (01): : 572 - 579
[40] Robust Quadrotor Control through Reinforcement Learning with Disturbance Compensation
Pi, Chen-Huan
Ye, Wei-Yuan
Cheng, Stone
APPLIED SCIENCES-BASEL, 2021, 11 (07):

← 1 2 3 4 5 →