Deterministic Policy Gradient With Integral Compensator for Robust Quadrotor Control

Cited by: 109

Authors:

Wang, Yuanda [1,2]

Sun, Jia [3]

He, Haibo [4]

Sun, Changyin [1,2]

Affiliations:

[1] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China

[2] Southeast Univ, Key Lab Measurement & Control Complex Syst Engn, Minist Educ, Nanjing 210096, Peoples R China

[3] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 10083, Peoples R China

[4] Univ Rhode Isl, Dept Elect Comp & Biomed Engn, Kingston, RI 02881 USA

Source:

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2020 / Volume 50 / Issue 10

Funding:

National Natural Science Foundation of China; U.S. National Science Foundation;

Keywords:

Reinforcement learning; Rotors; Helicopters; Neural networks; Aerodynamics; Heuristic algorithms; Robustness; Deterministic policy gradient (DPG); neural network; quadrotor; reinforcement learning; REINFORCEMENT; ATTITUDE;

DOI:

10.1109/TSMC.2018.2884725

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

This paper proposes a deep reinforcement learning-based robust control strategy for quadrotor helicopters. The quadrotor is controlled by a learned neural network that maps the system states directly to control commands in an end-to-end fashion. The learning algorithm is based on the deterministic policy gradient algorithm. Introducing an integral compensator into the actor-critic structure greatly enhances tracking accuracy and robustness. Moreover, a two-phase learning protocol comprising an offline and an online learning phase is proposed for practical implementation. An offline policy is first learned on a simplified quadrotor model. The policy is then optimized online in actual flight. The proposed approach is evaluated in a flight simulator. The results demonstrate that the offline-learned policy is highly robust to model errors and external disturbances. They also show that online learning can significantly improve control performance.
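
The abstract does not spell out how the integral compensator is wired into the actor-critic structure. As a rough illustration of the general idea only, the toy sketch below appends an accumulated tracking error to the observation that a policy receives. Everything in it is an assumption made for illustration: the first-order plant, the setpoint, the constant disturbance, and the hand-picked gains `k_e` and `k_i`, which stand in for a learned actor network. It is not the paper's quadrotor model or learning algorithm.

```python
def simulate(use_integral, steps=4000, dt=0.01):
    """Track a constant setpoint under an unknown constant disturbance.

    Returns the tracking error remaining after `steps` Euler steps.
    """
    target = 1.0         # hypothetical velocity setpoint
    disturbance = -0.5   # unmodeled constant force, unknown to the policy
    k_e, k_i = 2.0, 1.0  # hand-picked stand-in gains, NOT learned weights
    v = 0.0              # plant state (velocity)
    z = 0.0              # integral compensator state (accumulated error)
    for _ in range(steps):
        e = target - v
        z += e * dt  # integrate the tracking error
        # Augmented observation: current error plus its running integral.
        obs = (e, z if use_integral else 0.0)
        # Linear stand-in for the actor: maps observation to a command.
        u = k_e * obs[0] + k_i * obs[1]
        # Toy first-order plant, forward Euler step.
        v += (-v + u + disturbance) * dt
    return target - v


err_plain = simulate(use_integral=False)
err_integral = simulate(use_integral=True)
print(f"residual error without integral state: {err_plain:.4f}")
print(f"residual error with integral state:    {err_integral:.4f}")
```

In this toy setting the policy without the integral state settles with a residual error of about 0.5, while the augmented policy drives the error close to zero, because the accumulated error keeps growing until the command cancels the disturbance.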


Pages: 3713-3725

Page count: 13

50 records in total

[1] DEEP DETERMINISTIC POLICY GRADIENT WITH GENERALIZED INTEGRAL COMPENSATOR FOR HEIGHT CONTROL OF QUADROTOR
Liu, Anlin
Liu, Lei
Cao, Jinde
Alsaadi, Fawaz E.
JOURNAL OF APPLIED ANALYSIS AND COMPUTATION, 2022, 12 (03) : 868 - 894
[2] Proximal policy optimization with an integral compensator for quadrotor control
Hu, Huan
Wang, Qing-ling
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2020, 21 (05) : 777 - 795
[3] Proximal policy optimization with an integral compensator for quadrotor control
Huan Hu
Qing-ling Wang
Frontiers of Information Technology & Electronic Engineering, 2020, 21 : 777 - 795
[4] Robust Control Strategy for Quadrotor Drone Using Reference Model-Based Deep Deterministic Policy Gradient
Liu, Hongxun
Suzuki, Satoshi
Wang, Wei
Liu, Hao
Wang, Qi
DRONES, 2022, 6 (09)
[5] Adaptive Proportional Integral Robust Control of an Uncertain Robotic Manipulator Based on Deep Deterministic Policy Gradient
Lu, Puwei
Huang, Wenkai
Xiao, Junlong
Zhou, Fobao
Hu, Wei
MATHEMATICS, 2021, 9 (17)
[6] Cooperative control of velocity and heading for unmanned surface vessel based on twin delayed deep deterministic policy gradient with an integral compensator
Wang, Yibai
Zhao, Shulong
Wang, Qingling
OCEAN ENGINEERING, 2023, 288
[7] Deep Deterministic Policy Gradient (DDPG) Agent-Based Sliding Mode Control for Quadrotor Attitudes
Hu, Wenjun
Yang, Yueneng
Liu, Zhiyang
DRONES, 2024, 8 (03)
[8] BYZANTINE-ROBUST FEDERATED DEEP DETERMINISTIC POLICY GRADIENT
Lin, Qifeng
Ling, Qing
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022 : 4013 - 4017
[9] Bias Correction in Deterministic Policy Gradient Using Robust MPC
Kordabad, Arash Bahari
Esfahani, Hossein Nejatbakhsh
Gros, Sebastien
2021 EUROPEAN CONTROL CONFERENCE (ECC), 2021 : 1086 - 1091
[10] Low-Level Control of a Quadrotor using Twin Delayed Deep Deterministic Policy Gradient (TD3)
Shehab, Mazen
Zaghloul, Ahmed
El-Badawy, Ayman
2021 18TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, COMPUTING SCIENCE AND AUTOMATIC CONTROL (CCE 2021), 2021
