Deterministic Policy Gradient With Integral Compensator for Robust Quadrotor Control

Cited by: 109
Authors
Wang, Yuanda [1 ,2 ]
Sun, Jia [3 ]
He, Haibo [4 ]
Sun, Changyin [1 ,2 ]
Affiliations
[1] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China
[2] Southeast Univ, Key Lab Measurement & Control Complex Syst Engn, Minist Educ, Nanjing 210096, Peoples R China
[3] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 10083, Peoples R China
[4] Univ Rhode Isl, Dept Elect Comp & Biomed Engn, Kingston, RI 02881 USA
Funding
National Natural Science Foundation of China; U.S. National Science Foundation
Keywords
Reinforcement learning; Rotors; Helicopters; Neural networks; Aerodynamics; Heuristic algorithms; Robustness; Deterministic policy gradient (DPG); neural network; quadrotor; reinforcement learning; REINFORCEMENT; ATTITUDE;
DOI
10.1109/TSMC.2018.2884725
Chinese Library Classification
TP [Automation and computer technology]
Discipline classification code
0812
Abstract
This paper proposes a robust control strategy for quadrotor helicopters based on deep reinforcement learning. The quadrotor is controlled by a learned neural network that maps the system states directly to control commands in an end-to-end manner. The learning algorithm builds on the deterministic policy gradient (DPG) algorithm. Introducing an integral compensator into the actor-critic structure greatly improves tracking accuracy and robustness. For practical implementation, a two-phase learning protocol with an offline phase and an online phase is also proposed: an offline policy is first learned on a simplified quadrotor model, and the policy is then optimized online in actual flight. The approach is evaluated in a flight simulator. The results show that the offline-learned policy is highly robust to model errors and external disturbances, and that online learning can significantly improve control performance.
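The role of the integral compensator can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes the compensator works by adding the accumulated tracking error to the state seen by the policy, and it replaces the learned actor network with a fixed linear policy and the quadrotor with a 1-D velocity model under a constant disturbance. All names and values (`augment_state`, `linear_policy`, `simulate`, the gains, the disturbance) are hypothetical.

```python
# Toy illustration only: a fixed linear "policy" stands in for the learned
# actor network, and a 1-D velocity model with a constant disturbance stands
# in for the quadrotor. Gains and disturbance are arbitrary assumed values.

def augment_state(error, integral, dt):
    """Accumulate the tracking error and return the augmented state."""
    integral = integral + error * dt
    return (error, integral), integral

def linear_policy(aug_state, k_e, k_i):
    """Deterministic policy: maps the augmented state to one control command."""
    error, integral = aug_state
    return k_e * error + k_i * integral

def simulate(k_e, k_i, target=1.0, disturbance=-0.5, dt=0.01, steps=5000):
    """Roll out v' = u + disturbance (forward Euler); return the final error."""
    v, integral = 0.0, 0.0
    for _ in range(steps):
        error = target - v
        aug_state, integral = augment_state(error, integral, dt)
        u = linear_policy(aug_state, k_e, k_i)
        v += (u + disturbance) * dt
    return target - v

err_without = simulate(k_e=2.0, k_i=0.0)  # policy ignores the integral state
err_with = simulate(k_e=2.0, k_i=1.0)     # policy uses the integral state
print(f"final error without integral state: {err_without:.4f}")
print(f"final error with integral state:    {err_with:.4f}")
```

In this toy setting, the policy that ignores the integral state settles with a constant offset caused by the disturbance, while the policy that uses it drives the tracking error toward zero. This is the kind of steady-state accuracy gain the abstract attributes to the compensator.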
Pages: 3713-3725
Number of pages: 13