Hybrid Soft Actor-Critic and Incremental Dual Heuristic Programming Reinforcement Learning for Fault-Tolerant Flight Control

Cited by: 0

Authors:

Teirlinck, C. [1]

van Kampen, Erik-Jan [1]

Affiliation:

[1] Delft Univ Technol, Control & Simulat, POB 5058, NL-2600 GB Delft, Netherlands

Source:

AIAA SCITECH 2024 FORUM | 2024

DOI:

Not available

Abstract:

Recent advances in fault-tolerant flight control have applied model-free offline and online Reinforcement Learning (RL) algorithms to provide robust and adaptive control for autonomous systems. Inspired by recent work on Incremental Dual Heuristic Programming (IDHP) and Soft Actor-Critic (SAC), this research proposes a hybrid SAC-IDHP framework that aims to combine the adaptive online learning of IDHP with the high-complexity generalization power of SAC in controlling a fully coupled system. The hybrid framework is implemented in the inner loop of a cascaded altitude controller for a high-fidelity, six-degree-of-freedom model of the Cessna Citation II PH-LAB research aircraft. Compared to SAC-only, the SAC-IDHP hybrid improves tracking performance by 0.74%, 5.46% and 0.82% in nMAE for the nominal, longitudinal failure and lateral failure cases, respectively. Identity initialization of the hybrid policy eliminates random online policy initialization, which supports an argument for increased safety. Additionally, robustness to biased sensor noise, initial flight condition and random critic initialization is demonstrated.

Pages: 22

50 items in total

[11] Actor-Critic Reinforcement Learning for Control With Stability Guarantee
Han, Minghao
Zhang, Lixian
Wang, Jun
Pan, Wei
IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (04) : 6217 - 6224
[12] Stepwise Soft Actor-Critic for UAV Autonomous Flight Control
Hwang, Ha Jun
Jang, Jaeyeon
Choi, Jongkwan
Bae, Jung Ho
Kim, Sung Ho
Kim, Chang Ouk
DRONES, 2023, 7 (09)
[13] Dual Variable Actor-Critic for Adaptive Safe Reinforcement Learning
Lee, Junseo
Heo, Jaeseok
Kim, Dohyeong
Lee, Gunmin
Oh, Songhwai
2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023 : 7568 - 7573
[14] Hybrid Actor-Critic Reinforcement Learning in Parameterized Action Space
Fan, Zhou
Su, Rui
Zhang, Weinan
Yu, Yong
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2279 - 2285
[15] An extension of Genetic Network Programming with Reinforcement Learning using actor-critic
Hatakeyama, Hiroyuki
Mabu, Shingo
Hirasawa, Kotaro
Hu, Jinglu
2006 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-6, 2006, : 1522 - +
[16] A soft actor-critic reinforcement learning algorithm for network intrusion detection
Li, Zhengfa
Huang, Chuanhe
Deng, Shuhua
Qiu, Wanyu
Gao, Xieping
COMPUTERS & SECURITY, 2023, 135
[17] A Novel Actor-Critic Motor Reinforcement Learning for Continuum Soft Robots
Pantoja-Garcia, Luis
Parra-Vega, Vicente
Garcia-Rodriguez, Rodolfo
Vazquez-Garcia, Carlos Ernesto
ROBOTICS, 2023, 12 (05)
[18] Improving Generalization of Reinforcement Learning with Minimax Distributional Soft Actor-Critic
Ren, Yangang
Duan, Jingliang
Li, Shengbo Eben
Guan, Yang
Sun, Qi
2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020
[19] Actor-critic reinforcement learning for the feedback control of a swinging chain
Dengler, C.
Lohmann, B.
IFAC PAPERSONLINE, 2018, 51 (13) : 378 - 383
[20] Fault-tolerant tracking control for continuous flight control system based on reinforcement learning algorithm with incremental strategy
Ren J.
Liu J.-W.
Yang P.
Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2020, 37 (07) : 1429 - 1438
