Fractional-Order Systems Optimal Control via Actor-Critic Reinforcement Learning and Its Validation for Chaotic MFET

Cited by: 9

Authors:

Li, Dongdong [1,2]

Dong, Jiuxiang [1,2]

Affiliations:

[1] Northeastern Univ, Coll Informat Sci & Engn, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Peoples R China

[2] Northeastern Univ, Key Lab Vibrat & Control Aeroprop Syst, Minist Educ, Shenyang 110819, Peoples R China

Source:

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING | 2024

Funding:

National Natural Science Foundation of China;

Keywords:

Fractional-order systems; optimal control; reinforcement learning (RL); neural networks; adaptive dynamic programming; tracking control; synchronization;

DOI:

10.1109/TASE.2024.3361213

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Because of the fractional-order dynamics, it is difficult to obtain an optimality equation for fractional-order optimal control. In this paper, a fractional Hamilton-Jacobi-Bellman (HJB) equation based on the error derivative is proposed, and a corresponding online learning algorithm is designed. The scheme can handle the optimal tracking problem for nonlinear systems of order 0 < α ≤ 1. Since the traditional quadratic cost function is unbounded over an infinite horizon and the optimal control derived from a discounted cost function fails to stabilize the system asymptotically, a cost function based on the error derivative is proposed; it avoids both problems and does not require the system to have a zero equilibrium. The fractional HJB equation is then derived by constructing an auxiliary signal, without directly using the chain rule of differentiation. The optimality, stability, and convergence of its solution are proved, and actor-critic neural networks (NNs) are established to carry out the reinforcement learning (RL) algorithm. Finally, the algorithm is applied to a chaotic magnetic-field electromechanical transducer (MFET) system to verify its effectiveness and advantages.
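For readers unfamiliar with derivatives of order 0 < α ≤ 1, the following is a minimal numerical sketch. It is not taken from the paper: the abstract does not state which fractional derivative definition the authors use, so the Grünwald-Letnikov approximation is assumed here purely for illustration, and the function names `gl_weights` and `gl_derivative` are hypothetical.

```python
import math


def gl_weights(alpha, n):
    """Grünwald-Letnikov weights w_k = (-1)^k * C(alpha, k), for k = 0..n."""
    w = [1.0]
    for k in range(1, n + 1):
        # Standard recurrence: w_k = w_{k-1} * (1 - (alpha + 1) / k)
        w.append(w[-1] * (1.0 - (alpha + 1.0) / k))
    return w


def gl_derivative(f, t, alpha, h=1e-3):
    """Approximate the order-alpha derivative of f at time t, with lower terminal 0.

    Assumption for illustration only: Grünwald-Letnikov definition, uniform step h.
    """
    n = int(round(t / h))
    w = gl_weights(alpha, n)
    # Weighted sum over the whole history of f, scaled by h^(-alpha)
    return sum(w[k] * f(t - k * h) for k in range(n + 1)) / h ** alpha


# Order 0.5 derivative of f(t) = t at t = 1; the exact value is 1 / Gamma(1.5)
half = gl_derivative(lambda t: t, 1.0, 0.5)
# Order 1 recovers the ordinary backward difference; here f(t) = t^2, f'(1) = 2
first = gl_derivative(lambda t: t * t, 1.0, 1.0)
print(half, 1.0 / math.gamma(1.5), first)
```

Unlike the integer-order case, the sum runs over the entire history of f whenever α < 1. This memory effect is one reason the usual chain-rule derivation of the HJB equation does not carry over directly to fractional-order systems.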
