Fractional-Order Systems Optimal Control via Actor-Critic Reinforcement Learning and Its Validation for Chaotic MFET

被引:9
|
作者
Li, Dongdong [1 ,2 ]
Dong, Jiuxiang [1 ,2 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Peoples R China
[2] Northeastern Univ, Key Lab Vibrat & Control Aeroprop Syst, Minist Educ, Shenyang 110819, Peoples R China
基金
中国国家自然科学基金;
关键词
Fractional-order systems; optimal control; reinforcement learning (RL); neural networks; adaptive dynamic programming; TRACKING CONTROL; SYNCHRONIZATION;
D O I
10.1109/TASE.2024.3361213
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Since the existence of fractional order dynamics, it is difficult to obtain an optimality equation to solve for fractional-order optimal control. In this paper, a fractional Hamilton-Jacobi-Bellman (HJB) equation based on error derivative is proposed, and a corresponding online learning algorithm is designed. The scheme can handle the optimal tracking problem for the 0 < alpha <= 1 order nonlinear systems. Since the traditional quadratic cost function is unbounded at infinite time and the optimal control derived from the discounted cost function fails to stabilize the system asymptotically, a cost function based on the error derivative is proposed, which can avoid these problems, and the system is not restricted to be zero equilibrium. Then, the fractional HJB equation is derived by constructing an auxiliary signal without directly using the chain rule of differentiation. The optimality, stability and convergence of its solution are proved, and actor-critic neural networks (NNs) are established to perform the RL algorithm. Finally, the algorithm is applied to a chaotic magnetic-field electromechanical transducer (MFET) system to verify the effectiveness and advantages.
引用
收藏
页码:1 / 10
页数:10
相关论文
共 50 条
  • [1] Optimal fractional-order PID controller based on fractional-order actor-critic algorithm
    Shalaby, Raafat
    El-Hossainy, Mohammad
    Abo-Zalam, Belal
    Mahmoud, Tarek A.
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (03): : 2347 - 2380
  • [2] Optimal fractional-order PID controller based on fractional-order actor-critic algorithm
    Raafat Shalaby
    Mohammad El-Hossainy
    Belal Abo-Zalam
    Tarek A. Mahmoud
    Neural Computing and Applications, 2023, 35 : 2347 - 2380
  • [3] Optimal Policy of Multiplayer Poker via Actor-Critic Reinforcement Learning
    Shi, Daming
    Guo, Xudong
    Liu, Yi
    Fan, Wenhui
    ENTROPY, 2022, 24 (06)
  • [4] Adaptive Neural Optimal Backstepping Control of Uncertain Fractional-Order Chaotic Circuit Systems via Reinforcement Learning
    Zhong, Mei
    Huang, Chengdai
    Cao, Jinde
    Liu, Heng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2024, 71 (10) : 4707 - 4720
  • [5] Actor-Critic Reinforcement Learning for Tracking Control in Robotics
    Pane, Yudha P.
    Nageshrao, Subramanya P.
    Babuska, Robert
    2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016, : 5819 - 5826
  • [6] Actor-Critic Reinforcement Learning for Control With Stability Guarantee
    Han, Minghao
    Zhang, Lixian
    Wang, Jun
    Pan, Wei
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (04) : 6217 - 6224
  • [7] Adaptive Optimal Surrounding Control of Multiple Unmanned Surface Vessels via Actor-Critic Reinforcement Learning
    Lu, Renzhi
    Wang, Xiaotao
    Ding, Yiyu
    Zhang, Hai-Tao
    Zhao, Feng
    Zhu, Lijun
    He, Yong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [8] Optimal Elevator Group Control via Deep Asynchronous Actor-Critic Learning
    Wei, Qinglai
    Wang, Lingxiao
    Liu, Yu
    Polycarpou, Marios M.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (12) : 5245 - 5256
  • [9] Actor-critic reinforcement learning for the feedback control of a swinging chain
    Dengler, C.
    Lohmann, B.
    IFAC PAPERSONLINE, 2018, 51 (13): : 378 - 383
  • [10] Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning
    Zhu, Hanlin
    Rashidinejad, Paria
    Jiao, Jiantao
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,