Reinforcement learning-based adaptive optimal tracking algorithm for Markov jump systems with partial unknown dynamics

被引:7
|
作者
Tu, Yidong [1 ]
Fang, Haiyang [2 ]
Wang, Hai [3 ]
Shi, Kaibo [4 ]
He, Shuping [1 ]
机构
[1] Anhui Univ, Sch Elect Engn & Automat, Human Robot Integrat Syst & Intelligent Equipment, Hefei 230601, Peoples R China
[2] Chinese Univ Hong Kong, Dept Mech & Automat Engn, Sha Tin, Hong Kong, Peoples R China
[3] Murdoch Univ, Discipline Engn & Energy, Murdoch, WA, Australia
[4] Chengdu Univ, Sch Elect Informat & Elect Engn, Chengdu, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
adaptive optimal tracking; algebraic Riccati equation; Markov jump systems; policy iteration; reinforcement learning; NONLINEAR-SYSTEMS; LINEAR-SYSTEMS;
D O I
10.1002/oca.2903
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, a novel method is proposed to solve the adaptive optimal tracking algorithm for a class of Markov jump systems. First, the augmented system with the tracking signal is built under the decoupling Markov jump systems and it is proved that the selected performance index satisfies the algebraic Riccati equation which can be solved by policy iteration schemes. Then, a reinforcement learning (RL) algorithm is used to solve the coupled algebraic Riccati equations by using partial knowledge of system dynamics. The convergence of the partial model-free integral RL iteration algorithm is also proved. Finally, a simulation example is given to show the better tracking effectiveness and accuracy of the online iteration algorithm comparing with the offline one.
引用
收藏
页码:1435 / 1449
页数:15
相关论文
共 50 条
  • [31] Reinforcement Learning-Based Optimal Stabilization for Unknown Nonlinear Systems Subject to Inputs With Uncertain Constraints
    Zhao, Bo
    Liu, Derong
    Luo, Chaomin
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (10) : 4330 - 4340
  • [32] A reinforcement learning-based scheme for direct adaptive optimal control of linear stochastic systems
    Wong, Wee Chin
    Lee, Jay H.
    [J]. OPTIMAL CONTROL APPLICATIONS & METHODS, 2010, 31 (04): : 365 - 374
  • [33] Optimal Lateral Path-Tracking Control of Vehicles With Partial Unknown Dynamics via DPG-Based Reinforcement Learning Methods
    Shi, Xiongtao
    Li, Yanjie
    Hu, Wenxiao
    Du, Chenglong
    Chen, Chaoyang
    Gui, Weihua
    [J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 1701 - 1710
  • [34] Constrained Reinforcement Learning-Based Closed-Loop Reference Model for Optimal Tracking Control of Unknown Continuous-Time Systems
    Zhang, Haoran
    Zhao, Chunhui
    Ding, Jinliang
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, 21 (04) : 1 - 13
  • [35] Adaptive filtering for jump Markov systems with unknown noise covariance
    Li, Wenling
    Jia, Yingmin
    [J]. IET CONTROL THEORY AND APPLICATIONS, 2013, 7 (13): : 1765 - 1772
  • [36] Nonfragile Output Feedback Tracking Control for Markov Jump Fuzzy Systems Based on Integral Reinforcement Learning Scheme
    Wang, Jing
    Wu, Jiacheng
    Cao, Jinde
    Chadli, Mohammed
    Shen, Hao
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (07) : 4521 - 4530
  • [37] Learning-Based Adaptive Optimal Tracking Control of Strict-Feedback Nonlinear Systems
    Gao, Weinan
    Jiang, Zhong-Ping
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) : 2614 - 2624
  • [38] A Reinforcement Learning-Based Approach for Optimal Output Tracking in Uncertain Nonlinear Systems with Mismatched Disturbances
    Tang, Zezhi
    Rossiter, J. Anthony
    Panoutsos, George
    [J]. 2024 UKACC 14TH INTERNATIONAL CONFERENCE ON CONTROL, CONTROL, 2024, : 169 - 174
  • [39] Reinforcement Q-learning for optimal tracking control of linear discrete-time systems with unknown dynamics
    Kiumarsi, Bahare
    Lewis, Frank L.
    Modares, Hamidreza
    Karimpour, Ali
    Naghibi-Sistani, Mohammad-Bagher
    [J]. AUTOMATICA, 2014, 50 (04) : 1167 - 1175
  • [40] Adaptive Fault-Tolerant Tracking Control for Affine Nonlinear Systems With Unknown Dynamics via Reinforcement Learning
    Roshanravan, Sajad
    Shamaghdari, Saeed
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (01) : 569 - 580