Data-driven policy iteration algorithm for optimal control of continuous-time Ito stochastic systems with Markovian jumps

被引:32
|
作者
Song, Jun [1 ]
He, Shuping [2 ]
Liu, Fei [3 ]
Niu, Yugang [1 ]
Ding, Zhengtao [4 ]
机构
[1] East China Univ Sci & Technol, Key Lab Adv Control & Optimizat Chem Proc, Minist Educ, Shanghai 200237, Peoples R China
[2] Anhui Univ, Sch Elect Engn & Automat, Hefei 230601, Peoples R China
[3] Jiangnan Univ, Inst Automat, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi 214122, Peoples R China
[4] Univ Manchester, Sch Elect & Elect Engn, Control Syst Ctr, Sackville St Bldg, Manchester M13 9PL, Lancs, England
来源
IET CONTROL THEORY AND APPLICATIONS | 2016年 / 10卷 / 12期
关键词
stochastic systems; continuous time systems; iterative methods; Markov processes; convergence of numerical methods; Riccati equations; transforms; optimal control; ST-based data-driven policy iteration algorithm; infinite horizon optimal control problem; continuous-time Ito stochastic systems; Markovian jumps; multiplicative noises; stochastic coupled algebraic Riccatic equation; stochastic CARE; offline iteration algorithm; implicit iterative algorithm; subsystems transformation technique; parallel Kleinman iterative equations; SLIDING MODE CONTROL; OPTIMAL TRACKING CONTROL; ADAPTIVE OPTIMAL-CONTROL; H-INFINITY CONTROL; LINEAR-SYSTEMS; NONLINEAR-SYSTEMS; NEURAL-NETWORKS; TRANSFORMATION; STABILITY;
D O I
10.1049/iet-cta.2015.0973
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This studies the infinite horizon optimal control problem for a class of continuous-time systems subjected to multiplicative noises and Markovian jumps by using a data-driven policy iteration algorithm. The optimal control problem is equivalent to solve a stochastic coupled algebraic Riccatic equation (CARE). An off-line iteration algorithm is first established to converge the solutions of the stochastic CARE, which is generalised from an implicit iterative algorithm. By applying subsystems transformation (ST) technique, the off-line iterative algorithm is decoupled into N parallel Kleinman's iterative equations. To learn the solution of the stochastic CARE from N decomposed linear subsystems data, an ST-based data-driven policy iteration algorithm is proposed and the convergence is proved. Finally, a numerical example is given to illustrate the effectiveness and applicability of the proposed two iterative algorithms.
引用
收藏
页码:1431 / 1439
页数:9
相关论文
共 50 条
  • [31] Policy Iteration Algorithm for Optimal Control of Stochastic Logical Dynamical Systems
    Wu, Yuhu
    Shen, Tielong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (05) : 2031 - 2036
  • [32] Control of continuous-time systems with discrete jumps
    Shi, P
    JOURNAL OF THE AUSTRALIAN MATHEMATICAL SOCIETY SERIES B-APPLIED MATHEMATICS, 1999, 41 : 58 - 82
  • [33] Optimal control of stochastic singular affine systems with Markovian jumps
    Wang, Xin
    Wang, Lisha
    Liu, Yuxiang
    JOURNAL OF INEQUALITIES AND APPLICATIONS, 2022, 2022 (01)
  • [34] Optimal control of stochastic singular affine systems with Markovian jumps
    Xin Wang
    Lisha Wang
    Yuxiang Liu
    Journal of Inequalities and Applications, 2022
  • [35] Data-based approximate policy iteration for affine nonlinear continuous-time optimal control design
    Luo, Biao
    Wu, Huai-Ning
    Huang, Tingwen
    Liu, Derong
    AUTOMATICA, 2014, 50 (12) : 3281 - 3290
  • [36] A Time-Delay Modeling Approach for Data-Driven Predictive Control of Continuous-Time Systems
    Liu, Juan
    Yang, Xindi
    Zhang, Hao
    Wang, Zhuping
    Yan, Huaicheng
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 : 6408 - 6420
  • [37] Bias-policy iteration based optimal control for unknown continuous-time linear periodic systems
    Li, Xiang
    Jiang, Huaiyuan
    Zhou, Bin
    SYSTEMS & CONTROL LETTERS, 2024, 189
  • [38] Continuous-Time Stochastic Policy Iteration of Adaptive Dynamic Programming
    Wei, Qinglai
    Zhou, Tianmin
    Lu, Jingwei
    Liu, Yu
    Su, Shuai
    Xiao, Jun
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (10): : 6375 - 6387
  • [39] Adaptive optimal tracking control for nonlinear continuous-time systems with time delay using value iteration algorithm
    Shi, Jing
    Yue, Dong
    Xie, Xiangpeng
    NEUROCOMPUTING, 2020, 396 : 172 - 178
  • [40] Localized data-driven consensus control for continuous-time multi-agent systems
    Chang, Zeze
    Li, Zhongkui
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024,