Data-driven policy iteration algorithm for optimal control of continuous-time Ito stochastic systems with Markovian jumps

被引:32
|
作者
Song, Jun [1 ]
He, Shuping [2 ]
Liu, Fei [3 ]
Niu, Yugang [1 ]
Ding, Zhengtao [4 ]
机构
[1] East China Univ Sci & Technol, Key Lab Adv Control & Optimizat Chem Proc, Minist Educ, Shanghai 200237, Peoples R China
[2] Anhui Univ, Sch Elect Engn & Automat, Hefei 230601, Peoples R China
[3] Jiangnan Univ, Inst Automat, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi 214122, Peoples R China
[4] Univ Manchester, Sch Elect & Elect Engn, Control Syst Ctr, Sackville St Bldg, Manchester M13 9PL, Lancs, England
来源
IET CONTROL THEORY AND APPLICATIONS | 2016年 / 10卷 / 12期
关键词
stochastic systems; continuous time systems; iterative methods; Markov processes; convergence of numerical methods; Riccati equations; transforms; optimal control; ST-based data-driven policy iteration algorithm; infinite horizon optimal control problem; continuous-time Ito stochastic systems; Markovian jumps; multiplicative noises; stochastic coupled algebraic Riccatic equation; stochastic CARE; offline iteration algorithm; implicit iterative algorithm; subsystems transformation technique; parallel Kleinman iterative equations; SLIDING MODE CONTROL; OPTIMAL TRACKING CONTROL; ADAPTIVE OPTIMAL-CONTROL; H-INFINITY CONTROL; LINEAR-SYSTEMS; NONLINEAR-SYSTEMS; NEURAL-NETWORKS; TRANSFORMATION; STABILITY;
D O I
10.1049/iet-cta.2015.0973
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This studies the infinite horizon optimal control problem for a class of continuous-time systems subjected to multiplicative noises and Markovian jumps by using a data-driven policy iteration algorithm. The optimal control problem is equivalent to solve a stochastic coupled algebraic Riccatic equation (CARE). An off-line iteration algorithm is first established to converge the solutions of the stochastic CARE, which is generalised from an implicit iterative algorithm. By applying subsystems transformation (ST) technique, the off-line iterative algorithm is decoupled into N parallel Kleinman's iterative equations. To learn the solution of the stochastic CARE from N decomposed linear subsystems data, an ST-based data-driven policy iteration algorithm is proposed and the convergence is proved. Finally, a numerical example is given to illustrate the effectiveness and applicability of the proposed two iterative algorithms.
引用
收藏
页码:1431 / 1439
页数:9
相关论文
共 50 条
  • [1] Data-driven policy iteration algorithm for continuous-time stochastic linear-quadratic optimal control problems
    Zhang, Heng
    Li, Na
    ASIAN JOURNAL OF CONTROL, 2024, 26 (01) : 481 - 489
  • [2] Stochastic linear quadratic optimal control for continuous-time systems based on policy iteration
    College of Information Science and Engineering,, Northeastern University,, Shenyang
    110004, China
    不详
    110034, China
    Kongzhi yu Juece Control Decis, 9 (1674-1678):
  • [3] Value Iteration and Data-Driven Optimal Output Regulation of Linear Continuous-Time Systems
    Jiang, Yi
    Gao, Weinan
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 1509 - 1514
  • [4] Adaptive Optimal Control Algorithm for Continuous-Time Nonlinear Systems Based on Policy Iteration
    Vrabie, D.
    Lewis, F. L.
    47TH IEEE CONFERENCE ON DECISION AND CONTROL, 2008 (CDC 2008), 2008, : 73 - 79
  • [5] COMPUTATIONAL ALGORITHM FOR OPTIMAL CONTROL OF CONTINUOUS-TIME STOCHASTIC SYSTEMS
    TACKER, EC
    SANDERS, CW
    LINTON, TD
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1973, AC18 (03) : 310 - 311
  • [6] Neuro-Control for Continuous-Time Stochastic Nonlinear Systems via Online Policy Iteration Algorithm
    Zhou, Tianmin
    Hou, Jiaxu
    Li, Handong
    Di, Zengru
    Zhao, Bo
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 1499 - 1503
  • [7] H2/H∞ Control for Continuous-Time Stochastic Systems with Infinite Markovian Jumps
    Liu, Yueying
    Hou, Ting
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 1493 - 1497
  • [8] Data-Driven Based Optimal Output-Feedback Control of Continuous-Time Systems
    Li, Zican
    Wu, Tao
    Na, Jing
    Zhao, Jun
    Gao, Guanbin
    Herrmann, Guido
    PROCEEDINGS OF 2018 10TH INTERNATIONAL CONFERENCE ON MODELLING, IDENTIFICATION AND CONTROL (ICMIC), 2018,
  • [9] Adaptive optimal control for continuous-time linear systems based on policy iteration
    Vrabie, D.
    Pastravanu, O.
    Abu-Khalaf, M.
    Lewis, F. L.
    AUTOMATICA, 2009, 45 (02) : 477 - 484
  • [10] Data-driven Iterative Learning Control for Continuous-Time Systems
    Chu, Bing
    Rapisarda, Paolo
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 4626 - 4631