Data-driven policy iteration algorithm for optimal control of continuous-time Ito stochastic systems with Markovian jumps

被引:32
|
作者
Song, Jun [1 ]
He, Shuping [2 ]
Liu, Fei [3 ]
Niu, Yugang [1 ]
Ding, Zhengtao [4 ]
机构
[1] East China Univ Sci & Technol, Key Lab Adv Control & Optimizat Chem Proc, Minist Educ, Shanghai 200237, Peoples R China
[2] Anhui Univ, Sch Elect Engn & Automat, Hefei 230601, Peoples R China
[3] Jiangnan Univ, Inst Automat, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi 214122, Peoples R China
[4] Univ Manchester, Sch Elect & Elect Engn, Control Syst Ctr, Sackville St Bldg, Manchester M13 9PL, Lancs, England
来源
IET CONTROL THEORY AND APPLICATIONS | 2016年 / 10卷 / 12期
关键词
stochastic systems; continuous time systems; iterative methods; Markov processes; convergence of numerical methods; Riccati equations; transforms; optimal control; ST-based data-driven policy iteration algorithm; infinite horizon optimal control problem; continuous-time Ito stochastic systems; Markovian jumps; multiplicative noises; stochastic coupled algebraic Riccatic equation; stochastic CARE; offline iteration algorithm; implicit iterative algorithm; subsystems transformation technique; parallel Kleinman iterative equations; SLIDING MODE CONTROL; OPTIMAL TRACKING CONTROL; ADAPTIVE OPTIMAL-CONTROL; H-INFINITY CONTROL; LINEAR-SYSTEMS; NONLINEAR-SYSTEMS; NEURAL-NETWORKS; TRANSFORMATION; STABILITY;
D O I
10.1049/iet-cta.2015.0973
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This studies the infinite horizon optimal control problem for a class of continuous-time systems subjected to multiplicative noises and Markovian jumps by using a data-driven policy iteration algorithm. The optimal control problem is equivalent to solve a stochastic coupled algebraic Riccatic equation (CARE). An off-line iteration algorithm is first established to converge the solutions of the stochastic CARE, which is generalised from an implicit iterative algorithm. By applying subsystems transformation (ST) technique, the off-line iterative algorithm is decoupled into N parallel Kleinman's iterative equations. To learn the solution of the stochastic CARE from N decomposed linear subsystems data, an ST-based data-driven policy iteration algorithm is proposed and the convergence is proved. Finally, a numerical example is given to illustrate the effectiveness and applicability of the proposed two iterative algorithms.
引用
收藏
页码:1431 / 1439
页数:9
相关论文
共 50 条
  • [21] On approximate policy iteration for continuous-time systems
    Wernrud, Andreas
    Rantzer, Anders
    2005 44th IEEE Conference on Decision and Control & European Control Conference, Vols 1-8, 2005, : 1453 - 1458
  • [22] Optimal Control for Continuous-time Nonlinear Systems based on a Linear-like Policy Iteration
    Tahirovic, Adnan
    Astolfi, Alessandro
    2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 5238 - 5243
  • [23] Generalized Policy Iteration for Continuous-Time Systems
    Vrabie, Draguna
    Lewis, Frank L.
    IJCNN: 2009 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1- 6, 2009, : 2677 - 2684
  • [24] Orthogonal Polynomial Bases for Data-Driven Analysis and Control of Continuous-Time Systems
    Rapisarda, P.
    van Waarde, Henk J.
    Camlibel, M. K.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (07) : 4307 - 4319
  • [25] Data-driven analysis and control of continuous-time systems under aperiodic sampling
    Berberich, Julian
    Wildhagen, Stefan
    Hertneck, Michael
    Allgoewer, Frank
    IFAC PAPERSONLINE, 2021, 54 (07): : 210 - 215
  • [26] Stochastic Optimal Tracking with Preview for Linear Continuous-Time Markovian Jump Systems
    Nakura, Gou
    2008 PROCEEDINGS OF SICE ANNUAL CONFERENCE, VOLS 1-7, 2008, : 1756 - 1761
  • [27] Policy Iteration for Optimal Switching with Continuous-time Dynamics
    Sardarmehni, Tohid
    Heydari, Ali
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3536 - 3543
  • [28] Policy Iteration Algorithm for Online Design of Robust Control for a Class of Continuous-Time Nonlinear Systems
    Wang, Ding
    Liu, Derong
    Li, Hongliang
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2014, 11 (02) : 627 - 632
  • [29] Value Iteration and Adaptive Optimal Control for Linear Continuous-time Systems
    Bian, Tao
    Jiang, Zhong-Ping
    PROCEEDINGS OF THE 2015 7TH IEEE INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS (CIS) AND ROBOTICS, AUTOMATION AND MECHATRONICS (RAM), 2015, : 53 - 58
  • [30] Data-driven optimal tracking control of discrete-time multi-agent systems with two-stage policy iteration algorithm
    Peng, Zhinan
    Zhao, Yiyi
    Hu, Jiangping
    Ghosh, Bijoy Kumar
    INFORMATION SCIENCES, 2019, 481 : 189 - 202