Optimal tracking control for completely unknown nonlinear discrete-time Markov jump systems using data-based reinforcement learning method

被引:31
|
作者
Jiang, He [1 ]
Zhang, Huaguang [1 ]
Luo, Yanhong [1 ]
Wang, Junyi [1 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Box 134, Shenyang 110819, Peoples R China
基金
中国国家自然科学基金;
关键词
Optimal tracking control; Markov jump systems; Data-based; Reinforcement learning; Adaptive dynamic programming; Neural networks; SYNCHRONIZATION CONTROL; GRAPHICAL GAMES; CONTROL SCHEME; STABILITY; ALGORITHM;
D O I
10.1016/j.neucom.2016.02.029
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we develop a novel optimal tracking control scheme for a class of nonlinear discrete-time Markov jump systems (MJSs) by utilizing a data-based reinforcement learning method. It is not practical to obtain accurate system models of the real-world MJSs due to the existence of abrupt variations in their system structures. Consequently, most traditional model-based methods for MJSs are invalid for the practical engineering applications. In order to overcome the difficulties without any identification scheme which would cause estimation errors, a model-free adaptive dynamic programming (ADP) algorithm will be designed by using system data rather than accurate system functions. Firstly, we combine the tracking error dynamics and reference system dynamics to form an augmented system. Then, based on the augmented system, a new performance index function with discount factor is formulated for the optimal tracking control problem via Markov chain and weighted sum technique. Neural networks are employed to implement the on-line ADP learning algorithm. Finally, a simulation example is given to demonstrate the effectiveness of our proposed approach. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:176 / 182
页数:7
相关论文
共 50 条
  • [21] Off-Policy Reinforcement Learning for Optimal Preview Tracking Control of Linear Discrete-Time systems with unknown dynamics
    Wang, Chao-Ran
    Wu, Huai-Ning
    2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 1402 - 1407
  • [22] Data-based optimal control design with reinforcement learning for nonlinear PDE systems
    Zheng, Yuqing
    Zhang, Guoshan
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 345 - 350
  • [23] Online optimal and adaptive integral tracking control for varying discrete-time systems using reinforcement learning
    Sanusi, Ibrahim
    Mills, Andrew
    Dodd, Tony
    Konstantopoulos, George
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2020, 34 (08) : 971 - 991
  • [24] Reinforcement Q-learning and Optimal Tracking Control of Unknown Discrete-time Multi-player Systems Based on Game Theory
    Zhao, Jin-Gang
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2024, 22 (05) : 1751 - 1759
  • [25] Learning Optimal Control Policy for Unknown Discrete-Time Systems
    Lai, Jing
    Xiong, Junlin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (11) : 4191 - 4195
  • [26] Reinforcement Learning-Based Robust Tracking Control for Unknown Markov Jump Systems and its Application
    Shen, Hao
    Wu, Jiacheng
    Wang, Yun
    Wang, Jing
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (03) : 1211 - 1215
  • [27] A maximum principle for optimal control of discrete-time stochastic systems with Markov jump
    Lin X.-Y.
    Wang X.-R.
    Zhang W.-H.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2024, 41 (05): : 895 - 904
  • [28] Linear Quadratic Optimal Control for Discrete-time Markov Jump Linear Systems
    Han, Chunyan
    Li, Hongdan
    Wang, Wei
    Zhang, Huanshui
    2018 IEEE 14TH INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2018, : 769 - 774
  • [29] Reinforcement learning-based adaptive optimal tracking algorithm for Markov jump systems with partial unknown dynamics
    Tu, Yidong
    Fang, Haiyang
    Wang, Hai
    Shi, Kaibo
    He, Shuping
    OPTIMAL CONTROL APPLICATIONS & METHODS, 2022, 43 (05): : 1435 - 1449
  • [30] Data-based L 2 gain optimal control for discrete-time system with unknown dynamics
    Wang, Jiamin
    Liu, Jian
    Zheng, Yuanshi
    Zhang, Dong
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 360 (06): : 4354 - 4377