Optimal tracking control for completely unknown nonlinear discrete-time Markov jump systems using data-based reinforcement learning method

被引:31
|
作者
Jiang, He [1 ]
Zhang, Huaguang [1 ]
Luo, Yanhong [1 ]
Wang, Junyi [1 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Box 134, Shenyang 110819, Peoples R China
基金
中国国家自然科学基金;
关键词
Optimal tracking control; Markov jump systems; Data-based; Reinforcement learning; Adaptive dynamic programming; Neural networks; SYNCHRONIZATION CONTROL; GRAPHICAL GAMES; CONTROL SCHEME; STABILITY; ALGORITHM;
D O I
10.1016/j.neucom.2016.02.029
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we develop a novel optimal tracking control scheme for a class of nonlinear discrete-time Markov jump systems (MJSs) by utilizing a data-based reinforcement learning method. It is not practical to obtain accurate system models of the real-world MJSs due to the existence of abrupt variations in their system structures. Consequently, most traditional model-based methods for MJSs are invalid for the practical engineering applications. In order to overcome the difficulties without any identification scheme which would cause estimation errors, a model-free adaptive dynamic programming (ADP) algorithm will be designed by using system data rather than accurate system functions. Firstly, we combine the tracking error dynamics and reference system dynamics to form an augmented system. Then, based on the augmented system, a new performance index function with discount factor is formulated for the optimal tracking control problem via Markov chain and weighted sum technique. Neural networks are employed to implement the on-line ADP learning algorithm. Finally, a simulation example is given to demonstrate the effectiveness of our proposed approach. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:176 / 182
页数:7
相关论文
共 50 条
  • [31] Generalized Policy Iteration-based Reinforcement Learning Algorithm for Optimal Control of Unknown Discrete-time Systems
    Lin, Mingduo
    Zhao, Bo
    Liu, Derong
    Liu, Xi
    Luo, Fangchao
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 3650 - 3655
  • [32] Adaptive optimal output feedback tracking control for unknown discrete-time linear systems using a combined reinforcement Q-learning and internal model method
    Sun, Weijie
    Zhao, Guangyue
    Peng, Yunjian
    IET CONTROL THEORY AND APPLICATIONS, 2019, 13 (18): : 3075 - 3086
  • [33] Fuzzy H8 Control of Discrete-Time Nonlinear Markov Jump Systems via a Novel Hybrid Reinforcement Q-Learning Method
    Wang, Jing
    Wu, Jiacheng
    Shen, Hao
    Cao, Jinde
    Rutkowski, Leszek
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (11) : 7380 - 7391
  • [34] Optimal Output Regulation of Linear Discrete-Time Systems With Unknown Dynamics Using Reinforcement Learning
    Jiang, Yi
    Kiumarsi, Bahare
    Fan, Jialu
    Chai, Tianyou
    Li, Jinna
    Lewis, Frank L.
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (07) : 3147 - 3156
  • [35] Actor-Critic-Based Optimal Tracking for Partially Unknown Nonlinear Discrete-Time Systems
    Kiumarsi, Bahare
    Lewis, Frank L.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (01) : 140 - 151
  • [36] Discrete-Time Nonlinear Optimal Control Using Multi-Step Reinforcement Learning
    An, Ningbo
    Wang, Qishao
    Zhao, Xiaochuan
    Wang, Qingyun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (04) : 2279 - 2283
  • [37] Optimal Tracking Control of Unknown Discrete-Time Linear Systems Using Input-Output Measured Data
    Kiumarsi, Bahare
    Lewis, Frank L.
    Naghibi-Sistani, Mohammad-Bagher
    Karimpour, Ali
    IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (12) : 2770 - 2779
  • [38] Learning-based robust output tracking control for unknown discrete-time nonlinear systems with dynamic uncertainty
    Liu, Fang
    Peng, Hui
    NEUROCOMPUTING, 2024, 606
  • [39] Non-zero-sum games of discrete-time Markov jump systems with unknown dynamics: An off-policy reinforcement learning method
    Zhang, Xuewen
    Shen, Hao
    Li, Feng
    Wang, Jing
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (02) : 949 - 968
  • [40] Reinforcement Q-Learning Algorithm for H∞ Tracking Control of Unknown Discrete-Time Linear Systems
    Peng, Yunjian
    Chen, Qian
    Sun, Weijie
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (11): : 4109 - 4122