Reinforcement Learning Controller Design for Affine Nonlinear Discrete-Time Systems using Online Approximators

被引:154
|
作者
Yang, Qinmin [1 ]
Jagannathan, Sarangapani [2 ]
机构
[1] Zhejiang Univ, Dept Control Sci & Engn, State Key Lab Ind Control Technol, Hangzhou 310027, Zhejiang, Peoples R China
[2] Missouri Univ Sci & Technol, Dept Elect & Comp Engn, Rolla, MO 65409 USA
关键词
Adaptive critic; dynamic programming (DP); Lyapunov method; neural networks (NNs); online approximators (OLAs); online learning; reinforcement learning;
D O I
10.1109/TSMCB.2011.2166384
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, reinforcement learning state- and output-feedback-based adaptive critic controller designs are proposed by using the online approximators (OLAs) for a general multi-input and multioutput affine unknown nonlinear discrete-time systems in the presence of bounded disturbances. The proposed controller design has two entities, an action network that is designed to produce optimal signal and a critic network that evaluates the performance of the action network. The critic estimates the cost-to-go function which is tuned online using recursive equations derived from heuristic dynamic programming. Here, neural networks (NNs) are used both for the action and critic whereas any OLAs, such as radial basis functions, splines, fuzzy logic, etc., can be utilized. For the output-feedback counterpart, an additional NN is designated as the observer to estimate the unavailable system states, and thus, separation principle is not required. The NN weight tuning laws for the controller schemes are also derived while ensuring uniform ultimate boundedness of the closed-loop system using Lyapunov theory. Finally, the effectiveness of the two controllers is tested in simulation on a pendulum balancing system and a two-link robotic arm system.
引用
收藏
页码:377 / 390
页数:14
相关论文
共 50 条
  • [41] Optimal Control for Constrained Discrete-Time Nonlinear Systems Based on Safe Reinforcement Learning
    Zhang, Lingzhi
    Xie, Lei
    Jiang, Yi
    Li, Zhishan
    Liu, Xueqin
    Su, Hongye
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 12
  • [42] Feedback control design for discrete-time piecewise affine systems
    XU Jun1
    2.School of Electrical and Electronic Engineering
    山东大学学报(工学版), 2007, (03) : 1 - 10
  • [43] Feedback control design for discrete-time piecewise affine systems
    Xu, J
    Xie, LH
    Feng, G
    2005 INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), VOLS 1 AND 2, 2005, : 425 - 430
  • [44] Variable structure controller design for output tracking of a class of discrete-time nonlinear systems
    Liao, TL
    Chien, TI
    JSME INTERNATIONAL JOURNAL SERIES C-MECHANICAL SYSTEMS MACHINE ELEMENTS AND MANUFACTURING, 2002, 45 (02) : 462 - 469
  • [45] Discrete-Time Resilient Controller Design with General Criteria for a Class of Uncertain Nonlinear Systems
    Feng, Fan
    Yaz, Edwin E.
    Schneider, Susan C.
    Yaz, Yvonne I.
    2014 AMERICAN CONTROL CONFERENCE (ACC), 2014, : 4268 - 4273
  • [46] Design of Robust Adaptive Sliding Mode Controller for Uncertain Nonlinear Discrete-Time Systems
    Zhao, Meng
    Liang, Hongjing
    Yu, Zhandong
    Zhou, Qi
    Wang, Lijie
    2017 4TH INTERNATIONAL CONFERENCE ON INFORMATION, CYBERNETICS AND COMPUTATIONAL SOCIAL SYSTEMS (ICCSS), 2017, : 85 - 90
  • [47] PID controller design for nonlinear systems represented by discrete-time local model networks
    Hametner, Christoph
    Mayr, Christian H.
    Kozek, Martin
    Jakubek, Stefan
    INTERNATIONAL JOURNAL OF CONTROL, 2013, 86 (09) : 1453 - 1466
  • [48] Stabilization and tracking controller for a class of nonlinear discrete-time systems
    Sharma, B. B.
    Kar, I. N.
    CHAOS SOLITONS & FRACTALS, 2011, 44 (10) : 902 - 913
  • [49] A robust certainty equivalence controller for discrete-time nonlinear systems
    Kalyan, A
    Gopal, M
    PROCEEDINGS OF IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY 2000, VOLS 1 AND 2, 2000, : 405 - 409
  • [50] H∞ Static Output-Feedback Control Design for Discrete-Time Systems Using Reinforcement Learning
    Valadbeigi, Amir Parviz
    Sedigh, Ali Khaki
    Lewis, F. L.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (02) : 396 - 406