Reinforcement Learning Controller Design for Affine Nonlinear Discrete-Time Systems using Online Approximators

Cited by: 154
Authors
Yang, Qinmin [1]
Jagannathan, Sarangapani [2]
Affiliations
[1] Zhejiang Univ, Dept Control Sci & Engn, State Key Lab Ind Control Technol, Hangzhou 310027, Zhejiang, Peoples R China
[2] Missouri Univ Sci & Technol, Dept Elect & Comp Engn, Rolla, MO 65409 USA
Keywords
Adaptive critic; dynamic programming (DP); Lyapunov method; neural networks (NNs); online approximators (OLAs); online learning; reinforcement learning
DOI
10.1109/TSMCB.2011.2166384
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
In this paper, reinforcement-learning-based adaptive critic controller designs with state feedback and output feedback are proposed, using online approximators (OLAs), for a general class of unknown affine nonlinear discrete-time multi-input multi-output systems in the presence of bounded disturbances. The proposed controller design has two entities: an action network that is designed to produce an optimal signal, and a critic network that evaluates the performance of the action network. The critic estimates the cost-to-go function and is tuned online using recursive equations derived from heuristic dynamic programming. Here, neural networks (NNs) are used for both the action and critic networks, although any OLA, such as radial basis functions, splines, or fuzzy logic, can be utilized. For the output-feedback counterpart, an additional NN is designated as the observer to estimate the unavailable system states, and thus the separation principle is not required. The NN weight tuning laws for the controller schemes are also derived while ensuring uniform ultimate boundedness of the closed-loop system using Lyapunov theory. Finally, the effectiveness of the two controllers is tested in simulation on a pendulum balancing system and a two-link robotic arm system.
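The critic idea in the abstract can be illustrated with a small sketch. It is not the paper's tuning law: it assumes a scalar linear plant x(k+1) = a*x(k) + b*u(k) as a stand-in for the affine system, a fixed hand-picked feedback gain in place of the action network, a quadratic one-step cost, and a single-weight critic J(x) = w*x^2 updated by a normalized temporal-difference rule in the spirit of heuristic dynamic programming. All names and parameter values below are hypothetical.

```python
def closed_form_weight(a=1.1, b=1.0, gain=0.6, q=1.0, rho=1.0, gamma=0.9):
    """Exact cost-to-go weight for u = -gain * x on the assumed scalar plant."""
    c = a - b * gain  # closed-loop pole
    return (q + rho * gain ** 2) / (1.0 - gamma * c ** 2)


def evaluate_policy(a=1.1, b=1.0, gain=0.6, q=1.0, rho=1.0, gamma=0.9,
                    alpha=0.5, episodes=20, steps=10):
    """Tune a one-weight critic J(x) = w * x**2 online for a fixed policy."""
    w = 0.0  # critic weight, started at zero
    for _ in range(episodes):
        x = 1.0  # restart the state so the critic keeps seeing excitation
        for _ in range(steps):
            u = -gain * x                    # fixed policy (assumption)
            cost = q * x * x + rho * u * u   # one-step cost
            x_next = a * x + b * u           # assumed plant model
            phi, phi_next = x * x, x_next * x_next  # critic basis values
            # Temporal-difference error of the cost-to-go recursion
            td_error = cost + gamma * w * phi_next - w * phi
            # Normalized gradient step on the critic weight
            w += alpha * td_error * phi / (phi * phi + 1e-12)
            x = x_next
    return w


if __name__ == "__main__":
    print("learned critic weight:", evaluate_policy())
    print("closed-form weight:   ", closed_form_weight())
```

With these assumed values the learned weight should settle at the closed-form cost-to-go weight for the fixed gain. Tuning the action network, handling unknown dynamics, and the output-feedback observer described in the abstract are outside this sketch.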
Pages: 377-390
Page count: 14