Optimal Trajectory Output Tracking Control with a Q-learning Algorithm

被引:0
|
作者
Vamvoudakis, Kyriakos G. [1 ]
机构
[1] Univ Calif Santa Barbara, Ctr Control Dynam Syst & Computat CCDC, Santa Barbara, CA 93106 USA
关键词
Q-learning; output trajectory tracking; uncertain systems; TIME LINEAR-SYSTEMS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper a novel Q-learning algorithm is proposed to solve the Linear Quadratic Output Tracking (LQOT) control problem of a linear time invariant system with completely unknown system and reference dynamics. We first define an action-dependent value function for the LQOT problem after we augment the system and the reference states and pick appropriately the user-defined matrices in the performance index of the augmented state. An integral reinforcement learning approach is used to develop a reinforcement learning structure to estimate the parameters of the Q-function online while also guaranteeing closed-loop stability, trajectory tracking and convergence to the optimal tracking solution. A simulation result of an unknown spring-mass-damper linear system is presented to show the efficacy of the proposed approach.
引用
收藏
页码:5752 / 5757
页数:6
相关论文
共 50 条
  • [1] Output Feedback Optimal Tracking Control Using Reinforcement Q-Learning
    Rizvi, Syed Ali Asad
    Lin, Zongli
    2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018, : 3423 - 3428
  • [2] Nonlinear neuro-optimal tracking control via stable iterative Q-learning algorithm
    Wei, Qinglai
    Song, Ruizhuo
    Sun, Qiuye
    NEUROCOMPUTING, 2015, 168 : 520 - 528
  • [3] Off-policy Q-learning: Optimal tracking control for networked control systems
    Li J.-N.
    Yin Z.-X.
    Kongzhi yu Juece/Control and Decision, 2019, 34 (11): : 2343 - 2349
  • [4] Q-learning algorithm for optimal multilevel thresholding
    Yin, PY
    IC-AI'2001: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS I-III, 2001, : 335 - 340
  • [5] Q-learning Based Adaptive Optimal Control for Linear Quadratic Tracking Problem
    Sharma, Shashi Kant
    Jha, Sumit Kumar
    Dhawan, Amit
    Tiwari, Manish
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2023, 21 (08) : 2718 - 2725
  • [6] Q-learning Based Adaptive Optimal Control for Linear Quadratic Tracking Problem
    Shashi Kant Sharma
    Sumit Kumar Jha
    Amit Dhawan
    Manish Tiwari
    International Journal of Control, Automation and Systems, 2023, 21 : 2718 - 2725
  • [7] Horizontal trajectory control of stratospheric airships in wind field using Q-learning algorithm
    Yang, Xiaowei
    Yang, Xixiang
    Deng, Xiaolong
    AEROSPACE SCIENCE AND TECHNOLOGY, 2020, 106
  • [8] Experience replay-based output feedback Q-learning scheme for optimal output tracking control of discrete-time linear systems
    Rizvi, Syed Ali Asad
    Lin, Zongli
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2019, 33 (12) : 1825 - 1842
  • [9] Discrete-Time Optimal Control Scheme Based on Q-Learning Algorithm
    Wei, Qinglai
    Liu, Derong
    Song, Ruizhuo
    2016 SEVENTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2016, : 125 - 130
  • [10] State-flipped control and Q-learning for finite horizon output tracking of Boolean control networks
    Sun, Changle
    Li, Haitao
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2023, 54 (12) : 2452 - 2464