Reinforcement Learning for Linear Continuous-time Systems: an Incremental Learning Approach

被引:20
|
作者
Bian, Tao [1 ]
Jiang, Zhong-Ping [2 ]
机构
[1] Bank Amer Merrill Lynch, One Bryant Pk, New York, NY 10036 USA
[2] NYU, Tandon Sch Engn, Dept Elect & Comp Engn, Control & Networks Lab, 5 Metrotech Ctr, Brooklyn, NY 11201 USA
基金
美国国家科学基金会;
关键词
Adaptive optimal control; robust dynamic programming; value iteration (VI); ADAPTIVE OPTIMAL-CONTROL; STABILIZATION; STATE;
D O I
10.1109/JAS.2019.1911390
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we introduce a novel reinforcement learning (RL) scheme for linear continuous-time dynamical systems. Different from traditional batch learning algorithms, an incremental learning approach is developed, which provides a more efficient way to tackle the on-line learning problem in real-world applications. We provide concrete convergence and robust analysis on this incremental-learning algorithm. An extension to solving robust optimal control problems is also given. Two simulation examples are also given to illustrate the effectiveness of our theoretical result.
引用
收藏
页码:433 / 440
页数:8
相关论文
共 50 条
  • [1] Reinforcement Learning for Linear Continuous-time Systems: an Incremental Learning Approach
    Tao Bian
    Zhong-Ping Jiang
    IEEE/CAA Journal of Automatica Sinica, 2019, 6 (02) : 433 - 440
  • [2] Reinforcement learning for adaptive optimal control of continuous-time linear periodic systems
    Pang, Bo
    Jiang, Zhong-Ping
    Mareels, Iven
    AUTOMATICA, 2020, 118
  • [3] Online Reinforcement Learning in Stochastic Continuous-Time Systems
    Faradonbeh, Mohamad Kazem Shirani
    Faradonbeh, Mohamad Sadegh Shirani
    THIRTY SIXTH ANNUAL CONFERENCE ON LEARNING THEORY, VOL 195, 2023, 195 : 612 - 656
  • [4] Continuous-time reinforcement learning approach for portfolio management with time penalization
    Garcia-Galicia, Mauricio
    Carsteanu, Alin A.
    Clempner, Julio B.
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 129 : 27 - 36
  • [5] Integral Reinforcement Learning with Explorations for Continuous-Time Nonlinear Systems
    Lee, Jae Young
    Park, Jin Bae
    Choi, Yoon Ho
    2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2012,
  • [6] Online Solution to the Linear Quadratic Tracking Problem of Continuous-time Systems using Reinforcement Learning
    Modares, Hamidreza
    Lewis, Frank L.
    2013 IEEE 52ND ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2013, : 3851 - 3856
  • [7] CHOQUET REGULARIZATION FOR CONTINUOUS-TIME REINFORCEMENT LEARNING
    Han, Xia
    Wang, Ruodu
    Zhou, Xun Yu
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2023, 61 (05) : 2777 - 2801
  • [8] Safe Q-learning for continuous-time linear systems
    Bandyopadhyay, Soutrik
    Bhasin, Shubhendu
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 241 - 246
  • [9] Dynamic Multiobjective Control for Continuous-Time Systems Using Reinforcement Learning
    Lopez, Victor G.
    Lewis, Frank L.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2019, 64 (07) : 2869 - 2874
  • [10] Reinforcement Learning and Adaptive Optimal Control for Continuous-Time Nonlinear Systems: A Value Iteration Approach
    Bian, Tao
    Jiang, Zhong-Ping
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (07) : 2781 - 2790