IMPACT OF COMPUTATION IN INTEGRAL REINFORCEMENT LEARNING FOR CONTINUOUS-TIME CONTROL

Cited by: 0
Authors
Cao, Wenhan [1, 2]
Pan, Wei [1 ]
Affiliations
[1] Department of Computer Science, University of Manchester, United Kingdom
[2] School of Vehicle and Mobility, Tsinghua University, China
Keywords
Compendex;
DOI
Not available
Chinese Library Classification number
Subject classification code
Abstract
Reinforcement learning
Related papers
50 in total
  • [41] An optimal iterative learning control for continuous-time systems
    Nasiri, Mohammad Reza
    IECON 2006 - 32ND ANNUAL CONFERENCE ON IEEE INDUSTRIAL ELECTRONICS, VOLS 1-11, 2006, : 675 - 680
  • [42] Computation of LQ Control for Continuous-Time Bimodal Switched Linear Systems
    Hara, Naoyuki
    Konishi, Keiji
    2019 AMERICAN CONTROL CONFERENCE (ACC), 2019, : 4955 - 4960
  • [43] Linear Quadratic Tracking Control of Partially-Unknown Continuous-Time Systems Using Reinforcement Learning
    Modares, Hamidreza
    Lewis, Frank L.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2014, 59 (11) : 3051 - 3056
  • [44] Continuous-Time Reinforcement Learning Control: A Review of Theoretical Results, Insights on Performance, and Needs for New Designs
    Wallace, Brent A.
    Si, Jennie
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (08) : 10199 - 10219
  • [45] Optimal control for continuous-time Markov jump singularly perturbed systems: A hybrid reinforcement learning scheme
    Huang, Yaling
    Li, Wenqian
    Wang, Yun
    Shen, Hao
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2024, 361 (07):
  • [46] Robust safe reinforcement learning control of unknown continuous-time nonlinear systems with state constraints and disturbances
    Zhang, Haoran
    Zhao, Chunhui
    Ding, Jinliang
    JOURNAL OF PROCESS CONTROL, 2023, 128
  • [47] Finite-horizon optimal control for continuous-time uncertain nonlinear systems using reinforcement learning
    Zhao, Jingang
    Gan, Minggang
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2020, 51 (13) : 2429 - 2440
  • [48] Learning continuous-time working memory tasks with on-policy neural reinforcement learning
    Zambrano, Davide
    Roelfsema, Pieter R.
    Bohte, Sander
    NEUROCOMPUTING, 2021, 461 : 635 - 656
  • [49] Strong accessibility and integral manifolds of the continuous-time nonlinear control systems
    Wyrwas, Malgorzata
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2019, 469 (02) : 935 - 959
  • [50] Computation of the continuous-time PAR of an OFDM signal
    Yu, H
    Wei, G
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PROCEEDINGS: SIGNAL PROCESSING FOR COMMUNICATIONS SPECIAL SESSIONS, 2003, : 529 - 531