IMPACT OF COMPUTATION IN INTEGRAL REINFORCEMENT LEARNING FOR CONTINUOUS-TIME CONTROL

被引：0

作者：

Cao, Wenhan ^{[1
,2
]}

Pan, Wei ^{[1
]}

机构：

[1] Department of Computer Science, University of Manchester, United Kingdom

[2] School of Vehicle and Mobility, Tsinghua University, China

来源：

12th International Conference on Learning Representations, ICLR 2024 | 2024年

关键词：

Compendex;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Reinforcement learning

引用

共 50 条

[41] An optimal iterative learning control for continuous-time systems
Nasiri, Mohammad Reza
IECON 2006 - 32ND ANNUAL CONFERENCE ON IEEE INDUSTRIAL ELECTRONICS, VOLS 1-11, 2006, : 675 - 680
[42] Computation of LQ Control for Continuous-Time Bimodal Switched Linear Systems
Hara, Naoyuki
Konishi, Keiji
2019 AMERICAN CONTROL CONFERENCE (ACC), 2019, : 4955 - 4960
[43] Linear Quadratic Tracking Control of Partially-Unknown Continuous-Time Systems Using Reinforcement Learning
Modares, Hamidreza
Lewis, Frank L.
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2014, 59 (11) : 3051 - 3056
[44] Continuous-Time Reinforcement Learning Control: A Review of Theoretical Results, Insights on Performance, and Needs for New Designs
Wallace, Brent A.
Si, Jennie
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (08) : 10199 - 10219
[45] Optimal control for continuous-time Markov jump singularly perturbed systems : A hybrid reinforcement learning scheme
Huang, Yaling
Li, Wenqian
Wang, Yun
Shen, Hao
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2024, 361 (07):
[46] Robust safe reinforcement learning control of unknown continuous-time nonlinear systems with state constraints and disturbances
Zhang, Haoran
Zhao, Chunhui
Ding, Jinliang
JOURNAL OF PROCESS CONTROL, 2023, 128
[47] Finite-horizon optimal control for continuous-time uncertain nonlinear systems using reinforcement learning
Zhao, Jingang
Gan, Minggang
INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2020, 51 (13) : 2429 - 2440
[48] Learning continuous-time working memory tasks with on-policy neural reinforcement learning
Zambrano, Davide
Roelfsema, Pieter R.
Bohte, Sander
NEUROCOMPUTING, 2021, 461 : 635 - 656
[49] Strong accessibility and integral manifolds of the continuous-time nonlinear control systems
Wyrwas, Malgorzata
JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2019, 469 (02) : 935 - 959
[50] Computation of the continuous-time PAR of an OFDM signal
Yu, H
Wei, G
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PROCEEDINGS: SIGNAL PROCESSING FOR COMMUNICATIONS SPECIAL SESSIONS, 2003, : 529 - 531

← 1 2 3 4 5 →