Control Delay in Reinforcement Learning for Real-Time Dynamic Systems: A Memoryless Approach

被引：24

作者：

Schuitema, Erik ^{[1
]}

Busoniu, Lucian

Babuska, Robert ^{[2
]}

Jonker, Pieter ^{[1
]}

机构：

[1] Delft Univ Technol, Delft Biorobot Lab, Mekelweg 2, NL-2628 CD Delft, Netherlands

[2] Delft Univ Technol, Delft Ctr Syst & Control, NL-2628 CD Delft, Netherlands

来源：

IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010) | 2010年

关键词：

D O I：

10.1109/IROS.2010.5650345

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Robots controlled by Reinforcement Learning (RL) are still rare. A core challenge to the application of RL to robotic systems is to learn despite the existence of control delay - the delay between measuring a system's state and acting upon it. Control delay is always present in real systems. In this work, we present two novel temporal difference (TD) learning algorithms for problems with control delay. These algorithms improve learning performance by taking the control delay into account. We test our algorithms in a gridworld, where the delay is an integer multiple of the time step, as well as in the simulation of a robotic system, where the delay can have any value. In both tests, our proposed algorithms outperform classical TD learning algorithms, while maintaining low computational complexity.

引用

页码：3226 / 3231

页数：6

共 50 条

[1] Real-time measurement-driven reinforcement learning control approach for uncertain nonlinear systems
Abouheaf, Mohamed
Boase, Derek
Gueaieb, Wail
Spinello, Davide
Al-Sharhan, Salah
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 122
[2] Integration of Adaptive Control and Reinforcement Learning for Real-Time Control and Learning
Annaswamy, Anuradha M.
Guha, Anubhav
Cui, Yingnan
Tang, Sunbochen
Fisher, Peter A.
Gaudio, Joseph E.
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (12) : 7740 - 7755
[3] Experience Replay for Real-Time Reinforcement Learning Control
Adam, Sander
Busoniu, Lucian
Babuska, Robert
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (02): : 201 - 212
[4] Real-time Obstacle Avoidance for AUV Based on Reinforcement Learning and Dynamic Window Approach
Shen, Yue
Xu, Han
Wang, Dianrui
Zhang, Yixiao
Yan, Tianhong
He, Bo
[J]. GLOBAL OCEANS 2020: SINGAPORE - U.S. GULF COAST, 2020,
[5] Real-Time Reinforcement Learning
Ramstedt, Simon
Pal, Christopher
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[6] A reinforcement learning-based approach for online optimal control of self-adaptive real-time systems
Bakhta Haouari
Rania Mzid
Olfa Mosbahi
[J]. Neural Computing and Applications, 2023, 35 : 20375 - 20401
[7] A delay-robust method for enhanced real-time reinforcement learning
[J]. Wang, Xueqian (wang.xq@sz.tsinghua.edu.cn), 2025, 181
[8] A reinforcement learning-based approach for online optimal control of self-adaptive real-time systems
Haouari, Bakhta
Mzid, Rania
Mosbahi, Olfa
[J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (27): : 20375 - 20401
[9] Real-time Construction and Control on Dynamic Systems
Liu, Hua
Wang, Zhaoyang
[J]. REAL-TIME PHOTONIC MEASUREMENTS, DATA MANAGEMENT, AND PROCESSING VI, 2021, 11902
[10] Dynamic Resource Allocation for Real-Time Cloud XR Video Transmission: A Reinforcement Learning Approach
Wang, Zhaocheng
Wang, Rui
Wu, Jun
Zhang, Wei
Li, Chenxi
[J]. IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2024, 10 (03) : 996 - 1010

← 1 2 3 4 5 →