Control Delay in Reinforcement Learning for Real-Time Dynamic Systems: A Memoryless Approach

被引:24
|
作者
Schuitema, Erik [1 ]
Busoniu, Lucian
Babuska, Robert [2 ]
Jonker, Pieter [1 ]
机构
[1] Delft Univ Technol, Delft Biorobot Lab, Mekelweg 2, NL-2628 CD Delft, Netherlands
[2] Delft Univ Technol, Delft Ctr Syst & Control, NL-2628 CD Delft, Netherlands
关键词
D O I
10.1109/IROS.2010.5650345
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Robots controlled by Reinforcement Learning (RL) are still rare. A core challenge to the application of RL to robotic systems is to learn despite the existence of control delay - the delay between measuring a system's state and acting upon it. Control delay is always present in real systems. In this work, we present two novel temporal difference (TD) learning algorithms for problems with control delay. These algorithms improve learning performance by taking the control delay into account. We test our algorithms in a gridworld, where the delay is an integer multiple of the time step, as well as in the simulation of a robotic system, where the delay can have any value. In both tests, our proposed algorithms outperform classical TD learning algorithms, while maintaining low computational complexity.
引用
收藏
页码:3226 / 3231
页数:6
相关论文
共 50 条
  • [1] Real-time measurement-driven reinforcement learning control approach for uncertain nonlinear systems
    Abouheaf, Mohamed
    Boase, Derek
    Gueaieb, Wail
    Spinello, Davide
    Al-Sharhan, Salah
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 122
  • [2] Integration of Adaptive Control and Reinforcement Learning for Real-Time Control and Learning
    Annaswamy, Anuradha M.
    Guha, Anubhav
    Cui, Yingnan
    Tang, Sunbochen
    Fisher, Peter A.
    Gaudio, Joseph E.
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (12) : 7740 - 7755
  • [3] Experience Replay for Real-Time Reinforcement Learning Control
    Adam, Sander
    Busoniu, Lucian
    Babuska, Robert
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (02): : 201 - 212
  • [4] Real-time Obstacle Avoidance for AUV Based on Reinforcement Learning and Dynamic Window Approach
    Shen, Yue
    Xu, Han
    Wang, Dianrui
    Zhang, Yixiao
    Yan, Tianhong
    He, Bo
    [J]. GLOBAL OCEANS 2020: SINGAPORE - U.S. GULF COAST, 2020,
  • [5] Real-Time Reinforcement Learning
    Ramstedt, Simon
    Pal, Christopher
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [6] A reinforcement learning-based approach for online optimal control of self-adaptive real-time systems
    Bakhta Haouari
    Rania Mzid
    Olfa Mosbahi
    [J]. Neural Computing and Applications, 2023, 35 : 20375 - 20401
  • [8] A reinforcement learning-based approach for online optimal control of self-adaptive real-time systems
    Haouari, Bakhta
    Mzid, Rania
    Mosbahi, Olfa
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (27): : 20375 - 20401
  • [9] Real-time Construction and Control on Dynamic Systems
    Liu, Hua
    Wang, Zhaoyang
    [J]. REAL-TIME PHOTONIC MEASUREMENTS, DATA MANAGEMENT, AND PROCESSING VI, 2021, 11902
  • [10] Dynamic Resource Allocation for Real-Time Cloud XR Video Transmission: A Reinforcement Learning Approach
    Wang, Zhaocheng
    Wang, Rui
    Wu, Jun
    Zhang, Wei
    Li, Chenxi
    [J]. IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2024, 10 (03) : 996 - 1010