Delay-aware model-based reinforcement learning for continuous control

Cited by: 30
Authors
Chen, Baiming [1]
Xu, Mengdi [2]
Li, Liang [1]
Zhao, Ding [2]
Affiliations
[1] Tsinghua Univ, Beijing 100084, Peoples R China
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
Keywords
Model-based reinforcement learning; Markov decision process; Continuous control; Delayed system; FINITE SPECTRUM ASSIGNMENT; DEEP NEURAL-NETWORKS; SMITH PREDICTOR; SYSTEMS; INTEGRATOR; STABILITY; ROBOT
DOI
10.1016/j.neucom.2021.04.015
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Action delays degrade the performance of reinforcement learning in many real-world systems. This paper proposes a formal definition of delay-aware Markov Decision Process and proves it can be transformed into standard MDP with augmented states using the Markov reward process. We develop a delay-aware model-based reinforcement learning framework that can incorporate the multi-step delay into the learned system models without learning effort. Experiments with the Gym and MuJoCo platforms show that the proposed delay-aware model-based algorithm is more efficient in training and transferable between systems with various durations of delay compared with state-of-the-art model-free reinforcement learning methods. © 2021 Elsevier B.V. All rights reserved.
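The state augmentation mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes one common construction, in which the augmented state is the current observation together with the queue of actions that have been chosen but not yet applied. The class names `ToyIntegrator` and `DelayedActionWrapper`, the toy dynamics, and the zero default action are all hypothetical.

```python
from collections import deque


class ToyIntegrator:
    """Hypothetical 1-D system: the state moves by the applied action."""

    def __init__(self):
        self.x = 0.0

    def reset(self):
        self.x = 0.0
        return self.x

    def step(self, action):
        self.x += action
        reward = -abs(self.x)  # assumed reward: stay close to the origin
        return self.x, reward


class DelayedActionWrapper:
    """Applies each chosen action `delay` steps late.

    The returned state is augmented with the pending actions, so the
    next augmented state depends only on the current augmented state
    and the newly chosen action (assumed construction, see lead-in).
    """

    def __init__(self, env, delay, default_action=0.0):
        self.env = env
        self.delay = delay
        self.default_action = default_action
        self.pending = deque()

    def reset(self):
        x = self.env.reset()
        # Before any action arrives, the queue holds default actions.
        self.pending = deque([self.default_action] * self.delay)
        return (x, tuple(self.pending))

    def step(self, action):
        self.pending.append(action)       # newest action joins the queue
        applied = self.pending.popleft()  # oldest action reaches the system
        x, reward = self.env.step(applied)
        return (x, tuple(self.pending)), reward


if __name__ == "__main__":
    env = DelayedActionWrapper(ToyIntegrator(), delay=2)
    state = env.reset()
    for a in (1.0, 2.0, 3.0):
        state, reward = env.step(a)
        print(state, reward)
```

With `delay=2`, the first action only reaches the toy system on the third step. Because the pending actions are part of the state, a dynamics model of the undelayed system plus a known delay length is enough to predict the augmented transition, which is the intuition behind handling the delay without extra learning effort.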
Pages: 119-128
Number of pages: 10
Related papers
50 in total
  • [21] Value-Aware Loss Function for Model-based Reinforcement Learning
    Farahmand, Amir-massoud
    Barreto, Andre M. S.
    Nikovski, Daniel N.
    ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 54, 2017, 54 : 1486 - 1494
  • [22] A Safety Aware Model-Based Reinforcement Learning Framework for Systems with Uncertainties
    Mahmud, S. M. Nahid
    Hareland, Katrine
    Nivison, Scott A.
    Bell, Zachary I.
    Kamalapurkar, Rushikesh
    2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 1979 - 1984
  • [23] Efficient reinforcement learning: Model-based acrobot control
    Boone, G
    1997 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION - PROCEEDINGS, VOLS 1-4, 1997, : 229 - 234
  • [24] Multiple model-based reinforcement learning for nonlinear control
    Samejima, K
    Katagiri, K
    Doya, K
    Kawato, M
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2006, 89 (09): : 54 - 69
  • [25] Offline Model-Based Reinforcement Learning for Tokamak Control
    Char, Ian
    Abbate, Joseph
    Bardoczi, Laszlo
    Boyer, Mark D.
    Chung, Youngseog
    Conlin, Rory
    Erickson, Keith
    Mehta, Viraj
    Richner, Nathan
    Kolemen, Egemen
    Schneider, Jeff
    LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211
  • [26] Delay-aware Transmission Range Control for VANETs
    Li, Jialiang
    Chigan, Chunxiao
    2010 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE GLOBECOM 2010, 2010,
  • [27] Delay-Aware Period Assignment in Control Systems
    Bini, Enrico
    Cervin, Anton
    RTSS: 2008 REAL-TIME SYSTEMS SYMPOSIUM, PROCEEDINGS, 2008, : 291 - +
  • [28] Learning to Shape by Grinding: Cutting-Surface-Aware Model-Based Reinforcement Learning
    Hachimine, Takumi
    Morimoto, Jun
    Matsubara, Takamitsu
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (10) : 6235 - 6242
  • [29] Meta-DAMS: Delay-Aware Multipath Scheduler using Hybrid Meta Reinforcement Learning
    Sepahi, Amir
    Cai, Lin
    Yang, Wenjun
    Pan, Jianping
    2023 IEEE 98TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-FALL, 2023,
  • [30] Delay-Aware Routing in Software-Defined Networks via Network Tomography and Reinforcement Learning
    Tao, Xu
    Monaco, Doriana
    Sacco, Alessio
    Silvestri, Simone
    Marchetto, Guido
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (04): : 3383 - 3397