Delay-aware model-based reinforcement learning for continuous control

被引:30
|
作者
Chen, Baiming [1 ]
Xu, Mengdi [2 ]
Li, Liang [1 ]
Zhao, Ding [2 ]
机构
[1] Tsinghua Univ, Beijing 100084, Peoples R China
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
关键词
Model-based reinforcement learning; Markov decision process; Continuous control; Delayed system; FINITE SPECTRUM ASSIGNMENT; DEEP NEURAL-NETWORKS; SMITH PREDICTOR; SYSTEMS; INTEGRATOR; STABILITY; ROBOT;
D O I
10.1016/j.neucom.2021.04.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Action delays degrade the performance of reinforcement learning in many real-world systems. This paper proposes a formal definition of delay-aware Markov Decision Process and proves it can be transformed into standard MDP with augmented states using the Markov reward process. We develop a delay-aware model-based reinforcement learning framework that can incorporate the multi-step delay into the learned system models without learning effort. Experiments with the Gym and MuJoCo platforms show that the proposed delay-aware model-based algorithm is more efficient in training and transferable between systems with various durations of delay compared with state-of-the-art model-free reinforce-ment learning methods. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:119 / 128
页数:10
相关论文
共 50 条
  • [41] Uncertainty-Aware Contact-Safe Model-Based Reinforcement Learning
    Kuo, Cheng-Yu
    Schaarschmidt, Andreas
    Cui, Yunduan
    Asfour, Tamim
    Matsubara, Takamitsu
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02) : 3918 - 3925
  • [42] Cognitive Control Predicts Use of Model-based Reinforcement Learning
    Otto, A. Ross
    Skatova, Anya
    Madlon-Kay, Seth
    Daw, Nathaniel D.
    JOURNAL OF COGNITIVE NEUROSCIENCE, 2015, 27 (02) : 319 - 333
  • [43] Model-based hierarchical reinforcement learning and human action control
    Botvinick, Matthew
    Weinstein, Ari
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2014, 369 (1655)
  • [44] Advances in model-based reinforcement learning for Adaptive Optics control
    Nousiainen, Jalo
    Engler, Byron
    Kasper, Markus
    Helin, Tapio
    Heritier, Cedric T.
    Rajani, Chang
    ADAPTIVE OPTICS SYSTEMS VIII, 2022, 12185
  • [45] Adaptive optics control using model-based reinforcement learning
    Nousiainen, Jalo
    Rajani, Chang
    Kasper, Markus
    Helin, Tapio
    OPTICS EXPRESS, 2021, 29 (10) : 15327 - 15344
  • [46] Delay-aware decoupling of multivariable control systems with observers
    Angermann, A
    Schroeder, D
    CCA 2003: PROCEEDINGS OF 2003 IEEE CONFERENCE ON CONTROL APPLICATIONS, VOLS 1 AND 2, 2003, : 1082 - 1087
  • [47] Optimal Delay Assignment in Delay-Aware Control of Cyber-Physical Systems: A Machine Learning Approach
    Pauli, Patricia
    Dibaji, Seyed Mehran
    Annaswamy, Anuradha M.
    Chakrabortty, Aranya
    2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 4583 - 4588
  • [48] Lyapunov-guided deep reinforcement learning for delay-aware online task offloading in MEC systems
    Dai, Longbao
    Mei, Jing
    Yang, Zhibang
    Tong, Zhao
    Zeng, Cuibin
    Li, Keqin
    JOURNAL OF SYSTEMS ARCHITECTURE, 2024, 153
  • [49] Delay-Aware Stochastic Resource Management for Mobile Edge Computing Systems via Constrained Reinforcement Learning
    Tian, Chang
    Liu, An
    Luo, Wu
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2021, 10 (12) : 2708 - 2712
  • [50] Delay-Aware Optimization of Fine-Grained Microservice Deployment and Routing in Edge via Reinforcement Learning
    Peng, Kai
    He, Jintao
    Guo, Jialu
    Liu, Yuan
    He, Jianwen
    Liu, Wei
    Hu, Menglan
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (06): : 6024 - 6037