Model Predictive Control-Based Value Estimation for Efficient Reinforcement Learning

Authors
Wu, Qizhen [1]
Liu, Kexin [1]
Chen, Lei [2]
Affiliations
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
[2] Beijing Inst Technol, Adv Res Inst Multidisciplinary Sci, Beijing 100081, Peoples R China
Funding
National Key Research and Development Program of China; National Science Foundation (USA)
Keywords
Predictive models; Neural networks; Trajectory; Data models; Computational modeling; Training; Optimization; Reinforcement learning; Predictive control
DOI
10.1109/MIS.2024.3386204
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Reinforcement learning (RL) is limited in practice primarily by the number of interactions it requires with virtual environments. This is a challenging problem because, for many learning methods, it is implausible to obtain even a locally optimal strategy from only a few attempts. We therefore design an improved RL method based on model predictive control that models the environment through a data-driven approach. Using the learned environment model, the method performs multistep prediction to estimate the value function and optimize the policy. It demonstrates higher learning efficiency, faster convergence of strategies toward the local optimum, and a smaller sample capacity required of the experience replay buffer. Experimental results, both on classic databases and in a dynamic obstacle-avoidance scenario for an unmanned aerial vehicle, validate the proposed approach.
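The abstract describes fitting an environment model from data and using multistep prediction through that model to estimate the value function. The sketch below is one generic way to illustrate that idea and is not taken from the paper: the scalar linear model, the function names `fit_linear_model` and `multistep_value`, the quadratic reward, and the toy policy are all assumptions chosen only to keep the example small.

```python
from typing import Callable, List, Tuple

State = float
Action = float
Transition = Tuple[State, Action, State]  # (s, a, s_next)


def fit_linear_model(transitions: List[Transition]) -> Callable[[State, Action], State]:
    """Fit s_next ~ s + b * a by least squares.

    Hypothetical stand-in for a data-driven environment model; the paper's
    abstract mentions neural networks, which are not reproduced here.
    """
    num = sum(a * (s_next - s) for s, a, s_next in transitions)
    den = sum(a * a for _, a, _ in transitions)
    b = num / den if den else 0.0
    return lambda s, a: s + b * a


def multistep_value(
    state: State,
    policy: Callable[[State], Action],
    model: Callable[[State, Action], State],
    reward: Callable[[State, Action], float],
    terminal_value: Callable[[State], float],
    horizon: int,
    gamma: float = 0.99,
) -> float:
    """Estimate V(state) by rolling the learned model forward `horizon` steps.

    Returns sum_{k<H} gamma^k * r_k + gamma^H * terminal_value(s_H).
    """
    total, discount, s = 0.0, 1.0, state
    for _ in range(horizon):
        a = policy(s)
        total += discount * reward(s, a)
        s = model(s, a)  # predicted next state, no real environment step
        discount *= gamma
    return total + discount * terminal_value(s)


# Toy data generated by the (assumed) true dynamics s_next = s + 0.5 * a.
data = [(0.0, 1.0, 0.5), (1.0, -1.0, 0.5), (2.0, 2.0, 3.0)]
model = fit_linear_model(data)

estimate = multistep_value(
    state=1.0,
    policy=lambda s: -s,           # assumed proportional policy
    model=model,
    reward=lambda s, a: -s * s,    # assumed quadratic state cost
    terminal_value=lambda s: 0.0,  # assumed zero bootstrap value
    horizon=3,
    gamma=1.0,
)
print(estimate)
```

Because every step of the rollout uses the fitted model, the value estimate costs no additional interactions with the real environment, which is the source of the sample-efficiency argument made in the abstract. In practice the terminal value would typically come from a learned critic instead of the zero function assumed here.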
Pages: 63-72
Number of pages: 10