Model Predictive Control-Based Value Estimation for Efficient Reinforcement Learning

被引:0
|
作者
Wu, Qizhen [1 ]
Liu, Kexin [1 ]
Chen, Lei [2 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
[2] Beijing Inst Technol, Adv Res Inst Multidisciplinary Sci, Beijing 100081, Peoples R China
基金
美国国家科学基金会; 国家重点研发计划;
关键词
Predictive models; Neural networks; Trajectory; Data models; Computational modeling; Training; Optimization; Reinforcement learning; Predictive control;
D O I
10.1109/MIS.2024.3386204
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning (RL) suffers from limitations in real practices primarily due to the number of required interactions with virtual environments. It results in a challenging problem because we are implausible to obtain a local optimal strategy with only a few attempts for many learning methods. Hereby, we design an improved RL method based on model predictive control that models the environment through a data-driven approach. Based on the learned environment model, it performs multistep prediction to estimate the value function and optimize the policy. The method demonstrates higher learning efficiency, faster convergent speed of strategies tending to the local optimal value, and less sample capacity space required by experience replay buffers. Experimental results, both in classic databases and in a dynamic obstacle-avoidance scenario for an unmanned aerial vehicle, validate the proposed approaches.
引用
收藏
页码:63 / 72
页数:10
相关论文
共 50 条
  • [1] Model Predictive Control-Based Reinforcement Learning
    Han, Qiang
    Boussaid, Farid
    Bennamoun, Mohammed
    2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
  • [2] Model Predictive Control-Based Reinforcement Learning Using Expected Sarsa
    Moradimaryamnegari, Hoomaan
    Frego, Marco
    Peer, Angelika
    IEEE ACCESS, 2022, 10 : 81177 - 81191
  • [3] Energy management in residential microgrid using model predictive control-based reinforcement learning and Shapley value
    Cai, Wenqi
    Kordabad, Arash Bahari
    Gros, Sebastien
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 119
  • [4] Filtered Probabilistic Model Predictive Control-Based Reinforcement Learning for Unmanned Surface Vehicles
    Cui, Yunduan
    Peng, Lei
    Li, Huiyun
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (10) : 6950 - 6961
  • [5] Autonomous boat driving system using sample-efficient model predictive control-based reinforcement learning approach
    Cui, Yunduan
    Osaki, Shigeki
    Matsubara, Takamitsu
    JOURNAL OF FIELD ROBOTICS, 2021, 38 (03) : 331 - 354
  • [6] An efficient Model Predictive Control-based motion cueing algorithm for the driving simulator
    Fang, Zhou
    Kemeny, Andras
    SIMULATION-TRANSACTIONS OF THE SOCIETY FOR MODELING AND SIMULATION INTERNATIONAL, 2016, 92 (11): : 1025 - 1033
  • [7] Data-Efficient Reinforcement Learning with Probabilistic Model Predictive Control
    Kamthe, Sanket
    Deisenroth, Marc Peter
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
  • [8] Reinforcement Learning Boat Autopilot: A Sample-efficient and Model Predictive Control based Approach
    Cui, Yunduan
    Osaki, Shigeki
    Matsubara, Takamitsu
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 2868 - 2875
  • [9] Model Predictive Control of Quadruped Robot Based on Reinforcement Learning
    Zhang, Zhitong
    Chang, Xu
    Ma, Hongxu
    An, Honglei
    Lang, Lin
    APPLIED SCIENCES-BASEL, 2023, 13 (01):
  • [10] Markov Decision Process Framework for Control-Based Reinforcement Learning
    Lu Y.
    Squillante M.S.
    Wah Wu C.
    Performance Evaluation Review, 2023, 51 (02): : 39 - 41