Model Predictive Control-Based Value Estimation for Efficient Reinforcement Learning

被引:0
|
作者
Wu, Qizhen [1 ]
Liu, Kexin [1 ]
Chen, Lei [2 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
[2] Beijing Inst Technol, Adv Res Inst Multidisciplinary Sci, Beijing 100081, Peoples R China
基金
国家重点研发计划; 美国国家科学基金会;
关键词
Predictive models; Neural networks; Trajectory; Data models; Computational modeling; Training; Optimization; Reinforcement learning; Predictive control;
D O I
10.1109/MIS.2024.3386204
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning (RL) suffers from limitations in real practices primarily due to the number of required interactions with virtual environments. It results in a challenging problem because we are implausible to obtain a local optimal strategy with only a few attempts for many learning methods. Hereby, we design an improved RL method based on model predictive control that models the environment through a data-driven approach. Based on the learned environment model, it performs multistep prediction to estimate the value function and optimize the policy. The method demonstrates higher learning efficiency, faster convergent speed of strategies tending to the local optimal value, and less sample capacity space required by experience replay buffers. Experimental results, both in classic databases and in a dynamic obstacle-avoidance scenario for an unmanned aerial vehicle, validate the proposed approaches.
引用
收藏
页码:63 / 72
页数:10
相关论文
共 50 条
  • [21] Model Predictive Control-based Thermal Comfort and Energy Optimization
    Boodi, Abhinandana
    Beddiar, Karim
    Amirat, Yassine
    Benbouzid, Mohamed
    [J]. 45TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY (IECON 2019), 2019, : 5801 - 5806
  • [22] Model predictive control-based robust stabilization of a chemical reactor
    Monika Bakošová
    Juraj Oravec
    Katarína Matejičková
    [J]. Chemical Papers, 2013, 67 : 1146 - 1156
  • [23] Model Predictive Control-Based Fast Charging for Vehicular Batteries
    Yan, Jingyu
    Xu, Guoqing
    Qian, Huihuan
    Xu, Yangsheng
    Song, Zhibin
    [J]. ENERGIES, 2011, 4 (08) : 1178 - 1196
  • [24] Model Predictive Control-based Optimal Coordination of Distributed Energy
    Mayhorn, Ebony
    Kalsi, Karanjit
    Lian, Jianming
    Elizondo, Marcelo
    [J]. PROCEEDINGS OF THE 46TH ANNUAL HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES, 2013, : 2237 - 2244
  • [25] Relations between Model Predictive Control and Reinforcement Learning
    Goerges, Daniel
    [J]. IFAC PAPERSONLINE, 2017, 50 (01): : 4920 - 4928
  • [26] Reinforcement Learning of the Prediction Horizon in Model Predictive Control
    Bohn, Eivind
    Gros, Sebastien
    Moe, Signe
    Johansen, Tor Arne
    [J]. IFAC PAPERSONLINE, 2021, 54 (06): : 314 - 320
  • [27] Distributed Nonlinear Model Predictive Control and Reinforcement Learning
    Saeed, Ifrah
    Alpcan, Tansu
    Erfani, Sarah M.
    Yilmaz, M. Berkay
    [J]. 2019 AUSTRALIAN & NEW ZEALAND CONTROL CONFERENCE (ANZCC), 2019, : 255 - 257
  • [28] MODEL-FREE PREDICTIVE CONTROL OF NONLINEAR PROCESSES BASED ON REINFORCEMENT LEARNING
    Shah, Hitesh
    Gopal, M.
    [J]. IFAC PAPERSONLINE, 2016, 49 (01): : 89 - 94
  • [29] Multi-step Greedy Reinforcement Learning Based on Model Predictive Control
    Yang, Yucheng
    Lucia, Sergio
    [J]. IFAC PAPERSONLINE, 2021, 54 (03): : 699 - 705
  • [30] Adaptive parameterized model predictive control based on reinforcement learning: A synthesis framework
    Sun, Dingshan
    Jamshidnejad, Anahita
    De Schutter, Bart
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 136