Q-MPC: stable and efficient reinforcement learning using model predictive control

被引:0
|
作者
Oh, Tae Hoon [1 ]
机构
[1] Kyoto Univ, Dept Chem Engn, Kyoto, Japan
来源
IFAC PAPERSONLINE | 2023年 / 56卷 / 02期
关键词
Model predictive and optimization-based control; Reinforcement learning control; Process modeling and identification; Data-based control; Learning for control; Predictive control;
D O I
10.1016/j.ifacol.2023.10.1369
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
There is a growing interest in developing an efficient data-driven control method that can be implemented into digitized manufacturing processes. Model-free reinforcement learning (RL) is a machine learning method that can directly learn the optimal control policy from the process data. However, the model-free RL shows higher cost variance than the model-based method and may require an infeasible amount of data to learn the optimal control policy. Motivated by the fact that the system identification to linear model shows high data efficiency and stable performance, this paper proposes combining the linear model predictive control (MPC) with Q-learning. This combined scheme, Q-MPC, can improve the control performance more stably and safely. For the case study, linear MPC, Q-MPC, DDPG, TD3, and SAC methods are applied to the nonlinear benchmark system, mainly focusing on the learning speed and cost variance. Copyright (c) 2023 The Authors.
引用
收藏
页码:2727 / 2732
页数:6
相关论文
共 50 条
  • [1] Data-Efficient Reinforcement Learning with Probabilistic Model Predictive Control
    Kamthe, Sanket
    Deisenroth, Marc Peter
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
  • [2] IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control
    Chitnis, Rolian
    Xu, Yingchen
    Hashemi, Bobak
    Lehnert, Lucas
    Dogan, Crun
    Zhu, Zheqing
    Delalleau, Olivier
    2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2024), 2024, : 9154 - 9160
  • [3] Model Predictive Control-Based Value Estimation for Efficient Reinforcement Learning
    Wu, Qizhen
    Liu, Kexin
    Chen, Lei
    IEEE INTELLIGENT SYSTEMS, 2024, 39 (03) : 63 - 72
  • [4] Handling Sparse Rewards in Reinforcement Learning Using Model Predictive Control
    Dawood, Murad
    Dengler, Nils
    de Heuvel, Jorge
    Bennewitz, Maren
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 879 - 885
  • [5] Optimization of the Model Predictive Control Update Interval Using Reinforcement Learning
    Bohn, Eivind
    Gros, Sebastien
    Moe, Signe
    Johansen, Tor Arne
    IFAC PAPERSONLINE, 2021, 54 (14): : 257 - 262
  • [6] The integration of Model Predictive Control and deep Reinforcement Learning for efficient thermal control in thermoforming processes
    Hosseinionari, Hadi
    Seethaler, Rudolf
    JOURNAL OF MANUFACTURING PROCESSES, 2024, 115 : 82 - 93
  • [7] Safety Reinforced Model Predictive Control (SRMPC): Improving MPC with Reinforcement Learning for Motion Planning in Autonomous Driving
    Fischer, Johannes
    Steiner, Marlon
    Tas, Omer Sahin
    Stiller, Christoph
    2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 2811 - 2818
  • [8] Model-free Predictive Optimal Iterative Learning Control using Reinforcement Learning
    Zhang, Yueqing
    Chu, Bing
    Shu, Zhan
    2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 3279 - 3284
  • [9] Model Predictive Control Guided Reinforcement Learning Control Scheme
    Xie, Huimin
    Xu, Xinghai
    Li, Yuling
    Hong, Wenjing
    Shi, Jia
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [10] Inverse Model Predictive Control: Learning Optimal Control Cost Functions for MPC
    Zhang, Fawang
    Duan, Jingliang
    Xu, Haoyuan
    Chen, Hao
    Liu, Hui
    Nie, Shida
    Li, Shengbo Eben
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (12) : 13644 - 13655