Model Predictive Control Guided Reinforcement Learning Control Scheme

被引:10
|
作者
Xie, Huimin [1 ]
Xu, Xinghai [1 ]
Li, Yuling [2 ]
Hong, Wenjing [1 ]
Shi, Jia [1 ]
机构
[1] Xiamen Univ, Dept Chem & Biochem Engn, Xiamen, Peoples R China
[2] Imperial Coll London, Dept Earth Sci & Engn, London, England
基金
国家重点研发计划;
关键词
Deep reinforcement learning; Model predictive control; Time delay; Process control;
D O I
10.1109/ijcnn48605.2020.9207398
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep Reinforcement Learning (DRL) is an artificial intelligence technology that can complete decision-making tasks by interaction. It has been successfully applied to various games. However, there are still many challenges when this technique is applied to the industrial process control due to the low sample efficiency and the inability to deal with large time delay. In this paper, a novel Model Predictive Control (MPC) guided Reinforcement Learning Control (MP-RLC) scheme is proposed for the process control. In this scheme, Model predictive control is directly combined with Reinforcement Learning (RL) to guide the training process, thus greatly improving the sample efficiency of reinforcement learning and effectively solving the problem of time delay. The simulation results on both a third-order linear system and a nonlinear continuous stirred tank reactor (CSTR) system with large time delay demonstrate that this scheme can not only accelerate the training process but also improve the control performance, which is superior to both standalone RL and MPC schemes. The proposed approach may help to pave the way for DRL applied to industrial processes.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Iterative Learning Control (ILC) Guided Reinforcement Learning Control (RLC) Scheme for Batch Processes
    Xu, Xinghai
    Xie, Huimin
    Shi, Jia
    [J]. PROCEEDINGS OF 2020 IEEE 9TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS'20), 2020, : 241 - 246
  • [2] Reinforcement Learning versus Model Predictive Control on greenhouse climate control
    Morcego, Bernardo
    Yin, Wenjie
    Boersma, Sjoerd
    van Henten, Eldert
    Puig, Vicenc
    Sun, Congcong
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 215
  • [3] Distributed Nonlinear Model Predictive Control and Reinforcement Learning
    Saeed, Ifrah
    Alpcan, Tansu
    Erfani, Sarah M.
    Yilmaz, M. Berkay
    [J]. 2019 AUSTRALIAN & NEW ZEALAND CONTROL CONFERENCE (ANZCC), 2019, : 255 - 257
  • [4] Reinforcement Learning of the Prediction Horizon in Model Predictive Control
    Bohn, Eivind
    Gros, Sebastien
    Moe, Signe
    Johansen, Tor Arne
    [J]. IFAC PAPERSONLINE, 2021, 54 (06): : 314 - 320
  • [5] Relations between Model Predictive Control and Reinforcement Learning
    Goerges, Daniel
    [J]. IFAC PAPERSONLINE, 2017, 50 (01): : 4920 - 4928
  • [6] Model Predictive Control-Based Reinforcement Learning
    Han, Qiang
    Boussaid, Farid
    Bennamoun, Mohammed
    [J]. 2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
  • [7] Comparison of Deep Reinforcement Learning and Model Predictive Control for Adaptive Cruise Control
    Lin, Yuan
    McPhee, John
    Azad, Nasser L.
    [J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2021, 6 (02): : 221 - 231
  • [8] Reinforcement learning and model predictive control for robust embedded quadrotor guidance and control
    Colin Greatwood
    Arthur G. Richards
    [J]. Autonomous Robots, 2019, 43 : 1681 - 1693
  • [9] Reinforcement learning and model predictive control for robust embedded quadrotor guidance and control
    Greatwood, Colin
    Richards, Arthur G.
    [J]. AUTONOMOUS ROBOTS, 2019, 43 (07) : 1681 - 1693
  • [10] Newton's method for reinforcement learning and model predictive control
    Bertsekas, Dimitri
    [J]. RESULTS IN CONTROL AND OPTIMIZATION, 2022, 7