Data-driven constrained reinforcement learning for optimal control of a multistage evaporation process

被引:9
|
作者
Yao, Yao [1 ]
Ding, Jinliang [1 ]
Zhao, Chunhui [2 ]
Wang, Yonggang [3 ]
Chai, Tianyou [1 ]
机构
[1] Northeastern Univ, State Key Lab Synthet Automation Proc Ind, Shenyang 110819, Peoples R China
[2] Zhejiang Univ, Coll Control Sci & Engn, Hangzhou 310027, Peoples R China
[3] Shenyang Agr Univ, Coll Informat & Elect Engn, Shenyang 110866, Peoples R China
基金
中国国家自然科学基金;
关键词
Constrained reinforcement learning; Data-driven; Evaporation process; Optimal control; PREDICTIVE CONTROL; SYSTEM;
D O I
10.1016/j.conengprac.2022.105345
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
It is challenging for reinforcement learning to solve the optimal control problem of industrial processes with constraints under uncertain operating conditions. In this context, this paper proposes a novel data-driven constrained reinforcement learning algorithm for the optimal control of a multistage evaporation process consisting of multiple evaporators in series with coupled liquid levels. We first formulate the optimal control problem as a constrained Markov decision process. Then, with the cumulative tracking error of the outlet liquor density taken as the cumulative constraint, a Lagrangian-based constrained policy optimization is developed. The fast setpoint tracking is achieved by gradient iteration of the policy and the dual variable. An action correction layer based on the online sequential version of random vector functional-link networks is built on the output of the policy network to address the instantaneous constraints of the liquid levels. The infeasible action is corrected in real-time so as to keep the liquid levels in each evaporator within operating range. Finally, we utilize both on-policy and off-policy data generated by the interaction between the constrained policy and the evaporation environment to update our algorithm, which is more data-efficient. Experiments have been carried out on a multistage evaporation system, and the results validate the effectiveness of the proposed algorithm.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Constrained data-driven optimal iterative learning control
    Chi, Ronghu
    Liu, Xiaohe
    Zhang, Ruikun
    Hou, Zhongsheng
    Huang, Biao
    [J]. JOURNAL OF PROCESS CONTROL, 2017, 55 : 10 - 29
  • [2] Data-Driven Flotation Industrial Process Operational Optimal Control Based on Reinforcement Learning
    Jiang, Yi
    Fan, Jialu
    Chai, Tianyou
    Li, Jinna
    Lewis, Frank L.
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (05) : 1974 - 1989
  • [3] Data-driven constrained reinforcement learning algorithm for path tracking control of hovercraft
    Wang, Yuanhui
    Zhou, Hua
    [J]. OCEAN ENGINEERING, 2024, 307
  • [4] Reinforcement Learning based Data-driven Optimal Control Strategy for Systems with Disturbance
    Fan, Zhong-Xin
    Li, Shihua
    Liu, Rongjie
    [J]. 2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 567 - 572
  • [5] Control of batch pulping process using data-driven constrained iterative learning control
    Shibani, B.
    Ambure, Prathmesh
    Purohit, Amit
    Suratia, Preetsinh
    Bhartiya, Sharad
    [J]. COMPUTERS & CHEMICAL ENGINEERING, 2023, 170
  • [6] Data-Driven Robust Control Using Reinforcement Learning
    Ngo, Phuong D.
    Tejedor, Miguel
    Godtliebsen, Fred
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (04):
  • [7] Data-Driven Control of Hydraulic Manipulators by Reinforcement Learning
    Yao, Zhikai
    Xu, Fengyu
    Jiang, Guo-Ping
    Yao, Jianyong
    [J]. IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2024, 29 (04) : 2673 - 2684
  • [8] Data-driven optimal control of wind turbines using reinforcement learning with function approximation
    Peng, Shenglin
    Feng, Qianmei
    [J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2023, 176
  • [9] Data-Driven Nearly Optimal Control for Constrained Nonlinear Systems
    Yang, Xiong
    [J]. PROCEEDINGS OF 2020 IEEE 9TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS'20), 2020, : 105 - 110
  • [10] Reinforcement learning as data-driven optimization technique for GMAW process
    Giulio Mattera
    Alessandra Caggiano
    Luigi Nele
    [J]. Welding in the World, 2024, 68 : 805 - 817