A novel sim2real reinforcement learning algorithm for process control

被引:1
|
作者
Liang, Huiping [1 ,2 ]
Xie, Junyao [2 ]
Huang, Biao [2 ]
Li, Yonggang [1 ,3 ]
Sun, Bei [1 ,3 ]
Yang, Chunhua [1 ]
机构
[1] Cent South Univ, Sch Automat, Changsha 410083, Peoples R China
[2] Univ Alberta, Dept Chem & Mat Engn, Edmonton, AB T6G 2V4, Canada
[3] Peng Cheng Lab, Shenzhen 518000, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Process control; Model-plant mismatch; Fix-horizon return; Industrial roasting process;
D O I
10.1016/j.ress.2024.110639
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
While reinforcement learning (RL) has potential in advanced process control and optimization, its direct interaction with real industrial processes can pose safety concerns. Model-based pre-training of RL may alleviate such risks. However, the intricate nature of industrial processes complicates the establishment of entirely accurate simulation models. Consequently, RL-based controllers relying on simulation models can easily suffer from model-plant mismatch. On the one hand, utilizing offline data for pre-training of RL can also mitigate safety risks. However, it requires well-represented historical datasets. This is demanding because industrial processes mostly run under a regulatory mode with basic controllers. To handle these issues, this paper proposes a novel sim2real reinforcement learning algorithm. First, a state adaptor (SA) is proposed to align simulated states with real states to mitigate the model-plant mismatch. Then, a fix-horizon return is designed to replace traditional infinite-step return to provide genuine labels for the critic network, enhancing learning efficiency and stability. Finally, applying proximal policy optimization (PPO), the SA-PPO method is introduced to implement the proposed sim2real algorithm. Experimental results show that SA-PPO improves performance in MSE by 1.96% and in R by 21.64% on average for roasting process simulation. This verifies the effectiveness of the proposed method.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Using digital twin to enhance Sim2real transfer for reinforcement learning in 3C assembly
    Mu, Weiwen
    Chen, Wenbai
    Zhou, Huaidong
    Liu, Naijun
    Shi, Haobin
    Li, Jingchen
    INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2024, 51 (01): : 125 - 133
  • [22] Sim2Real Viewpoint Invariant Visual Servoing by Recurrent Control
    Sadeghi, Fereshteh
    Toshev, Alexander
    Jang, Eric
    Levine, Sergey
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4691 - 4699
  • [23] Parallel Learning: Overview and Perspective for Computational Learning Across Syn2Real and Sim2Real
    Miao, Qinghai
    Lv, Yisheng
    Huang, Min
    Wang, Xiao
    Wang, Fei-Yue
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2023, 10 (03) : 603 - 631
  • [24] Parallel Learning: Overview and Perspective for Computational Learning Across Syn2Real and Sim2Real
    Qinghai Miao
    Yisheng Lv
    Min Huang
    Xiao Wang
    Fei-Yue Wang
    IEEE/CAA Journal of Automatica Sinica, 2023, 10 (03) : 603 - 631
  • [25] Sim2real Learning of Obstacle Avoidance for Robotic Manipulators in Uncertain Environments
    Zhang, Tan
    Zhang, Kefang
    Lin, Jiatao
    Louie, Wing-Yue Geoffrey
    Huang, Hui
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (01): : 65 - 72
  • [26] A platform-agnostic deep reinforcement learning framework for effective Sim2Real transfer towards autonomous driving
    Dianzhao Li
    Ostap Okhrin
    Communications Engineering, 3 (1):
  • [27] Sim2Real in Endoscopy Segmentation with a Novel Structure Aware Image Translation
    Tomasini, Clara
    Riazuelo, Luis
    Murillo, Ana C.
    SIMULATION AND SYNTHESIS IN MEDICAL IMAGING, SASHIMI 2024, 2025, 15187 : 89 - 101
  • [28] Flying Through a Narrow Gap Using End-to-End Deep Reinforcement Learning Augmented With Curriculum Learning and Sim2Real
    Xiao, Chenxi
    Lu, Peng
    He, Qizhi
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (05) : 2701 - 2708
  • [29] Longitudinal deep truck: Deep longitudinal model with application to sim2real deep reinforcement learning for heavy-duty truck control in the field
    Albeaik, Saleh
    Wu, Trevor
    Vurimi, Ganeshnikhil
    Chou, Fang-Chieh
    Lu, Xiao-Yun
    Bayen, Alexandre M.
    JOURNAL OF FIELD ROBOTICS, 2023, 40 (02) : 306 - 329
  • [30] Sim2real for Autonomous Vehicle Control using Executable Digital Twin
    Allamaa, Jean Pierre
    Patrinos, Panagiotis
    Van der Auweraer, Herman
    Son, Tong Duy
    IFAC PAPERSONLINE, 2022, 55 (24): : 385 - 391