A novel sim2real reinforcement learning algorithm for process control

被引：1

作者：

Liang, Huiping ^{[1
,2
]}

Xie, Junyao ^{[2
]}

Huang, Biao ^{[2
]}

Li, Yonggang ^{[1
,3
]}

Sun, Bei ^{[1
,3
]}

Yang, Chunhua ^{[1
]}

机构：

[1] Cent South Univ, Sch Automat, Changsha 410083, Peoples R China

[2] Univ Alberta, Dept Chem & Mat Engn, Edmonton, AB T6G 2V4, Canada

[3] Peng Cheng Lab, Shenzhen 518000, Peoples R China

来源：

RELIABILITY ENGINEERING & SYSTEM SAFETY | 2025年 / 254卷

基金：

中国国家自然科学基金;

关键词：

Reinforcement learning; Process control; Model-plant mismatch; Fix-horizon return; Industrial roasting process;

D O I：

10.1016/j.ress.2024.110639

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

While reinforcement learning (RL) has potential in advanced process control and optimization, its direct interaction with real industrial processes can pose safety concerns. Model-based pre-training of RL may alleviate such risks. However, the intricate nature of industrial processes complicates the establishment of entirely accurate simulation models. Consequently, RL-based controllers relying on simulation models can easily suffer from model-plant mismatch. On the one hand, utilizing offline data for pre-training of RL can also mitigate safety risks. However, it requires well-represented historical datasets. This is demanding because industrial processes mostly run under a regulatory mode with basic controllers. To handle these issues, this paper proposes a novel sim2real reinforcement learning algorithm. First, a state adaptor (SA) is proposed to align simulated states with real states to mitigate the model-plant mismatch. Then, a fix-horizon return is designed to replace traditional infinite-step return to provide genuine labels for the critic network, enhancing learning efficiency and stability. Finally, applying proximal policy optimization (PPO), the SA-PPO method is introduced to implement the proposed sim2real algorithm. Experimental results show that SA-PPO improves performance in MSE by 1.96% and in R by 21.64% on average for roasting process simulation. This verifies the effectiveness of the proposed method.

引用

页数：12

共 50 条

[21] Using digital twin to enhance Sim2real transfer for reinforcement learning in 3C assembly
Mu, Weiwen
Chen, Wenbai
Zhou, Huaidong
Liu, Naijun
Shi, Haobin
Li, Jingchen
INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2024, 51 (01): : 125 - 133
[22] Sim2Real Viewpoint Invariant Visual Servoing by Recurrent Control
Sadeghi, Fereshteh
Toshev, Alexander
Jang, Eric
Levine, Sergey
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4691 - 4699
[23] Parallel Learning: Overview and Perspective for Computational Learning Across Syn2Real and Sim2Real
Miao, Qinghai
Lv, Yisheng
Huang, Min
Wang, Xiao
Wang, Fei-Yue
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2023, 10 (03) : 603 - 631
[24] Parallel Learning: Overview and Perspective for Computational Learning Across Syn2Real and Sim2Real
Qinghai Miao
Yisheng Lv
Min Huang
Xiao Wang
Fei-Yue Wang
IEEE/CAA Journal of Automatica Sinica, 2023, 10 (03) : 603 - 631
[25] Sim2real Learning of Obstacle Avoidance for Robotic Manipulators in Uncertain Environments
Zhang, Tan
Zhang, Kefang
Lin, Jiatao
Louie, Wing-Yue Geoffrey
Huang, Hui
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (01): : 65 - 72
[26] A platform-agnostic deep reinforcement learning framework for effective Sim2Real transfer towards autonomous driving
Dianzhao Li
Ostap Okhrin
Communications Engineering, 3 (1):
[27] Sim2Real in Endoscopy Segmentation with a Novel Structure Aware Image Translation
Tomasini, Clara
Riazuelo, Luis
Murillo, Ana C.
SIMULATION AND SYNTHESIS IN MEDICAL IMAGING, SASHIMI 2024, 2025, 15187 : 89 - 101
[28] Flying Through a Narrow Gap Using End-to-End Deep Reinforcement Learning Augmented With Curriculum Learning and Sim2Real
Xiao, Chenxi
Lu, Peng
He, Qizhi
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (05) : 2701 - 2708
[29] Longitudinal deep truck: Deep longitudinal model with application to sim2real deep reinforcement learning for heavy-duty truck control in the field
Albeaik, Saleh
Wu, Trevor
Vurimi, Ganeshnikhil
Chou, Fang-Chieh
Lu, Xiao-Yun
Bayen, Alexandre M.
JOURNAL OF FIELD ROBOTICS, 2023, 40 (02) : 306 - 329
[30] Sim2real for Autonomous Vehicle Control using Executable Digital Twin
Allamaa, Jean Pierre
Patrinos, Panagiotis
Van der Auweraer, Herman
Son, Tong Duy
IFAC PAPERSONLINE, 2022, 55 (24): : 385 - 391

← 1 2 3 4 5 →