Scalable reinforcement learning for plant-wide control of vinyl acetate monomer process

被引:38
|
作者
Zhu, Lingwei [1 ]
Cui, Yunduan [1 ]
Takami, Go [2 ]
Kanokogi, Hiroaki [2 ]
Matsubara, Takamitsu [1 ]
机构
[1] Nara Inst Sci & Technol, Grad Sch Sci & Technol, Takayama Cho 8916-5, Ikoma, Nara, Japan
[2] Yokogawa Elect Corp, New Field Dev Ctr, Nakacho 2-9-32, Musashino, Tokyo, Japan
关键词
Chemical process control; Reinforcement learning; Vinyl acetate monomer; MODEL;
D O I
10.1016/j.conengprac.2020.104331
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper explores a reinforcement learning (RL) approach that designs automatic control strategies in a large-scale chemical process control scenario as the first step for leveraging an RL method to intelligently control real-world chemical plants. The huge number of units for chemical reactions as well as feeding and recycling the materials of a typical chemical process induces a vast amount of samples and subsequent prohibitive computation complexity in RL for deriving a suitable control policy due to high-dimensional state and action spaces. To tackle this problem, a novel RL algorithm: Factorial Fast-food Dynamic Policy Programming (FFDPP) is proposed. By introducing a factorial framework that efficiently factorizes the action space, Fast-food kernel approximation that alleviates the curse of dimensionality caused by the high dimensionality of state space, into Dynamic Policy Programming (DPP) that achieves stable learning even with insufficient samples. FFDPP is evaluated in a commercial chemical plant simulator for a Vinyl Acetate Monomer (VAM) process. Experimental results demonstrate that without any knowledge of the model, the proposed method successfully learned a stable policy with reasonable computation resources to produce a larger amount of VAM product with comparative performance to a state-of-the-art model-based control.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Distributed SFA-CA monitoring approach for nonstationary plant-wide process and its application on a vinyl acetate monomer process
    Huang, Jian
    Xiao, Jieshi
    Yang, Xu
    PROCESS SAFETY AND ENVIRONMENTAL PROTECTION, 2022, 162 : 1091 - 1101
  • [2] Plant-wide control of a complete ethyl acetate reactive distillation process
    Tang, YT
    Huang, HP
    Chien, IL
    JOURNAL OF CHEMICAL ENGINEERING OF JAPAN, 2005, 38 (02) : 130 - 146
  • [3] Plant-wide control of a hybrid process
    de Prada, C.
    Sarabia, D.
    Cristea, S.
    Mazaeda, R.
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2008, 22 (02) : 124 - 141
  • [4] Plant-wide control of an industrial process
    Lausch, HR
    Wozny, G
    Wutkewicz, M
    Wendeler, H
    CHEMICAL ENGINEERING RESEARCH & DESIGN, 1998, 76 (A2): : 185 - 192
  • [5] Plant-wide process control for the Collahuasi Project
    Del Castillo, A
    Gomez, P
    MINERAL PROCESSING/ENVIRONMENT, HEALTH AND SAFETY, 1999, : 51 - 66
  • [6] Design and Control of a Modified Vinyl Acetate Monomer Process
    Luyben, William L.
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2011, 50 (17) : 10136 - 10147
  • [7] Design and Control of a Plant-Wide Process for the Production of Epichlorohydrin
    Huang, Chien-Chih
    Wang, San-Jang
    Wong, David Shan-Hill
    2017 6TH INTERNATIONAL SYMPOSIUM ON ADVANCED CONTROL OF INDUSTRIAL PROCESSES (ADCONIP), 2017, : 383 - 388
  • [8] Plant-wide hierarchical optimal control of a crystallization process
    Mazaeda, Rogelio
    Cristca, Smaranda P.
    de Prada, Cesar
    IFAC PAPERSONLINE, 2015, 48 (08): : 1210 - 1215
  • [9] Plant-wide process control loop condition monitoring
    Ras Tanura Refinery Engineering Division, Saudi Aramco, Saudi Arabia
    Saudi Aramco J. Technol., 2006, FALL (53-61):
  • [10] A PLANT-WIDE INDUSTRIAL PROCESS CONTROL SECURITY PROBLEM
    McEvoy, Thomas
    Wolthusen, Stephen
    CRITICAL INFRASTRUCTURE PROTECTION V, 2011, 367 : 47 - 56