Reinforcement Learning With Model-Based Assistance for Shape Control in Sendzimir Rolling Mills

被引:3
|
作者
Park, Jonghyuk [1 ]
Kim, Beomsu [1 ]
Han, Soohee [1 ]
机构
[1] Pohang Univ Sci & Technol, Dept Convergence IT Engn, Pohang 37673, South Korea
基金
新加坡国家研究基金会;
关键词
Actor-critic policy gradient; cold rolling mill; partially observable Markov decision process (MDP); reinforcement learning; Sendzimir rolling mill (ZRM); CONTROL-SYSTEMS; IMPROVEMENT;
D O I
10.1109/TCST.2022.3227502
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As one of the most popular tandem cold rolling mills, the Sendzimir rolling mill (ZRM) aims to obtain a flat steel strip shape by properly allocating the rolling pressure. To improve the performance of the ZRM, it is meaningful to adopt recently emerging deep reinforcement learning (DRL) that is powerful for difficult-to-solve and challenging problems. However, the direct application of DRL techniques may be impractical because of a serious singularity, partial observability, and even safety issues inherent in mill systems. In this brief, we propose an effective hybridization approach that integrates a model-based assistant into model-free DRL to resolve such practical issues. For the model-based assistant, a model-based optimization problem is first constructed and solved for the static part of the mill model. Then, the obtained static model-based coarse assistant, or controller, is improved by the proposed reinforcement learning, considering the remaining dynamic part of the mill model. The serious singularity can be resolved using the model-based approach, and the issue of partial observability is addressed by the long short-term memory (LSTM) state estimator in the proposed method. In simulation results, the proposed method successfully learns a highly performing policy for the ZRM, achieving a higher reward than pure model-free DRL. It is also observed that the proposed method can safely improve the shape controller of the mill system. The demonstration results strongly confirm the high applicability of DRL to other cold multiroll mills, such as four-high, six-high, and cluster mills.
引用
收藏
页码:1867 / 1874
页数:8
相关论文
共 50 条
  • [1] Development and Experimental Evaluation of Strip Shape Prediction Model for Sendzimir Rolling Mills
    Jong-min Shin
    Seong-ik Han
    Jong-shik Kim
    Journal of Iron and Steel Research International, 2013, 20 : 25 - 32
  • [2] Development and Experimental Evaluation of Strip Shape Prediction Model for Sendzimir Rolling Mills
    SHIN Jong-min
    HAN Seong-ik
    KIM Jong-shik
    JournalofIronandSteelResearch(International), 2013, 20 (12) : 25 - 32
  • [3] Development and Experimental Evaluation of Strip Shape Prediction Model for Sendzimir Rolling Mills
    Shin, Jong-min
    Han, Seong-ik
    Kim, Jong-shik
    JOURNAL OF IRON AND STEEL RESEARCH INTERNATIONAL, 2013, 20 (12) : 25 - 32
  • [4] Shape recognition performance analysis and improvement in Sendzimir rolling mills
    Jung, Chul Su
    Park, Jung Hyun
    Han, Seong Ik
    Kim, Jong Shik
    2012 PROCEEDINGS OF SICE ANNUAL CONFERENCE (SICE), 2012, : 1886 - 1891
  • [5] Shape control systems for Sendzimir steel mills
    Ringwood, JV
    IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2000, 8 (01) : 70 - 86
  • [6] Shape recognition performance analysis and improvement in Sendzimir rolling mills
    Jeong, Cheol Su
    Park, Jung Hyun
    Han, Seong Ik
    Kim, Jong Shik
    JOURNAL OF MECHANICAL SCIENCE AND TECHNOLOGY, 2014, 28 (04) : 1455 - 1463
  • [7] Shape Control Systems for Sendzimir Cold-rolling Steel Mills with Actuator Saturation
    Kim, Beomsu
    Han, Soohee
    2019 19TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2019), 2019, : 918 - 922
  • [8] Shape recognition performance analysis and improvement in Sendzimir rolling mills
    Cheol Su Jeong
    Jung Hyun Park
    Seong Ik Han
    Jong Shik Kim
    Journal of Mechanical Science and Technology, 2014, 28 : 1455 - 1463
  • [9] Model-Based Reinforcement Learning For Robot Control
    Li, Xiang
    Shang, Weiwei
    Cong, Shuang
    2020 5TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2020), 2020, : 300 - 305
  • [10] THE DESIGN OF STRIP SHAPE CONTROL-SYSTEMS FOR SENDZIMIR MILLS
    GRIMBLE, MJ
    FOTAKIS, J
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1982, 27 (03) : 656 - 666