An accelerated asynchronous advantage actor-critic algorithm applied in papermaking

Authors
Wang, Xuechun [1]
Zhuang, Zhiwei [1]
Zou, Luobao [1]
Zhang, Weidong [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China
Keywords
Basis weight and moisture control; Reinforcement learning; Asynchronous advantage actor-critic; Eligibility trace
DOI
10.23919/chicc.2019.8866243
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Basis weight and moisture control has long been a difficult problem in papermaking because the system is multivariable and strongly coupled, with large time lags and large inertia. This paper introduces a reinforcement learning framework to address it. Building on previous work, we improve the Markov Decision Process (MDP) formulation in our model. We also propose a new form of eligibility traces that reuses data to speed up learning, with the aim of reducing the high cost of online training on industrial processes, and we illustrate it with the asynchronous advantage actor-critic (A3C) algorithm. Simulation results show that the accelerated A3C is effective on this system and learns faster.
Pages: 8637-8642
Page count: 6
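
For readers unfamiliar with eligibility traces, the following is a minimal sketch of the textbook accumulating trace in semi-gradient TD(lambda) with a linear critic. It is not the new trace form proposed in the paper, which this record does not describe. The function name `td_lambda_episode`, the one-hot features, and the three-state chain are all assumptions made for illustration.

```python
import numpy as np


def td_lambda_episode(features, rewards, w, alpha=0.1, gamma=0.9, lam=0.8):
    """One pass of semi-gradient TD(lambda) with an accumulating trace.

    features: shape (T + 1, d), feature vector of each visited state;
              the last row is the terminal state and should be all zeros.
    rewards:  shape (T,), reward received after each transition.
    w:        shape (d,), linear critic weights; an updated copy is returned.
    """
    w = w.copy()
    z = np.zeros_like(w)  # eligibility trace, one entry per weight
    for t in range(len(rewards)):
        x, x_next = features[t], features[t + 1]
        delta = rewards[t] + gamma * (w @ x_next) - (w @ x)  # TD error
        z = gamma * lam * z + x  # decay old credit, add the current gradient
        w = w + alpha * delta * z  # every recently visited state shares the update
    return w


# Hypothetical 3-state chain: one-hot features, reward 1 on the final transition.
features = np.vstack([np.eye(3), np.zeros(3)])
rewards = np.array([0.0, 0.0, 1.0])
w0 = np.zeros(3)

w_trace = td_lambda_episode(features, rewards, w0, lam=0.8)
w_td0 = td_lambda_episode(features, rewards, w0, lam=0.0)
print(w_trace)
print(w_td0)
```

With `lam=0.0` only the last state's value changes after this single episode, whereas with `lam=0.8` the same reward also updates the two earlier states. This is the general sense in which traces let one transition inform several estimates and so speed up learning.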