An accelerated asynchronous advantage actor-critic algorithm applied in papermaking

被引:0
|
作者
Wang, Xuechun [1 ]
Zhuang, Zhiwei [1 ]
Zou, Luobao [1 ]
Zhang, Weidong [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China
关键词
Basis weight and moisture control; Reinforcement Learning; Asynchronous advantage actor-critic; Eligibility trace;
D O I
10.23919/chicc.2019.8866243
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The basis weight and moisture system with multivariate, strong coupling, large time lag and large inertia has always been tricky in papermaking. In this paper, a framework which applies the reinforcement learning methods is introduced to solve. Inspired by the previous work, we improve the Markov Decision Process (MDP) in our model, moreover we propose a new form of eligibility traces which can realize data multiplexing to speed up learning, aiming at remitting the costly expense of industrial process online training, where we use it in asynchronous advantage actor-critic (A3C) for illustration. The effectiveness of accelerated A3C in aforementioned system and the superiority of fast learning have been proved in the simulation.
引用
收藏
页码:8637 / 8642
页数:6
相关论文
共 50 条
  • [1] Asynchronous Advantage Actor-Critic with Double Attention Mechanisms
    Ling, Xing-Hong
    Li, Jie
    Zhu, Fei
    Liu, Quan
    Fu, Yu-Chen
    [J]. Jisuanji Xuebao/Chinese Journal of Computers, 2020, 43 (01): : 93 - 106
  • [2] Optimization of Robot Environment Interaction Based on Asynchronous Advantage Actor-Critic Algorithm
    Xu, Jitang
    Chen, Qiang
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (06) : 1350 - 1359
  • [3] Accelerated DRL Agent for Autonomous Voltage Control Using Asynchronous Advantage Actor-critic
    Xu, Zhengyuan
    Zan, Yan
    Xu, Chunlei
    Li, Jin
    Shi, Di
    Wang, Zhiwei
    Zhang, Bei
    Duan, Jiajun
    [J]. 2020 IEEE POWER & ENERGY SOCIETY GENERAL MEETING (PESGM), 2020,
  • [4] Locating algorithm of steel stock area with asynchronous advantage actor-critic reinforcement learning
    Cho, Young-in
    Kim, Byeongseop
    Yoon, Hee-Chang
    Woo, Jong Hun
    [J]. JOURNAL OF COMPUTATIONAL DESIGN AND ENGINEERING, 2024, 11 (01) : 230 - 246
  • [5] Workflow scheduling based on asynchronous advantage actor-critic algorithm in multi-cloud environment
    Tang, Xuhao
    Liu, Fagui
    Wang, Bin
    Xu, Dishi
    Jiang, Jun
    Wu, Qingbo
    Chen, C. L. Philip
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 258
  • [6] Resource allocation Algorithm of Service Function Chain Based on Asynchronous Advantage Actor-Critic Learning
    Tang Lun
    He Xiaoyu
    Wang Xiao
    Tan Qi
    Hu Yanjuan
    Chen Qianbin
    [J]. JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (06) : 1733 - 1741
  • [7] Towards Understanding Asynchronous Advantage Actor-Critic: Convergence and Linear Speedup
    Shen, Han
    Zhang, Kaiqing
    Hong, Mingyi
    Chen, Tianyi
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2023, 71 : 2579 - 2594
  • [8] Adversarial retraining attack of asynchronous advantage actor-critic based pathfinding
    Chen Tong
    Liu Jiqiang
    Xiang Yingxiao
    Niu Wenjia
    Tong Endong
    Wang Shuoru
    Li He
    Chang Liang
    Li Gang
    Alfred, Chen Qi
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2021, 36 (05) : 2323 - 2346
  • [9] A new noise network and gradient parallelisation-based asynchronous advantage actor-critic algorithm
    Fei, Zhengshun
    Wang, Yanping
    Wang, Jinglong
    Liu, Kangling
    Huang, Bingqiang
    Tan, Ping
    [J]. IET CYBER-SYSTEMS AND ROBOTICS, 2022, 4 (03) : 175 - 188
  • [10] Asynchronous Advantage Actor-Critic Algorithm Based Cooperative Caching Strategy for Fog Radio Access Networks
    Jiang, Fan
    Han, Shaojiang
    Sun, Changyin
    [J]. 2023 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC, 2023,