An accelerated asynchronous advantage actor-critic algorithm applied in papermaking

被引：0

作者：

Wang, Xuechun ^{[1
]}

Zhuang, Zhiwei ^{[1
]}

Zou, Luobao ^{[1
]}

Zhang, Weidong ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China

来源：

PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC) | 2019年

关键词：

Basis weight and moisture control; Reinforcement Learning; Asynchronous advantage actor-critic; Eligibility trace;

D O I：

10.23919/chicc.2019.8866243

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The basis weight and moisture system with multivariate, strong coupling, large time lag and large inertia has always been tricky in papermaking. In this paper, a framework which applies the reinforcement learning methods is introduced to solve. Inspired by the previous work, we improve the Markov Decision Process (MDP) in our model, moreover we propose a new form of eligibility traces which can realize data multiplexing to speed up learning, aiming at remitting the costly expense of industrial process online training, where we use it in asynchronous advantage actor-critic (A3C) for illustration. The effectiveness of accelerated A3C in aforementioned system and the superiority of fast learning have been proved in the simulation.

引用

页码：8637 / 8642

页数：6

共 50 条

[1] Asynchronous Advantage Actor-Critic with Double Attention Mechanisms
Ling, Xing-Hong
Li, Jie
Zhu, Fei
Liu, Quan
Fu, Yu-Chen
[J]. Jisuanji Xuebao/Chinese Journal of Computers, 2020, 43 (01): : 93 - 106
[2] Optimization of Robot Environment Interaction Based on Asynchronous Advantage Actor-Critic Algorithm
Xu, Jitang
Chen, Qiang
[J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (06) : 1350 - 1359
[3] Accelerated DRL Agent for Autonomous Voltage Control Using Asynchronous Advantage Actor-critic
Xu, Zhengyuan
Zan, Yan
Xu, Chunlei
Li, Jin
Shi, Di
Wang, Zhiwei
Zhang, Bei
Duan, Jiajun
[J]. 2020 IEEE POWER & ENERGY SOCIETY GENERAL MEETING (PESGM), 2020,
[4] Locating algorithm of steel stock area with asynchronous advantage actor-critic reinforcement learning
Cho, Young-in
Kim, Byeongseop
Yoon, Hee-Chang
Woo, Jong Hun
[J]. JOURNAL OF COMPUTATIONAL DESIGN AND ENGINEERING, 2024, 11 (01) : 230 - 246
[5] Workflow scheduling based on asynchronous advantage actor-critic algorithm in multi-cloud environment
Tang, Xuhao
Liu, Fagui
Wang, Bin
Xu, Dishi
Jiang, Jun
Wu, Qingbo
Chen, C. L. Philip
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 258
[6] Resource allocation Algorithm of Service Function Chain Based on Asynchronous Advantage Actor-Critic Learning
Tang Lun
He Xiaoyu
Wang Xiao
Tan Qi
Hu Yanjuan
Chen Qianbin
[J]. JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (06) : 1733 - 1741
[7] Towards Understanding Asynchronous Advantage Actor-Critic: Convergence and Linear Speedup
Shen, Han
Zhang, Kaiqing
Hong, Mingyi
Chen, Tianyi
[J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2023, 71 : 2579 - 2594
[8] Adversarial retraining attack of asynchronous advantage actor-critic based pathfinding
Chen Tong
Liu Jiqiang
Xiang Yingxiao
Niu Wenjia
Tong Endong
Wang Shuoru
Li He
Chang Liang
Li Gang
Alfred, Chen Qi
[J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2021, 36 (05) : 2323 - 2346
[9] A new noise network and gradient parallelisation-based asynchronous advantage actor-critic algorithm
Fei, Zhengshun
Wang, Yanping
Wang, Jinglong
Liu, Kangling
Huang, Bingqiang
Tan, Ping
[J]. IET CYBER-SYSTEMS AND ROBOTICS, 2022, 4 (03) : 175 - 188
[10] Asynchronous Advantage Actor-Critic Algorithm Based Cooperative Caching Strategy for Fog Radio Access Networks
Jiang, Fan
Han, Shaojiang
Sun, Changyin
[J]. 2023 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC, 2023,

← 1 2 3 4 5 →