A survey and comparative evaluation of actor-critic methods in process control

被引：20

作者：

Dutta, Debaprasad ^{[1
]}

Upreti, Simant R. ^{[1
]}

机构：

[1] Toronto Metropolitan Univ, Dept Chem Engn, Toronto, ON, Canada

来源：

CANADIAN JOURNAL OF CHEMICAL ENGINEERING | 2022年 / 100卷 / 09期

基金：

加拿大自然科学与工程研究理事会;

关键词：

actor-critic methods; process control; reinforcement learning; MODEL-PREDICTIVE CONTROL; LEARNING CONTROL; BATCH PROCESSES; NEURO-CONTROL; REINFORCEMENT; SYSTEM; PERFORMANCE; FRAMEWORK;

D O I：

10.1002/cjce.24508

中图分类号：

TQ [化学工业];

学科分类号：

0817 ;

摘要：

Actor-critic (AC) methods have emerged as an important class of reinforcement learning (RL) paradigm that enables model-free control by acting on a process and learning from the consequence. To that end, these methods utilize artificial neural networks, which are synergized for action evaluation and optimal action prediction. This feature is highly desirable for process control, especially when the knowledge about a process is limited or when it is susceptible to uncertainties. In this work, we summarize important concepts of AC methods and survey their process control applications. This treatment is followed by a comparative evaluation of the set-point tracking and robustness of controllers based on five prominent AC methods, namely, DDPG, TD3, SAC, PPO, and TRPO, in five case studies of varying process nonlinearity. The training demands and control performances indicate the superiority of DDPG and TD3 methods, which rely on off-policy, deterministic search for optimal action policies. Overall, the knowledge base and results of this work are expected to serve practitioners in their efforts toward further development of autonomous process control strategies.

引用

页码：2028 / 2056

页数：29

共 50 条

[41] Actor-critic reinforcement learning for the feedback control of a swinging chain
Dengler, C.
Lohmann, B.
IFAC PAPERSONLINE, 2018, 51 (13): : 378 - 383
[42] Adaptive actor-critic control of robots with integral invariant manifold
Pantoja-Garcia, Luis
Garcia-Rodriguez, Rodolfo
Parra-Vega, Vicente
2021 IEEE CHILEAN CONFERENCE ON ELECTRICAL, ELECTRONICS ENGINEERING, INFORMATION AND COMMUNICATION TECHNOLOGIES (IEEE CHILECON 2021), 2021, : 782 - 787
[43] Actor-critic learning based PID control for robotic manipulators
Nohooji, Hamed Rahimi
Zaraki, Abolfazl
Voos, Holger
APPLIED SOFT COMPUTING, 2024, 151
[44] Better Exploration with Optimistic Actor-Critic
Ciosek, Kamil
Quan Vuong
Loftin, Robert
Hofmann, Katja
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[45] Twin Delayed Hierarchical Actor-Critic
Anca, Mihai
Studley, Matthew
2021 7TH INTERNATIONAL CONFERENCE ON AUTOMATION, ROBOTICS AND APPLICATIONS (ICARA 2021), 2021, : 221 - 225
[46] Generative Adversarial Soft Actor-Critic
Hwang, Hyo-Seok
Kim, Yoojoong
Seok, Junhee
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
[47] Robust Actor-Critic With Relative Entropy Regulating Actor
Cheng, Yuhu
Huang, Longyang
Chen, C. L. Philip
Wang, Xuesong
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) : 9054 - 9063
[48] An Actor-Critic Algorithm for SVM Hyperparameters
Kim, Chayoung
Park, Jung-min
Kim, Hye-young
INFORMATION SCIENCE AND APPLICATIONS 2018, ICISA 2018, 2019, 514 : 653 - 661
[49] Offline-Online Actor-Critic
Wang X.
Hou D.
Huang L.
Cheng Y.
IEEE Transactions on Artificial Intelligence, 2024, 5 (01): : 61 - 69
[50] Actor-Critic Algorithms for Variance Minimization
Awate, Yogesh P.
TECHNOLOGICAL DEVELOPMENTS IN EDUCATION AND AUTOMATION, 2010, : 455 - 460

← 1 2 3 4 5 →