A survey and comparative evaluation of actor-critic methods in process control

被引:20
|
作者
Dutta, Debaprasad [1 ]
Upreti, Simant R. [1 ]
机构
[1] Toronto Metropolitan Univ, Dept Chem Engn, Toronto, ON, Canada
来源
基金
加拿大自然科学与工程研究理事会;
关键词
actor-critic methods; process control; reinforcement learning; MODEL-PREDICTIVE CONTROL; LEARNING CONTROL; BATCH PROCESSES; NEURO-CONTROL; REINFORCEMENT; SYSTEM; PERFORMANCE; FRAMEWORK;
D O I
10.1002/cjce.24508
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
Actor-critic (AC) methods have emerged as an important class of reinforcement learning (RL) paradigm that enables model-free control by acting on a process and learning from the consequence. To that end, these methods utilize artificial neural networks, which are synergized for action evaluation and optimal action prediction. This feature is highly desirable for process control, especially when the knowledge about a process is limited or when it is susceptible to uncertainties. In this work, we summarize important concepts of AC methods and survey their process control applications. This treatment is followed by a comparative evaluation of the set-point tracking and robustness of controllers based on five prominent AC methods, namely, DDPG, TD3, SAC, PPO, and TRPO, in five case studies of varying process nonlinearity. The training demands and control performances indicate the superiority of DDPG and TD3 methods, which rely on off-policy, deterministic search for optimal action policies. Overall, the knowledge base and results of this work are expected to serve practitioners in their efforts toward further development of autonomous process control strategies.
引用
收藏
页码:2028 / 2056
页数:29
相关论文
共 50 条
  • [41] Actor-critic reinforcement learning for the feedback control of a swinging chain
    Dengler, C.
    Lohmann, B.
    IFAC PAPERSONLINE, 2018, 51 (13): : 378 - 383
  • [42] Adaptive actor-critic control of robots with integral invariant manifold
    Pantoja-Garcia, Luis
    Garcia-Rodriguez, Rodolfo
    Parra-Vega, Vicente
    2021 IEEE CHILEAN CONFERENCE ON ELECTRICAL, ELECTRONICS ENGINEERING, INFORMATION AND COMMUNICATION TECHNOLOGIES (IEEE CHILECON 2021), 2021, : 782 - 787
  • [43] Actor-critic learning based PID control for robotic manipulators
    Nohooji, Hamed Rahimi
    Zaraki, Abolfazl
    Voos, Holger
    APPLIED SOFT COMPUTING, 2024, 151
  • [44] Better Exploration with Optimistic Actor-Critic
    Ciosek, Kamil
    Quan Vuong
    Loftin, Robert
    Hofmann, Katja
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [45] Twin Delayed Hierarchical Actor-Critic
    Anca, Mihai
    Studley, Matthew
    2021 7TH INTERNATIONAL CONFERENCE ON AUTOMATION, ROBOTICS AND APPLICATIONS (ICARA 2021), 2021, : 221 - 225
  • [46] Generative Adversarial Soft Actor-Critic
    Hwang, Hyo-Seok
    Kim, Yoojoong
    Seok, Junhee
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [47] Robust Actor-Critic With Relative Entropy Regulating Actor
    Cheng, Yuhu
    Huang, Longyang
    Chen, C. L. Philip
    Wang, Xuesong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) : 9054 - 9063
  • [48] An Actor-Critic Algorithm for SVM Hyperparameters
    Kim, Chayoung
    Park, Jung-min
    Kim, Hye-young
    INFORMATION SCIENCE AND APPLICATIONS 2018, ICISA 2018, 2019, 514 : 653 - 661
  • [49] Offline-Online Actor-Critic
    Wang X.
    Hou D.
    Huang L.
    Cheng Y.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (01): : 61 - 69
  • [50] Actor-Critic Algorithms for Variance Minimization
    Awate, Yogesh P.
    TECHNOLOGICAL DEVELOPMENTS IN EDUCATION AND AUTOMATION, 2010, : 455 - 460