A Novel Integral Reinforcement Learning-Based Control Method Assisted by Twin Delayed Deep Deterministic Policy Gradient for Solid Oxide Fuel Cell in DC Microgrid

Cited by: 5
Authors
Liu, Yulin [1]
Qie, Tianhao [1]
Yu, Yang [4]
Wang, Yuxuan [1]
Chau, Tat Kei [1]
Zhang, Xinan [1]
Manandhar, Ujjal [2]
Li, Sinan [3]
Iu, Herbert H. C. [1]
Fernando, Tyrone [1]
Affiliations
[1] Univ Western Australia, Sch Engn, Crawley, WA 6009, Australia
[2] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[3] Univ Sydney, Sch Elect & Informat Engn, Sydney 2006, Australia
[4] Halliburton Ltd, Ctr Excellence Adv Control, Singapore 639940, Singapore
Keywords
Solid oxide fuel cell; DC microgrid; integral reinforcement learning; hardware-in-the-loop; twin delayed deep deterministic policy gradient; POWER-PLANT; H-INFINITY; MODEL
DOI
10.1109/TSTE.2022.3224179
Chinese Library Classification number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
This paper proposes a new online integral reinforcement learning (IRL)-based control algorithm for the solid oxide fuel cell (SOFC) to overcome the long-standing problems of model dependency and sensitivity to the offline training dataset in existing SOFC control approaches. The proposed method automatically updates the optimal control gains through online neural network training. Unlike other online learning-based control methods, which rely on the assumption of an initial stabilizing control or on a trial-and-error search for the initial control policy, the proposed method employs the offline twin delayed deep deterministic policy gradient (TD3) algorithm to systematically determine the initial stabilizing control policy. Compared to conventional IRL-based control, the proposed method greatly reduces the computational burden without compromising control performance. The excellent performance of the proposed method is verified by hardware-in-the-loop experiments.
Pages: 688-703
Number of pages: 16