A Novel Integral Reinforcement Learning-Based Control Method Assisted by Twin Delayed Deep Deterministic Policy Gradient for Solid Oxide Fuel Cell in DC Microgrid

被引:5
|
作者
Liu, Yulin [1 ]
Qie, Tianhao [1 ]
Yu, Yang [4 ]
Wang, Yuxuan [1 ]
Chau, Tat Kei [1 ]
Zhang, Xinan [1 ]
Manandhar, Ujjal [2 ]
Li, Sinan [3 ]
Iu, Herbert H. C. [1 ]
Fernando, Tyrone [1 ]
机构
[1] Univ Western Australia, Sch Engn, Crawley, WA 6009, Australia
[2] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[3] Univ Sydney, Sch Elect & Informat Engn, Sydney 2006, Australia
[4] Halliburton Ltd, Ctr Excellence Adv Control, Singapore 639940, Singapore
关键词
Solid oxide fuel cell; DC microgrid; integral reinforcement learning; hardware-in-the-loop; twin delayed deep deterministic policy gradient; POWER-PLANT; H-INFINITY; MODEL;
D O I
10.1109/TSTE.2022.3224179
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This paper proposes a new online integral reinforcement learning (IRL)-based control algorithm for the solid oxide fuel cell (SOFC) to overcome the long-lasting problems of model dependency and sensitivity to offline training dataset in the existing SOFC control approaches. The proposed method automatically updates the optimal control gains through the online neural network training. Unlike the other online learning-based control methods that rely on the assumption of initial stabilizing control or trial-and-error based initial control policy search, the proposed method employs the offline twin delayed deep deterministic policy gradient (TD3) algorithm to systematically determine the initial stabilizing control policy. Compared to the conventional IRL-based control, the proposed method contributes to greatly reduce the computational burden without compromising the control performance. The excellent performance of the proposed method is verified by hardware-in-the-loop experiments.
引用
收藏
页码:688 / 703
页数:16
相关论文
共 50 条
  • [1] REINFORCEMENT LEARNING-BASED PERFORMANCE AND CONTROL ENHANCEMENT OF PMSM THROUGH POLICY GRADIENT AGENT OF TWIN DELAYED DEEP DETERMINISTIC
    Shana, Lakshmi Prasad
    Gudipalli, Abhishek
    JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2023, 18 : 58 - 71
  • [2] Twin-Delayed Deep Deterministic Policy Gradient Algorithm to Control a Boost Converter in a DC Microgrid
    Muktiadji, Rifqi Firmansyah
    Ramli, Makbul A. M.
    Milyani, Ahmad H.
    ELECTRONICS, 2024, 13 (02)
  • [3] Twin delayed deep deterministic policy gradient-based deep reinforcement learning for energy management of fuel cell vehicle integrating durability information of powertrain
    Zhang, Yuanzhi
    Zhang, Caizhi
    Fan, Ruijia
    Huang, Shulong
    Yang, Yun
    Xu, Qianwen
    ENERGY CONVERSION AND MANAGEMENT, 2022, 274
  • [4] A Novel Deep Deterministic Policy Gradient Assisted Learning-Based Control Algorithm for Three-Phase DC/AC Inverter With an RL Load
    Xiang, Chaoqun
    Zhang, Xinan
    Qie, Tianhao
    Chau, Tat Kei
    Ye, Jian
    Yu, Yang
    Iu, Herbert Ho Ching
    Fernando, Tyrone
    IEEE JOURNAL OF EMERGING AND SELECTED TOPICS IN POWER ELECTRONICS, 2023, 11 (06) : 5529 - 5539
  • [5] Deep reinforcement learning for PMSG wind turbine control via twin delayed deep deterministic policy gradient (TD3)
    Zholtayev, Darkhan
    Rubagotti, Matteo
    Do, Ton Duc
    OPTIMAL CONTROL APPLICATIONS & METHODS, 2024, 45 (04): : 1889 - 1906
  • [6] Virtual Synchronous Generator Control Using Twin Delayed Deep Deterministic Policy Gradient Method
    Oboreh-Snapps, Oroghene
    She, Buxin
    Fahad, Shah
    Chen, Haotian
    Kimball, Jonathan
    Li, Fangxing
    Cui, Hantao
    Bo, Rui
    IEEE TRANSACTIONS ON ENERGY CONVERSION, 2024, 39 (01) : 214 - 228
  • [7] Fractional-Order Control Method Based on Twin-Delayed Deep Deterministic Policy Gradient Algorithm
    Jiao, Guangxin
    An, Zhengcai
    Shao, Shuyi
    Sun, Dong
    FRACTAL AND FRACTIONAL, 2024, 8 (02)
  • [8] Twin-delayed-based Deep Deterministic Policy Gradient Method Integrating Gravitational Search
    Xu P.-A.
    Liu Q.
    Hao S.-P.
    Zhang L.-H.
    Ruan Jian Xue Bao/Journal of Software, 2023, 34 (11): : 5192 - 5204
  • [9] Cooperative control of velocity and heading for unmanned surface vessel based on twin delayed deep deterministic policy gradient with an integral compensator
    Wang, Yibai
    Zhao, Shulong
    Wang, Qingling
    OCEAN ENGINEERING, 2023, 288
  • [10] Optimizing Control of Wastewater Treatment Plant With Reinforcement Learning: Technical Evaluation of Twin-Delayed Deep Deterministic Policy Gradient Agent
    Klawikowska, Zuzanna
    Grochowski, Michal
    IEEE ACCESS, 2024, 12 : 130318 - 130333