A Novel Integral Reinforcement Learning-Based Control Method Assisted by Twin Delayed Deep Deterministic Policy Gradient for Solid Oxide Fuel Cell in DC Microgrid

Cited by: 5
Authors
Liu, Yulin [1]
Qie, Tianhao [1]
Yu, Yang [4]
Wang, Yuxuan [1]
Chau, Tat Kei [1]
Zhang, Xinan [1]
Manandhar, Ujjal [2]
Li, Sinan [3]
Iu, Herbert H. C. [1]
Fernando, Tyrone [1]
Affiliations
[1] Univ Western Australia, Sch Engn, Crawley, WA 6009, Australia
[2] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[3] Univ Sydney, Sch Elect & Informat Engn, Sydney 2006, Australia
[4] Halliburton Ltd, Ctr Excellence Adv Control, Singapore 639940, Singapore
Keywords
Solid oxide fuel cell; DC microgrid; integral reinforcement learning; hardware-in-the-loop; twin delayed deep deterministic policy gradient; POWER-PLANT; H-INFINITY; MODEL;
DOI
10.1109/TSTE.2022.3224179
Chinese Library Classification number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
This paper proposes a new online integral reinforcement learning (IRL)-based control algorithm for the solid oxide fuel cell (SOFC) to overcome two long-standing problems of existing SOFC control approaches: model dependency and sensitivity to the offline training dataset. The proposed method automatically updates the optimal control gains through online neural network training. Unlike other online learning-based control methods, which either assume that an initial stabilizing control is available or search for an initial control policy by trial and error, the proposed method employs the offline twin delayed deep deterministic policy gradient (TD3) algorithm to systematically determine the initial stabilizing control policy. Compared to conventional IRL-based control, the proposed method greatly reduces the computational burden without compromising control performance. The performance of the proposed method is verified by hardware-in-the-loop experiments.
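As background for the TD3 algorithm named in the abstract, the following is a minimal sketch of the standard TD3 critic target (clipped double-Q learning with target policy smoothing). It is not the authors' implementation: the function name `td3_target`, the callables `actor_target`, `q1_target`, `q2_target`, and all default hyperparameters are assumptions chosen for illustration only.

```python
import numpy as np


def td3_target(reward, next_state, done, actor_target, q1_target, q2_target,
               gamma=0.99, sigma=0.2, noise_clip=0.5, act_limit=1.0, rng=None):
    """Generic TD3 critic target; all names and defaults are hypothetical."""
    rng = np.random.default_rng() if rng is None else rng

    # Target policy smoothing: perturb the target action with clipped Gaussian noise.
    next_action = np.asarray(actor_target(next_state), dtype=float)
    noise = np.clip(rng.normal(0.0, sigma, size=next_action.shape),
                    -noise_clip, noise_clip)
    next_action = np.clip(next_action + noise, -act_limit, act_limit)

    # Clipped double-Q: take the smaller of the two target critics' estimates.
    q_min = np.minimum(q1_target(next_state, next_action),
                       q2_target(next_state, next_action))

    # One-step bootstrapped target; no bootstrapping past a terminal state.
    return reward + gamma * (1.0 - done) * q_min
```

In a full TD3 loop, both critics would be regressed toward this target, and the actor and target networks would be updated less frequently than the critics (the "delayed" part of the name). How the resulting policy is handed over to the online IRL stage is specific to the paper and is not reproduced here.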
Pages: 688-703
Page count: 16
Related papers
50 in total
  • [21] Reinforcement learning control method for real-time hybrid simulation based on deep deterministic policy gradient algorithm
    Li, Ning
    Tang, Jichuan
    Li, Zhong-Xian
    Gao, Xiuyu
    STRUCTURAL CONTROL & HEALTH MONITORING, 2022, 29 (10):
  • [22] A novel data-driven energy management strategy for fuel cell hybrid electric bus based on improved twin delayed deep deterministic policy gradient algorithm
    Huang, Ruchen
    He, Hongwen
    INTERNATIONAL JOURNAL OF HYDROGEN ENERGY, 2024, 52 : 782 - 798
  • [23] A Deep Reinforcement Learning Method based on Deterministic Policy Gradient for Multi-Agent Cooperative Competition
    Zuo, Xuan
    Xue, Hui-Feng
    Wang, Xiao-Yin
    Du, Wan-Ru
    Tian, Tao
    Gao, Shan
    Zhang, Pu
    CONTROL ENGINEERING AND APPLIED INFORMATICS, 2021, 23 (03): : 88 - 98
  • [24] Active Power Correction Control of Power Grid Based on Improved Twin Delayed Deep Deterministic Policy Gradient Algorithm
    Gu, X.
    Liu, T.
    Li, S.
    Wang, T.
    Yang, X.
    Diangong Jishu Xuebao/Transactions of China Electrotechnical Society, 2023, 38 (08): : 2162 - 2177
  • [25] Machine Learning Control of an Aerial Robot Based on a Tuned Deep Deterministic Policy Gradient Method
    Esfandiari, Mohamadamin
    Atashgah, M. A. Amiri
    2022 10TH RSI INTERNATIONAL CONFERENCE ON ROBOTICS AND MECHATRONICS (ICROM), 2022, : 209 - 216
  • [26] Real-Time Autonomous Residential Demand Response Management Based on Twin Delayed Deep Deterministic Policy Gradient Learning
    Ye, Yujian
    Qiu, Dawei
    Wang, Huiyu
    Tang, Yi
    Strbac, Goran
    ENERGIES, 2021, 14 (03)
  • [27] Large-scale multi-agent reinforcement learning-based method for coordinated output voltage control of solid oxide fuel cell
    Li, Jiawen
    Li, Yaping
    Yu, Tao
    Yang, Bo
    CASE STUDIES IN THERMAL ENGINEERING, 2022, 30
  • [28] A Novel Deep Reinforcement Learning-Based Current Control Method for Direct Matrix Converters
    Li, Yao
    Qiu, Lin
    Liu, Xing
    Ma, Jien
    Zhang, Jian
    Fang, Youtong
    ENERGIES, 2023, 16 (05)
  • [29] Study on indoor temperature optimal control of air-conditioning based on Twin Delayed Deep Deterministic policy gradient algorithm
    Li, W.
    Wu, H.
    Zhao, Y.
    Jiang, C.
    Zhang, J.
    Energy and Buildings, 2024, 317
  • [30] Safe reinforcement learning-based control using deep deterministic policy gradient algorithm and slime mould algorithm with experimental tower crane system validation
    Zamfirache, Iuliu Alexandru
    Precup, Radu-Emil
    Petriu, Emil M.
    Information Sciences, 2025, 692