An Assessment of Multistage Reward Function Design for Deep Reinforcement Learning-Based Microgrid Energy Management

被引:23
|
作者
Goh, Hui Hwang [1 ]
Huang, Yifeng [1 ]
Lim, Chee Shen [2 ]
Zhang, Dongdong [1 ]
Liu, Hui [1 ]
Dai, Wei [1 ]
Kurniawan, Tonni Agustiono [3 ]
Rahman, Saifur [4 ]
机构
[1] Guangxi Univ, Sch Elect Engn, Nanning 530004, Peoples R China
[2] Xian Jiaotong Liverpool Univ, Dept Elect & Elect Engn, Suzhou 215131, Peoples R China
[3] Xiamen Univ, Coll Environm & Ecol, Key Lab Coastal & Wetland Ecosyst, Minist Educ, Xiamen 361102, Fujian, Peoples R China
[4] Virginia Polytech Inst & State Univ, Dept Adv Res Inst, Arlington, VA 22203 USA
关键词
Microgrids; Energy management; Real-time systems; Costs; Prediction algorithms; Training; Convergence; Microgrid energy management; deep reinforcement learning; reward function; optimal scheduling; OPTIMIZATION; SYSTEM; OPERATION;
D O I
10.1109/TSG.2022.3179567
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Reinforcement learning based energy management strategy has been an active research subject in the past few years. Different from the baseline reward function (BRF), the work proposes and investigates a multi-stage reward mechanism (MSRM) that scores the agent's step and final performance during training and returns it to the agent in real time as a reward. MSRM will also improve the agent's training through expert intervention which aims to prevent the agent from being trapped in sub-optimal strategies. The energy management performance considered by MSRM-based algorithm includes the energy balance, economic cost, and reliability. The reward function is assessed in conjunction with two deep reinforcement learning algorithms: double deep Q-learning network (DDQN) and policy gradient (PG). Upon benchmarking with BRF, the numerical simulation shows that MSRM tends to improve the convergence characteristic, reduce the explained variance, and reduce the tendency of the agent being trapped in suboptimal strategies. In addition, the methods have been assessed with MPC-based energy management strategies in terms of relative cost, self-balancing rate, and computational time. The assessment concludes that, in the given context, PG-MSRM has the best overall performance.
引用
收藏
页码:4300 / 4311
页数:12
相关论文
共 50 条
  • [1] Reward Mechanism Design for Deep Reinforcement Learning-Based Microgrid Energy Management
    Hu, Mingjie
    Han, Baohui
    Lv, Shilin
    Bao, Zhejing
    Lu, Lingxia
    Yu, Miao
    [J]. 2023 6TH INTERNATIONAL CONFERENCE ON RENEWABLE ENERGY AND POWER ENGINEERING, REPE 2023, 2023, : 201 - 205
  • [2] Effect of immediate reward function on the performance of reinforcement learning-based energy management system
    Biswas, Atriya
    Wang, Yue
    Emadi, Ali
    [J]. 2022 IEEE/AIAA TRANSPORTATION ELECTRIFICATION CONFERENCE AND ELECTRIC AIRCRAFT TECHNOLOGIES SYMPOSIUM (ITEC+EATS 2022), 2022, : 1021 - 1026
  • [3] Online Microgrid Energy Management Based on Safe Deep Reinforcement Learning
    Li, Hepeng
    Wang, Zhenhua
    Li, Lusi
    He, Haibo
    [J]. 2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,
  • [4] A unified benchmark for deep reinforcement learning-based energy management: Novel training ideas with the unweighted reward
    Chen, Jiaxin
    Tang, Xiaolin
    Yang, Kai
    [J]. ENERGY, 2024, 307
  • [5] Novel Architecture of Energy Management Systems Based on Deep Reinforcement Learning in Microgrid
    Lee, Seongwoo
    Seon, Joonho
    Sun, Young Ghyu
    Kim, Soo Hyun
    Kyeong, Chanuk
    Kim, Dong In
    Kim, Jin Young
    [J]. IEEE TRANSACTIONS ON SMART GRID, 2024, 15 (02) : 1646 - 1658
  • [6] Deep Reinforcement Learning-based Image Captioning with Embedding Reward
    Ren, Zhou
    Wang, Xiaoyu
    Zhang, Ning
    Lv, Xutao
    Li, Li-Jia
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1151 - 1159
  • [7] Deep reinforcement learning for energy management in a microgrid with flexible demand
    Nakabi, Taha Abdelhalim
    Toivanen, Pekka
    [J]. SUSTAINABLE ENERGY GRIDS & NETWORKS, 2021, 25
  • [8] A hybrid deep learning-based online energy management scheme for industrial microgrid
    Lu, Renzhi
    Bai, Ruichang
    Ding, Yuemin
    Wei, Min
    Jiang, Junhui
    Sun, Mingyang
    Xiao, Feng
    Zhang, Hai-Tao
    [J]. APPLIED ENERGY, 2021, 304
  • [9] Reinforcement learning-based scheduling strategy for energy storage in microgrid
    Zhou, Kunshu
    Zhou, Kaile
    Yang, Shanlin
    [J]. JOURNAL OF ENERGY STORAGE, 2022, 51
  • [10] Energy management in solar microgrid via reinforcement learning using fuzzy reward
    Kofinas, Panagiotis
    Vouros, George
    Dounis, Anastasios I.
    [J]. ADVANCES IN BUILDING ENERGY RESEARCH, 2018, 12 (01) : 97 - 115