Towards Pareto-optimal energy management in integrated energy systems: A multi-agent and multi-objective deep reinforcement learning approach

被引:4
|
作者
Dou, Jiaming [1 ]
Wang, Xiaojun [1 ]
Liu, Zhao [1 ]
Sun, Qingkai [2 ]
Wang, Xihao [1 ]
He, Jinghan [1 ]
机构
[1] Beijing Jiaotong Univ, Sch Elect Engn, Beijing 100044, Peoples R China
[2] State Grid Energy Res Inst Co Ltd, Beijing 102209, Peoples R China
基金
中国国家自然科学基金;
关键词
Integrated energy systems; Deep reinforcement learning; Multi -agent reinforcement learning; Multi -objective reinforcement learning; Energy management; UNIT COMMITMENT; ELECTRICITY;
D O I
10.1016/j.ijepes.2024.110022
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Deep Reinforcement Learning (DRL) is effective in solving complex, non-linear optimization problems, which is particularly relevant in energy management within Integrated Energy Systems (IESs). However, DRL approaches conventionally focus on single -objective policy learning, which is inadequate for the multi -objective optimization tasks commonly encountered in IESs energy management. To improve this, these approaches typically combine multi -objectives, such as operating cost objective and safety objective into a single reward function using scalarization techniques. This reduces the fidelity and interpretability of the objective space and limits its applicability to a wide range of IESs energy management. To address these challenges, this paper presents a novel framework called Multi -Agent and Multi -Objective DRL (MAMODRL). This framework combines value function decomposition and policy gradient methods to achieve a Pareto-optimal solution. The IESs energy management is initially formulated as a multi -objective Markov decision process. Then, an advanced MAMODRL architecture is developed, which includes objective value function networks to facilitate policy optimization. Finally, based on the definition of dominance, Pareto frontier is approximated of IESs energy management. A case study suggests that the proposed approach is effective in solving the Pareto frontier for IESs energy management. To ensure the safe operation of the system, safety threshold is set at the Pareto frontier forming a Pareto optimization with safety conditions. Compared to traditional DRL approaches, the proposed approach is more flexible, interpretable, and capable of making multi -dimensional decisions.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Pareto-optimal solutions in fuzzy multi-objective linear programming
    Jimenez, Mariano
    Bilbao, Amelia
    FUZZY SETS AND SYSTEMS, 2009, 160 (18) : 2714 - 2721
  • [42] Deep clustering of cooperative multi-agent reinforcement learning to optimize multi chiller HVAC systems for smart buildings energy management
    Homod, Raad Z.
    Yaseen, Zaher Mundher
    Hussein, Ahmed Kadhim
    Almusaed, Amjad
    Alawi, Omer A.
    Falah, Mayadah W.
    Abdelrazek, Ali H.
    Ahmed, Waqar
    Eltaweel, Mahmoud
    JOURNAL OF BUILDING ENGINEERING, 2023, 65
  • [43] Multi-energy Management of Interconnected Multi-microgrid System Using Multi-agent Deep Reinforcement Learning
    Sichen Li
    Di Cao
    Weihao Hu
    Qi Huang
    Zhe Chen
    Frede Blaabjerg
    JournalofModernPowerSystemsandCleanEnergy, 2023, 11 (05) : 1606 - 1617
  • [44] Multi-energy Management of Interconnected Multi-microgrid System Using Multi-agent Deep Reinforcement Learning
    Li, Sichen
    Cao, Di
    Hu, Weihao
    Huang, Qi
    Chen, Zhe
    Blaabjerg, Frede
    JOURNAL OF MODERN POWER SYSTEMS AND CLEAN ENERGY, 2023, 11 (05) : 1606 - 1617
  • [45] Energy Management Simulation with Multi-Agent Reinforcement Learning: An Approach to Achieve Reliability and Resilience
    Deshpande, Kapil
    Moehl, Philipp
    Haemmerle, Alexander
    Weichhart, Georg
    Zoerrer, Helmut
    Pichler, Andreas
    ENERGIES, 2022, 15 (19)
  • [46] Pareto-optimal solutions for multi-objective production scheduling problems
    Bagchi, TP
    EVOLUTIONARY MULTI-CRITERION OPTIMIZATION, PROCEEDINGS, 2001, 1993 : 458 - 471
  • [47] Pareto-optimal solutions for multi-objective flexible linear programming
    Dubey, Dipti
    Mehra, Aparna
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2016, 30 (01) : 535 - 546
  • [48] A Multi-Objective Pareto-Optimal Genetic Algorithm for QoS Multicasting
    Rai, S. C.
    Misra, B. B.
    Nayak, A. K.
    Mall, R.
    Pradhan, S.
    2009 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE, VOLS 1-3, 2009, : 1303 - +
  • [49] Multi-objective Energy Management for We-Energy in Energy Internet using Reinforcement Learning
    Sun, Qiuye
    Wang, Danlu
    Ma, Dazhong
    Huang, Bonan
    2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 1630 - 1635
  • [50] Multi-Agent Deep Reinforcement Learning based Multi-Objective Resource Optimization in a Distributed Manufacturing System
    Shen, Xinchang
    Tham, Chen-Khong
    2024 IEEE 99TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2024-SPRING, 2024,