Towards Pareto-optimal energy management in integrated energy systems: A multi-agent and multi-objective deep reinforcement learning approach

被引：4

作者：

Dou, Jiaming ^{[1
]}

Wang, Xiaojun ^{[1
]}

Liu, Zhao ^{[1
]}

Sun, Qingkai ^{[2
]}

Wang, Xihao ^{[1
]}

He, Jinghan ^{[1
]}

机构：

[1] Beijing Jiaotong Univ, Sch Elect Engn, Beijing 100044, Peoples R China

[2] State Grid Energy Res Inst Co Ltd, Beijing 102209, Peoples R China

来源：

INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS | 2024年 / 159卷

基金：

中国国家自然科学基金;

关键词：

Integrated energy systems; Deep reinforcement learning; Multi -agent reinforcement learning; Multi -objective reinforcement learning; Energy management; UNIT COMMITMENT; ELECTRICITY;

D O I：

10.1016/j.ijepes.2024.110022

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Deep Reinforcement Learning (DRL) is effective in solving complex, non-linear optimization problems, which is particularly relevant in energy management within Integrated Energy Systems (IESs). However, DRL approaches conventionally focus on single -objective policy learning, which is inadequate for the multi -objective optimization tasks commonly encountered in IESs energy management. To improve this, these approaches typically combine multi -objectives, such as operating cost objective and safety objective into a single reward function using scalarization techniques. This reduces the fidelity and interpretability of the objective space and limits its applicability to a wide range of IESs energy management. To address these challenges, this paper presents a novel framework called Multi -Agent and Multi -Objective DRL (MAMODRL). This framework combines value function decomposition and policy gradient methods to achieve a Pareto-optimal solution. The IESs energy management is initially formulated as a multi -objective Markov decision process. Then, an advanced MAMODRL architecture is developed, which includes objective value function networks to facilitate policy optimization. Finally, based on the definition of dominance, Pareto frontier is approximated of IESs energy management. A case study suggests that the proposed approach is effective in solving the Pareto frontier for IESs energy management. To ensure the safe operation of the system, safety threshold is set at the Pareto frontier forming a Pareto optimization with safety conditions. Compared to traditional DRL approaches, the proposed approach is more flexible, interpretable, and capable of making multi -dimensional decisions.

引用

页数：17

共 50 条

[41] Pareto-optimal solutions in fuzzy multi-objective linear programming
Jimenez, Mariano
Bilbao, Amelia
FUZZY SETS AND SYSTEMS, 2009, 160 (18) : 2714 - 2721
[42] Deep clustering of cooperative multi-agent reinforcement learning to optimize multi chiller HVAC systems for smart buildings energy management
Homod, Raad Z.
Yaseen, Zaher Mundher
Hussein, Ahmed Kadhim
Almusaed, Amjad
Alawi, Omer A.
Falah, Mayadah W.
Abdelrazek, Ali H.
Ahmed, Waqar
Eltaweel, Mahmoud
JOURNAL OF BUILDING ENGINEERING, 2023, 65
[43] Multi-energy Management of Interconnected Multi-microgrid System Using Multi-agent Deep Reinforcement Learning
Sichen Li
Di Cao
Weihao Hu
Qi Huang
Zhe Chen
Frede Blaabjerg
JournalofModernPowerSystemsandCleanEnergy, 2023, 11 (05) : 1606 - 1617
[44] Multi-energy Management of Interconnected Multi-microgrid System Using Multi-agent Deep Reinforcement Learning
Li, Sichen
Cao, Di
Hu, Weihao
Huang, Qi
Chen, Zhe
Blaabjerg, Frede
JOURNAL OF MODERN POWER SYSTEMS AND CLEAN ENERGY, 2023, 11 (05) : 1606 - 1617
[45] Energy Management Simulation with Multi-Agent Reinforcement Learning: An Approach to Achieve Reliability and Resilience
Deshpande, Kapil
Moehl, Philipp
Haemmerle, Alexander
Weichhart, Georg
Zoerrer, Helmut
Pichler, Andreas
ENERGIES, 2022, 15 (19)
[46] Pareto-optimal solutions for multi-objective production scheduling problems
Bagchi, TP
EVOLUTIONARY MULTI-CRITERION OPTIMIZATION, PROCEEDINGS, 2001, 1993 : 458 - 471
[47] Pareto-optimal solutions for multi-objective flexible linear programming
Dubey, Dipti
Mehra, Aparna
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2016, 30 (01) : 535 - 546
[48] A Multi-Objective Pareto-Optimal Genetic Algorithm for QoS Multicasting
Rai, S. C.
Misra, B. B.
Nayak, A. K.
Mall, R.
Pradhan, S.
2009 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE, VOLS 1-3, 2009, : 1303 - +
[49] Multi-objective Energy Management for We-Energy in Energy Internet using Reinforcement Learning
Sun, Qiuye
Wang, Danlu
Ma, Dazhong
Huang, Bonan
2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 1630 - 1635
[50] Multi-Agent Deep Reinforcement Learning based Multi-Objective Resource Optimization in a Distributed Manufacturing System
Shen, Xinchang
Tham, Chen-Khong
2024 IEEE 99TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2024-SPRING, 2024,

← 1 2 3 4 5 →