Towards Pareto-optimal energy management in integrated energy systems: A multi-agent and multi-objective deep reinforcement learning approach

被引：4

作者：

Dou, Jiaming ^{[1
]}

Wang, Xiaojun ^{[1
]}

Liu, Zhao ^{[1
]}

Sun, Qingkai ^{[2
]}

Wang, Xihao ^{[1
]}

He, Jinghan ^{[1
]}

机构：

[1] Beijing Jiaotong Univ, Sch Elect Engn, Beijing 100044, Peoples R China

[2] State Grid Energy Res Inst Co Ltd, Beijing 102209, Peoples R China

来源：

INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS | 2024年 / 159卷

基金：

中国国家自然科学基金;

关键词：

Integrated energy systems; Deep reinforcement learning; Multi -agent reinforcement learning; Multi -objective reinforcement learning; Energy management; UNIT COMMITMENT; ELECTRICITY;

D O I：

10.1016/j.ijepes.2024.110022

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Deep Reinforcement Learning (DRL) is effective in solving complex, non-linear optimization problems, which is particularly relevant in energy management within Integrated Energy Systems (IESs). However, DRL approaches conventionally focus on single -objective policy learning, which is inadequate for the multi -objective optimization tasks commonly encountered in IESs energy management. To improve this, these approaches typically combine multi -objectives, such as operating cost objective and safety objective into a single reward function using scalarization techniques. This reduces the fidelity and interpretability of the objective space and limits its applicability to a wide range of IESs energy management. To address these challenges, this paper presents a novel framework called Multi -Agent and Multi -Objective DRL (MAMODRL). This framework combines value function decomposition and policy gradient methods to achieve a Pareto-optimal solution. The IESs energy management is initially formulated as a multi -objective Markov decision process. Then, an advanced MAMODRL architecture is developed, which includes objective value function networks to facilitate policy optimization. Finally, based on the definition of dominance, Pareto frontier is approximated of IESs energy management. A case study suggests that the proposed approach is effective in solving the Pareto frontier for IESs energy management. To ensure the safe operation of the system, safety threshold is set at the Pareto frontier forming a Pareto optimization with safety conditions. Compared to traditional DRL approaches, the proposed approach is more flexible, interpretable, and capable of making multi -dimensional decisions.

引用

页数：17

共 50 条

[1] Distributional Pareto-Optimal Multi-Objective Reinforcement Learning
Cai, Xin-Qiang
Zhang, Pushi
Zhao, Li
Bian, Jiang
Sugiyama, Masashi
Llorens, Ashley J.
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[2] Multi-objective optimization of hybrid electric vehicles energy management using multi-agent deep reinforcement learning framework
Li, Xiaoyu
Zhou, Zaihang
Wei, Changyin
Gao, Xiao
Zhang, Yibo
ENERGY AND AI, 2025, 20
[3] A multi-objective multi-agent deep reinforcement learning approach to residential appliance scheduling
Lu, Junlin
Mannion, Patrick
Mason, Karl
IET SMART GRID, 2022, 5 (04) : 260 - 280
[4] Pareto-optimal synchronization control of nonlinear multi-agent systems via integral reinforcement learning
Guo, Yaning
Sun, Qi
Pan, Quan
Wang, Yintao
NONLINEAR DYNAMICS, 2025, 113 (06) : 5339 - 5357
[5] Distributed energy management of multi-area integrated energy system based on multi-agent deep reinforcement learning
Ding, Lifu
Cui, Youkai
Yan, Gangfeng
Huang, Yaojia
Fan, Zhen
INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2024, 157
[6] A Multi-Objective Approach for Optimal Energy Management in Smart Home Using the Reinforcement Learning
Diyan, Muhammad
Silva, Bhagya Nathali
Han, Kijun
SENSORS, 2020, 20 (12) : 1 - 20
[7] Emergence of communication in competitive multi-agent systems: A Pareto multi-objective approach
McPartland, Michelle
Nolfi, Stefano
Abbass, Hussein A.
GECCO 2005: GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, VOLS 1 AND 2, 2005, : 51 - 58
[8] Distributed multi-agent reinforcement learning for multi-objective optimal dispatch of microgrids
Wang, Xiaowen
Liu, Shuai
Xu, Qianwen
Shao, Xinquan
ISA TRANSACTIONS, 2025, 158 : 130 - 140
[9] Multi-Agent Deep Reinforcement Learning for Resource Allocation in the Multi-Objective HetNet
Nie, Hongrui
Li, Shaosheng
Liu, Yong
IWCMC 2021: 2021 17TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2021, : 116 - 121
[10] A multi-objective evolutionary approach to Pareto-optimal model trees
Czajkowski, Marcin
Kretowski, Marek
SOFT COMPUTING, 2019, 23 (05) : 1423 - 1437

← 1 2 3 4 5 →