Towards Pareto-optimal energy management in integrated energy systems: A multi-agent and multi-objective deep reinforcement learning approach

被引:4
|
作者
Dou, Jiaming [1 ]
Wang, Xiaojun [1 ]
Liu, Zhao [1 ]
Sun, Qingkai [2 ]
Wang, Xihao [1 ]
He, Jinghan [1 ]
机构
[1] Beijing Jiaotong Univ, Sch Elect Engn, Beijing 100044, Peoples R China
[2] State Grid Energy Res Inst Co Ltd, Beijing 102209, Peoples R China
基金
中国国家自然科学基金;
关键词
Integrated energy systems; Deep reinforcement learning; Multi -agent reinforcement learning; Multi -objective reinforcement learning; Energy management; UNIT COMMITMENT; ELECTRICITY;
D O I
10.1016/j.ijepes.2024.110022
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Deep Reinforcement Learning (DRL) is effective in solving complex, non-linear optimization problems, which is particularly relevant in energy management within Integrated Energy Systems (IESs). However, DRL approaches conventionally focus on single -objective policy learning, which is inadequate for the multi -objective optimization tasks commonly encountered in IESs energy management. To improve this, these approaches typically combine multi -objectives, such as operating cost objective and safety objective into a single reward function using scalarization techniques. This reduces the fidelity and interpretability of the objective space and limits its applicability to a wide range of IESs energy management. To address these challenges, this paper presents a novel framework called Multi -Agent and Multi -Objective DRL (MAMODRL). This framework combines value function decomposition and policy gradient methods to achieve a Pareto-optimal solution. The IESs energy management is initially formulated as a multi -objective Markov decision process. Then, an advanced MAMODRL architecture is developed, which includes objective value function networks to facilitate policy optimization. Finally, based on the definition of dominance, Pareto frontier is approximated of IESs energy management. A case study suggests that the proposed approach is effective in solving the Pareto frontier for IESs energy management. To ensure the safe operation of the system, safety threshold is set at the Pareto frontier forming a Pareto optimization with safety conditions. Compared to traditional DRL approaches, the proposed approach is more flexible, interpretable, and capable of making multi -dimensional decisions.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Distributional Pareto-Optimal Multi-Objective Reinforcement Learning
    Cai, Xin-Qiang
    Zhang, Pushi
    Zhao, Li
    Bian, Jiang
    Sugiyama, Masashi
    Llorens, Ashley J.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [2] Multi-objective optimization of hybrid electric vehicles energy management using multi-agent deep reinforcement learning framework
    Li, Xiaoyu
    Zhou, Zaihang
    Wei, Changyin
    Gao, Xiao
    Zhang, Yibo
    ENERGY AND AI, 2025, 20
  • [3] A multi-objective multi-agent deep reinforcement learning approach to residential appliance scheduling
    Lu, Junlin
    Mannion, Patrick
    Mason, Karl
    IET SMART GRID, 2022, 5 (04) : 260 - 280
  • [4] Pareto-optimal synchronization control of nonlinear multi-agent systems via integral reinforcement learning
    Guo, Yaning
    Sun, Qi
    Pan, Quan
    Wang, Yintao
    NONLINEAR DYNAMICS, 2025, 113 (06) : 5339 - 5357
  • [5] Distributed energy management of multi-area integrated energy system based on multi-agent deep reinforcement learning
    Ding, Lifu
    Cui, Youkai
    Yan, Gangfeng
    Huang, Yaojia
    Fan, Zhen
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2024, 157
  • [6] A Multi-Objective Approach for Optimal Energy Management in Smart Home Using the Reinforcement Learning
    Diyan, Muhammad
    Silva, Bhagya Nathali
    Han, Kijun
    SENSORS, 2020, 20 (12) : 1 - 20
  • [7] Emergence of communication in competitive multi-agent systems: A Pareto multi-objective approach
    McPartland, Michelle
    Nolfi, Stefano
    Abbass, Hussein A.
    GECCO 2005: GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, VOLS 1 AND 2, 2005, : 51 - 58
  • [8] Distributed multi-agent reinforcement learning for multi-objective optimal dispatch of microgrids
    Wang, Xiaowen
    Liu, Shuai
    Xu, Qianwen
    Shao, Xinquan
    ISA TRANSACTIONS, 2025, 158 : 130 - 140
  • [9] Multi-Agent Deep Reinforcement Learning for Resource Allocation in the Multi-Objective HetNet
    Nie, Hongrui
    Li, Shaosheng
    Liu, Yong
    IWCMC 2021: 2021 17TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2021, : 116 - 121
  • [10] A multi-objective evolutionary approach to Pareto-optimal model trees
    Czajkowski, Marcin
    Kretowski, Marek
    SOFT COMPUTING, 2019, 23 (05) : 1423 - 1437