Multi-agent deep reinforcement learning for efficient multi-timescale bidding of a hybrid power plant in day-ahead and real-time markets

被引：19

作者：

Ochoa, Tomas ^{[1
]}

Gil, Esteban ^{[1
]}

Angulo, Alejandro ^{[1
]}

Valle, Carlos ^{[2
]}

机构：

[1] Univ Tecn Federico Santa Maria, Departmento Ingn Elect, Valparaiso 2390123, Chile

[2] Univ Playa Ancha, Departmento Ciencia Datos Informat, Valparaiso 2360001, Chile

来源：

APPLIED ENERGY | 2022年 / 317卷

关键词：

Multi-view artificial neural networks; Multi-agent deep reinforcement learning; Energy management system; Solar generation; Energy storage; Electricity market bidding; Multi-timescale electricity markets; ENERGY-STORAGE; WIND; GENERATION; PARTICIPATION;

D O I：

10.1016/j.apenergy.2022.119067

中图分类号：

TE [石油、天然气工业]; TK [能源与动力工程];

学科分类号：

0807 ; 0820 ;

摘要：

Effective bidding on multiple electricity products under uncertainty would allow a more profitable market participation for hybrid power plants with variable energy resources and storage systems, therefore aiding the decarbonization process. This study deals with the effective bidding of a photovoltaic plant with an energy storage system (PV-ESS) participating in multi-timescale electricity markets by providing energy and ancillary services (AS) products. The energy management system (EMS) aims to maximize the plant's profits by efficiently bidding in the day-ahead and real-time markets while considering the awarded products' adequate delivery. EMS's bidding decisions are usually obtained from traditional mathematical optimization frameworks. However, since the addressed problem is a multi-stage stochastic program, it is often intractable and suffers the curse of dimensionality. This paper presents a novel multi-agent deep reinforcement learning (MADRL) framework for efficient multi-timescale bidding. Two agents based on multi-view artificial neural networks with recurrent layers (MVANNs) are adjusted to map environment observations to actions. Such mappings use as inputs available information related to electricity market products, bidding decisions, solar generation, stored energy, and time representations to bid in both electricity markets. Sustained by a price-taker assumption, the physically and financially constrained EMS's environment is simulated by employing historical data. A shared cumulative reward function with a finite time horizon is used to adjust both MVANNs' weights simultaneously during the learning phase. We compare the proposed MADRL framework against scenario-based two-stage robust and stochastic optimization methods. Results are provided for one-year-round market participation of the hybrid plant at a 1-minute resolution. The proposed method achieved statistically significant higher profits, less variable incomes from both electricity markets, and better provision of awarded products by achieving smaller and less variable energy imbalances through time.

引用

页数：13

共 50 条

[31] Optimal bidding strategy of a virtual power plant in day-ahead energy and frequency regulation markets: A deep learning-based approach
Sadeghi, Saleh
Jahangir, Hamidreza
Vatandoust, Behzad
Golkar, Masoud Aliakbar
Ahmadian, Ali
Elkamel, Ali
INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2021, 127
[32] Bi-level day-ahead and real-time hybrid pricing model and its reinforcement learning method
He, Youmeng
Gu, Chunhua
Gao, Yan
Wang, Jingqi
ENERGY, 2025, 322
[33] A deep reinforcement learning approach for power management of battery-assisted fast-charging EV hubs participating in day-ahead and real-time electricity markets
Paudel, Diwas
Das, Tapas K.
ENERGY, 2023, 283
[34] Optimal Operation Method with Day-ahead Market and Real-Time Pricing in Multi-Power Systems
Higa, Shota
Tahara, Hayato
Ikema, Hiroki
Howlader, Harun-Or-Rashid
Funabashi, Toshihisa
2014 INTERNATIONAL CONFERENCE ON POWER ENGINEERING AND RENEWABLE ENERGY (ICPERE), 2014, : 130 - 135
[35] Multi-Agent Deep Reinforcement Learning using Attentive Graph Neural Architectures for Real-Time Strategy Games
Yun, Won Joon
Yi, Sungwon
Kim, Joongheon
2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 2967 - 2972
[36] Real-time scheduling for production-logistics collaborative environment using multi-agent deep reinforcement learning
Li, Yuxin
Li, Xinyu
Gao, Liang
ADVANCED ENGINEERING INFORMATICS, 2025, 65
[37] Multi-agent deep reinforcement learning based real-time planning approach for responsive customized bus routes☆
Wu, Binglin
Zuo, Xingquan
Chen, Gang
Ai, Guanqun
Wan, Xing
COMPUTERS & INDUSTRIAL ENGINEERING, 2024, 188
[38] QSOD: Hybrid Policy Gradient for Deep Multi-agent Reinforcement Learning
Rehman, Hafiz Muhammad Raza Ur
On, Byung-Won
Ningombam, Devarani Devi
Yi, Sungwon
Choi, Gyu Sang
IEEE ACCESS, 2021, 9 : 129728 - 129741
[39] BRGR: Multi-agent cooperative reinforcement learning with bidirectional real-time gain representation
He, Xin
Ge, Hongwei
Sun, Liang
Li, Qifeng
Hou, Yaqing
APPLIED INTELLIGENCE, 2023, 53 (16) : 19044 - 19059
[40] BRGR: Multi-agent cooperative reinforcement learning with bidirectional real-time gain representation
Xin He
Hongwei Ge
Liang Sun
Qifeng Li
Yaqing Hou
Applied Intelligence, 2023, 53 : 19044 - 19059

← 1 2 3 4 5 →