Multi-agent deep reinforcement learning for efficient multi-timescale bidding of a hybrid power plant in day-ahead and real-time markets

Cited by: 19
Authors
Ochoa, Tomas [1]
Gil, Esteban [1]
Angulo, Alejandro [1]
Valle, Carlos [2]
Affiliations
[1] Univ Tecn Federico Santa Maria, Departmento Ingn Elect, Valparaiso 2390123, Chile
[2] Univ Playa Ancha, Departmento Ciencia Datos Informat, Valparaiso 2360001, Chile
Keywords
Multi-view artificial neural networks; Multi-agent deep reinforcement learning; Energy management system; Solar generation; Energy storage; Electricity market bidding; Multi-timescale electricity markets; ENERGY-STORAGE; WIND; GENERATION; PARTICIPATION
DOI
10.1016/j.apenergy.2022.119067
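Chinese Library Classification (CLC) number
TE [Petroleum and natural gas industry]; TK [Energy and power engineering]
Discipline classification code
0807; 0820
Abstract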
Effective bidding on multiple electricity products under uncertainty would allow more profitable market participation for hybrid power plants with variable energy resources and storage systems, thereby aiding the decarbonization process. This study deals with the effective bidding of a photovoltaic plant with an energy storage system (PV-ESS) participating in multi-timescale electricity markets by providing energy and ancillary services (AS) products. The energy management system (EMS) aims to maximize the plant's profits by efficiently bidding in the day-ahead and real-time markets while considering the adequate delivery of the awarded products. The EMS's bidding decisions are usually obtained from traditional mathematical optimization frameworks. However, since the addressed problem is a multi-stage stochastic program, it is often intractable and suffers from the curse of dimensionality. This paper presents a novel multi-agent deep reinforcement learning (MADRL) framework for efficient multi-timescale bidding. Two agents based on multi-view artificial neural networks with recurrent layers (MVANNs) are adjusted to map environment observations to actions. To bid in both electricity markets, these mappings take as inputs the available information on electricity market products, bidding decisions, solar generation, stored energy, and time representations. Under a price-taker assumption, the physically and financially constrained environment of the EMS is simulated using historical data. A shared cumulative reward function with a finite time horizon is used to adjust the weights of both MVANNs simultaneously during the learning phase. We compare the proposed MADRL framework against scenario-based two-stage robust and stochastic optimization methods. Results are provided for one full year of market participation of the hybrid plant at a 1-minute resolution. The proposed method achieved statistically significantly higher profits, less variable incomes from both electricity markets, and better provision of awarded products by achieving smaller and less variable energy imbalances through time.
Pages: 13