Predicting Energy Generation in Large Wind Farms: A Data-Driven Study with Open Data and Machine Learning

Cited by: 1
Authors
Paula, Matheus [1]
Casaca, Wallace [2]
Colnago, Marilaine [3]
da Silva, Jose R. [1]
Oliveira, Kleber [1]
Dias, Mauricio A. [4]
Negri, Rogerio [5]
Affiliations
[1] Sao Paulo State Univ UNESP, Fac Engn & Sci, BR-19274000 Rosana, Brazil
[2] Sao Paulo State Univ UNESP, Inst Biosci Letters & Exact Sci, BR-15054000 Sao Jose Do Rio Preto, Brazil
[3] Sao Paulo State Univ UNESP, Inst Chem, BR-4800060 Araraquara, Brazil
[4] Sao Paulo State Univ UNESP, Fac Sci & Technol, BR-19060080 Presidente Prudente, Brazil
[5] Sao Paulo State Univ UNESP, Sci & Technol Inst, BR-12245000 Sao Jose Dos Campos, Brazil
Funding
Sao Paulo Research Foundation (FAPESP), Brazil
Keywords
wind energy; forecasting; wind farms; machine learning; data science; long-term wind; forecast; system; Brazil
DOI
10.3390/inventions8050126
Chinese Library Classification
T [Industrial Technology]
Discipline classification code
08
Abstract
Wind energy has become a trend in Brazil, particularly in the northeastern region of the country. Despite its advantages, wind power generation has been hindered by the high volatility of exogenous factors, such as weather, temperature, and air humidity, making long-term forecasting a highly challenging task. Another issue is the need for reliable solutions, especially for large-scale wind farms, as this involves integrating specific optimization tools and restricted-access datasets collected locally at the power plants. Therefore, in this paper, the problem of forecasting the energy generated at the Praia Formosa wind farm, an eco-friendly park located in the state of Ceara, Brazil, which produces around 7% of the state's electricity, was addressed. To proceed with our data-driven analysis, publicly available data were collected from multiple Brazilian official sources, combining them into a unified database to perform exploratory data analysis and predictive modeling. Specifically, three machine-learning-based approaches were applied: Extreme Gradient Boosting, Random Forest, and Long Short-Term Memory Network, as well as feature-engineering strategies to enhance the precision of the machine intelligence models, including creating artificial features and tuning the hyperparameters. Our findings revealed that all implemented models successfully captured the energy-generation trends, patterns, and seasonality from the complex wind data. However, it was found that the LSTM-based model consistently outperformed the others, achieving a promising global MAPE of 4.55%, highlighting its accuracy in long-term wind energy forecasting. Temperature, relative humidity, and wind speed were identified as the key factors influencing electricity production, with peak generation typically occurring from August to November.
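For readers unfamiliar with the metric, the MAPE quoted in the abstract is conventionally the mean of the absolute forecast errors taken relative to the observed values and expressed as a percentage. The sketch below shows that standard definition only; the function name `mape` and the sample numbers are hypothetical and are not taken from the paper, and the paper's exact evaluation protocol may differ.

```python
def mape(actual, predicted):
    """Mean absolute percentage error, in percent.

    Assumes equal-length, non-empty sequences with no zero in `actual`.
    """
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    if not actual:
        raise ValueError("inputs must be non-empty")
    total = 0.0
    for a, p in zip(actual, predicted):
        if a == 0:
            raise ValueError("MAPE is undefined when an actual value is zero")
        total += abs((a - p) / a)
    return 100.0 * total / len(actual)


# Illustrative values only (hypothetical generation figures, arbitrary units).
observed = [100.0, 200.0, 400.0]
forecast = [110.0, 190.0, 400.0]
print(round(mape(observed, forecast), 2))  # prints 5.0
```

Because each error is divided by the observed value, periods with very low generation can dominate the score, which is worth keeping in mind when comparing MAPE figures across wind farms.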
Pages: 20