Investigating the impact of input variable selection on daily solar radiation prediction accuracy using data-driven models: a case study in northern Iran

被引:0
|
作者
Mohammad Sina Jahangir
Seyed Mostafa Biazar
David Hah
John Quilty
Mohammad Isazadeh
机构
[1] University of Waterloo,Department of Civil and Environmental Engineering
[2] University of Tabriz,Department of Water Engineering, Faculty of Agriculture
关键词
Data-driven models; Solar radiation prediction; Input variable selection; Edgeworth approximation-based conditional mutual information; Iran;
D O I
暂无
中图分类号
学科分类号
摘要
Data-driven models have been explored in numerous studies for solar radiation (Rs\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${R}_{s}$$\end{document}) prediction. However, the use of different input variable selection (IVS) methods for improving Rs\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${R}_{s}$$\end{document} prediction accuracy has mostly been neglected. This study explores various IVS methods, including Gamma test (GT), Procrustes analysis (PA) and Edgeworth approximation-based conditional mutual information (EA) and evaluates their ability to improve Rs\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${R}_{s}$$\end{document} prediction accuracy by coupling them with popular non-linear data-driven models, multilayer perceptron (MLP), support vector machine, extreme learning machine and multi-gene genetic programming (MGGP). The partial correlation input selection method was coupled with multiple linear regression to serve as a linear benchmark. Meteorological data from eight stations in northern Iran was used for building the Rs\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${R}_{s}$$\end{document} prediction models. The type and number of variables selected at each station was dissimilar and dependent on the IVS method. The models utilizing EA selected fewer variables compared to the GT method and had higher accuracy, while models using PA selected fewer variables than all methods but were not able to adequately predict Rs\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${R}_{s}$$\end{document}. It was also found that predictive performance substantially varied when pairing the IVS methods with different model types. For example, MLP, the model with the best average performance, when coupled with EA instead of PA resulted in a ~ 27% improvement (decrease) in the normalized root mean square error (nRMSE). The results also indicated that MGGP produced the least accurate predictions, where the nRMSE increased by up to 40% compared to MLP when the EA method was used for IVS. Finally, IVS hyper-parameter adjustment (which is routinely overlooked in the literature) profoundly affected the results and is recommended as a very important step to consider when developing data-driven models for solar radiation prediction.
引用
收藏
页码:225 / 249
页数:24
相关论文
共 50 条
  • [31] Univariate streamflow forecasting using commonly used data-driven models: literature review and case study
    Zhang, Zhenghao
    Zhang, Qiang
    Singh, Vijay P.
    HYDROLOGICAL SCIENCES JOURNAL-JOURNAL DES SCIENCES HYDROLOGIQUES, 2018, 63 (07): : 1091 - 1111
  • [32] DATA-DRIVEN MODELS FOR FAULT DETECTION USING KERNEL PCA: A WATER DISTRIBUTION SYSTEM CASE STUDY
    Nowicki, Adam
    Grochowski, Michal
    Duzinkiewicz, Kazimierz
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2012, 22 (04) : 939 - 949
  • [33] Improvement and evaluation of daily global solar radiation decomposition models using meteorological parameters: A case study for Turkey
    Erol, Ozge
    Filik, Ummuhan Basaran
    INTERNATIONAL JOURNAL OF GREEN ENERGY, 2022, 19 (15) : 1633 - 1648
  • [34] Data-driven input variable selection for rainfall-runoff modeling using binary-coded particle swarm optimization and Extreme Learning Machines
    Taormina, Riccardo
    Chau, Kwok-Wing
    JOURNAL OF HYDROLOGY, 2015, 529 : 1617 - 1632
  • [35] Prediction of horizontal diffuse solar radiation using clearness index based empirical models; A case study
    Khorasanizadeh, Hossein
    Mohammadi, Kasra
    Goudarzi, Natid
    INTERNATIONAL JOURNAL OF HYDROGEN ENERGY, 2016, 41 (47) : 21888 - 21898
  • [36] Comparative Study of Data-driven Solar Coronal Field Models Using a Flux Emergence Simulation as a Ground-truth Data Set
    Toriumi, Shin
    Takasao, Shinsuke
    Cheung, Mark C. M.
    Jiang, Chaowei
    Guo, Yang
    Hayashi, Keiji
    Inoue, Satoshi
    ASTROPHYSICAL JOURNAL, 2020, 890 (02):
  • [37] An extended comparison study of large scale data-driven prediction methods based on variable selection, latent variables, penalized regression and machine learning
    Rendall, Ricardo
    Pereira, Ana
    Reis, Marco
    26TH EUROPEAN SYMPOSIUM ON COMPUTER AIDED PROCESS ENGINEERING (ESCAPE), PT B, 2016, 38B : 1629 - 1634
  • [38] Prediction of Daily Global Solar Radiation using Resilient-propagation Artificial Neural Network and Historical Data: A Case Study of Hail, Saudi Arabia
    Boubaker, Sahbi
    Kamel, Souad
    Kchaou, Mourad
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2020, 10 (01) : 5228 - 5232
  • [39] Urban Geothermal Resource Potential Mapping Using Data-Driven Models-A Case Study of Zhuhai City
    Bian, Yu
    Ni, Yong
    Guo, Ya
    Wen, Jing
    Chen, Jie
    Chen, Ling
    Yang, Yongpeng
    SUSTAINABILITY, 2024, 16 (17)
  • [40] Investigating the intensity of urban heat island and the impacts of local climate using verified WRF data: A case study of Rasht, Northern Iran
    Orkomi, Ali Ahmadi
    Ameri, Mastooreh
    SUSTAINABLE CITIES AND SOCIETY, 2024, 106