Investigating the impact of input variable selection on daily solar radiation prediction accuracy using data-driven models: a case study in northern Iran

Cited by: 0
Authors
Mohammad Sina Jahangir
Seyed Mostafa Biazar
David Hah
John Quilty
Mohammad Isazadeh
Affiliations
[1] University of Waterloo,Department of Civil and Environmental Engineering
[2] University of Tabriz,Department of Water Engineering, Faculty of Agriculture
Keywords
Data-driven models; Solar radiation prediction; Input variable selection; Edgeworth approximation-based conditional mutual information; Iran
DOI
Not available
Abstract
Data-driven models have been explored in numerous studies for solar radiation ($R_s$) prediction. However, the use of different input variable selection (IVS) methods for improving $R_s$ prediction accuracy has mostly been neglected. This study explores various IVS methods, including the Gamma test (GT), Procrustes analysis (PA) and Edgeworth approximation-based conditional mutual information (EA), and evaluates their ability to improve $R_s$ prediction accuracy by coupling them with popular non-linear data-driven models: multilayer perceptron (MLP), support vector machine, extreme learning machine and multi-gene genetic programming (MGGP). The partial correlation input selection method was coupled with multiple linear regression to serve as a linear benchmark. Meteorological data from eight stations in northern Iran were used to build the $R_s$ prediction models. The type and number of variables selected differed between stations and depended on the IVS method. Models using EA selected fewer variables than those using GT and were more accurate, while models using PA selected the fewest variables of all methods but could not adequately predict $R_s$. Predictive performance also varied substantially when the IVS methods were paired with different model types. For example, coupling MLP, the model with the best average performance, with EA instead of PA resulted in a ~27% improvement (decrease) in the normalized root mean square error (nRMSE). The results also indicated that MGGP produced the least accurate predictions: its nRMSE was up to 40% higher than that of MLP when the EA method was used for IVS. Finally, IVS hyper-parameter adjustment (which is routinely overlooked in the literature) profoundly affected the results and is recommended as an important step when developing data-driven models for solar radiation prediction.
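The nRMSE comparisons quoted in the abstract can be illustrated with a minimal sketch. The abstract does not state how the RMSE is normalized, so dividing by the mean of the observations is an assumption here (normalizing by the observed range is another common choice). The function names and the sample values are hypothetical and are not taken from the study.

```python
import math


def nrmse(observed, predicted):
    """RMSE divided by the mean of the observations (assumed normalization)."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("inputs must be non-empty and of equal length")
    mse = sum((o - p) ** 2 for o, p in zip(observed, predicted)) / len(observed)
    return math.sqrt(mse) / (sum(observed) / len(observed))


def relative_change_percent(baseline, candidate):
    """Percent change of candidate relative to baseline; negative means lower error."""
    return 100.0 * (candidate - baseline) / baseline


# Hypothetical daily solar radiation values, for illustration only.
obs = [10.0, 20.0, 30.0]
pred_a = [12.0, 18.0, 33.0]   # predictions from a hypothetical model A
pred_b = [11.0, 19.0, 31.5]   # predictions from a hypothetical model B

change = relative_change_percent(nrmse(obs, pred_a), nrmse(obs, pred_b))
print(f"nRMSE change from A to B: {change:.1f}%")
```

A negative change corresponds to the "improvement (decrease)" wording used in the abstract, and a positive change corresponds to a model whose nRMSE "increased" relative to the baseline.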
Pages: 225-249
Number of pages: 24