Improving Solar Radiation Prediction in China: A Stacking Model Approach with Categorical Boosting Feature Selection

被引:0
|
作者
Ding, Yuehua [1 ]
Wang, Yuhang [2 ]
Li, Zhe [3 ]
Zhao, Long [1 ]
Shi, Yi [3 ]
Xing, Xuguang [4 ]
Chen, Shuangchen [1 ]
机构
[1] College of Horticulture and Plant Protection, Henan University of Science and Technology, Luoyang,471000, China
[2] School of Energy Science and Engineering, Harbin Institute of Technology, 92, West Dazhi Street, Harbin,150001, China
[3] College of Agricultural Equipment Engineering, Henan University of Science and Technology, Luoyang,471000, China
[4] Key Laboratory for Agricultural Soil and Water Engineering in Arid Area of Ministry of Education, Northwest A&F University, Xianyang,712100, China
基金
中国国家自然科学基金;
关键词
Adaptive boosting - Prediction models - Solar irradiance - Time difference of arrival - Weather forecasting;
D O I
10.3390/atmos15121436
中图分类号
学科分类号
摘要
Solar radiation is an important energy source, and accurately predicting it [daily global and diffuse solar radiation (Rs and Rd)] is essential for research on surface energy exchange, hydrologic systems, and agricultural production. However, Rs and Rd estimation relies on meteorological data and related model parameters, which leads to inaccuracy in some regions. To improve the estimation accuracy and generalization ability of the Rs and Rd models, 17 representative radiation stations in China were selected. The categorical boosting (CatBoost) feature selection algorithm was utilized to construct a novel stacking model from sample and parameter diversity perspectives. The results revealed that the characteristics related to sunshine duration (n) and ozone (O3) significantly affect solar radiation prediction. The proposed new ensemble model framework had better accuracy than base models in root mean square error (RMSE), coefficient of determination (R2), mean absolute error (MAE), and global performance index (GPI). The solar radiation prediction model is more applicable to coastal areas, such as Shanghai and Guangzhou, than to inland regions of China. The range and mean of RMSE, MAE, and R2 for Rs prediction are 1.5737–3.7482 (1.9318), 1.1773–2.6814 (1.4336), and 0.7597–0.9655 (0.9226), respectively; for Rd prediction, they are 1.2589–2.9038 (1.8201), 0.9811–2.1024 (1.3493), and 0.5153–0.9217 (0.7248), respectively. The results of this study can provide a reference for Rs and Rd estimation and related applications in China. © 2024 by the authors.
引用
收藏
相关论文
共 50 条
  • [1] Model selection for accurate daily global solar radiation prediction in China
    Gouda, Shaban G.
    Hussein, Zakia
    Luo, Shuai
    Yuan, Qiaoxia
    JOURNAL OF CLEANER PRODUCTION, 2019, 221 : 132 - 144
  • [2] Brain stroke prediction model based on boosting and stacking ensemble approach
    Mondal S.
    Ghosh S.
    Nag A.
    International Journal of Information Technology, 2024, 16 (1) : 437 - 446
  • [3] Feature selection in Solar Radiation prediction using Bootstrapped SVRs
    Garcia-Hinde, O.
    Gomez-Verdejo, V.
    Martinez-Ramon, M.
    Casanova-Mateo, C.
    Sanz-Justo, J.
    Jimenez-Fernandez, S.
    Salcedo-Sanz, S.
    2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 3638 - 3645
  • [4] Twin Boosting: improved feature selection and prediction
    Buehlmann, Peter
    Hothorn, Torsten
    STATISTICS AND COMPUTING, 2010, 20 (02) : 119 - 138
  • [5] Twin Boosting: improved feature selection and prediction
    Peter Bühlmann
    Torsten Hothorn
    Statistics and Computing, 2010, 20 : 119 - 138
  • [6] Solar Flare Prediction and Feature Selection Using a Light-Gradient-Boosting Machine Algorithm
    Vysakh, P. A.
    Mayank, Prateek
    SOLAR PHYSICS, 2023, 298 (11)
  • [7] Solar Flare Prediction and Feature Selection Using a Light-Gradient-Boosting Machine Algorithm
    P. A. Vysakh
    Prateek Mayank
    Solar Physics, 2023, 298
  • [8] Improving Software Quality Estimation by Combining Boosting and Feature Selection
    Gao, Kehan
    Khoshgoftaar, Taghi
    Napolitano, Amri
    2013 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2013), VOL 1, 2013, : 27 - 33
  • [9] Feature selection for clustering categorical data with an embedded modelling approach
    Silvestre, Claudia
    Cardoso, Margarida G. M. S.
    Figueiredo, Mario
    EXPERT SYSTEMS, 2015, 32 (03) : 444 - 453
  • [10] AN HYBRID APPROACH TO FEATURE SELECTION FOR MIXED CATEGORICAL AND CONTINUOUS DATA
    Doquire, Gauthier
    Verleysen, Michel
    KDIR 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND INFORMATION RETRIEVAL, 2011, : 394 - 401