Assessing temporal data partitioning scenarios for estimating reference evapotranspiration with machine learning techniques in arid regions

被引:27
|
作者
Kazemi, Mohammad Hossein [1 ]
Shiri, Jalal [1 ,2 ]
Marti, Pau [3 ]
Majnooni-Heris, Abolfazl [1 ]
机构
[1] Univ Tabriz, Fac Agr, Water Engn Dept, Tabriz, Iran
[2] Univ Tabriz, Fac Civil Engn, Ctr Excellence Hydroinformat, Tabriz, Iran
[3] Univ Illes Balears, Area Engn Agroforestal, Carretera Valldemossa Km 7-5, Palma De Mallorca 07022, Spain
关键词
Evapotranspiration; Gene expression programming; Hold out; K-fold validation; MODELING REFERENCE EVAPOTRANSPIRATION; NEURAL-NETWORKS; TIME-SERIES; TEMPERATURE; ALGORITHMS; STRATEGIES; EQUATIONS; SELECTION;
D O I
10.1016/j.jhydrol.2020.125252
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Recently, data driven machine learning techniques has been widely applied for modeling reference evapotranspiration (ETo) values under various climatic conditions taking into account the different number of sites and available data length. A major issue with applying those models is the proper selection of training/testing data sets. Although some spatial generalization approaches have been recommended for this purpose, there are no specified recommended local (temporal) data partitioning strategies for machine learning based ETo estimation. The present study evaluates different hold-out and k-fold validation temporal data partitioning strategies when using gene expression programming (GEP) technique to estimate daily ETo in arid regions. The k-fold validation strategies considered annual, monthly and growing season period patterns as test data sets. Although commonly used partitioning of the available patterns into training and testing sets gave accurate results, statistical analysis showed that the results obtained through k-fold validation assessment were more reliable. A two-block partitioning strategy with chronologic data selection for training and testing provided the most accurate results among the hold-out procedures (mean scatter index (SI) value of 0.162). Fixing the extreme ETo values as training data set in hold-out procedures provided the less accurate results with considerable over/underestimation of the ETo values (mean SI value was 0.506). Results on the basis of hold-out approaches can be biased or only partially valid depending on selection of the test data from the time series. K-fold validation yielded the lowest over/underestimations of ETo values. Further, considering monthly patterns as minimum affordable test size produced higher error magnitudes among k-fold validation strategies, while considering the complete patterns of one growing season provided more accurate results among k-fold validation strategies.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Estimating Daily Reference Evapotranspiration in a Semi-Arid Region Using Remote Sensing Data
    Najmaddin, Peshawa M.
    Whelan, Mick J.
    Balzter, Heiko
    REMOTE SENSING, 2017, 9 (08)
  • [22] Optimizing actual evapotranspiration simulation to identify evapotranspiration partitioning variations: A fusion of physical processes and machine learning techniques
    Jiang, Xiaoman
    Wang, Yuntao
    Yinglan, A.
    Wang, Guoqiang
    Zhang, Xiaojing
    Ma, Guangwen
    Duan, Limin
    Liu, Kai
    AGRICULTURAL WATER MANAGEMENT, 2024, 295
  • [23] Neural network approach to reference evapotranspiration modeling from limited climatic data in arid regions
    Abdelkader Laaboudi
    Brahim Mouhouche
    Belkacem Draoui
    International Journal of Biometeorology, 2012, 56 : 831 - 841
  • [24] Comparison of machine learning techniques and spatial distribution of daily reference evapotranspiration in Turkiye
    Yildirim, Demet
    Kucuktopcu, Erdem
    Cemek, Bilal
    Simsek, Halis
    APPLIED WATER SCIENCE, 2023, 13 (04)
  • [25] Calculating Sunshine Hours and Reference Evapotranspiration in Arid Regions When Solar Radiation Data are Limited
    Abd el-wahed, Mohamed H.
    Snyder, Richard L.
    IRRIGATION AND DRAINAGE, 2015, 64 (03) : 419 - 425
  • [26] Neural network approach to reference evapotranspiration modeling from limited climatic data in arid regions
    Laaboudi, Abdelkader
    Mouhouche, Brahim
    Draoui, Belkacem
    INTERNATIONAL JOURNAL OF BIOMETEOROLOGY, 2012, 56 (05) : 831 - 841
  • [27] Estimation of reference evapotranspiration using machine learning models with limited data
    Ayaz, Adeeba
    Rajesh, Maddu
    Singh, Shailesh Kumar
    Rehana, Shaik
    AIMS GEOSCIENCES, 2021, 7 (03): : 268 - 290
  • [28] Evaluation of machine learning models for prediction of daily reference evapotranspiration in semi-arid India
    Singh, Amit K.
    Singh, J. B.
    Das, Bappa
    Singh, Ramesh
    Ghosh, Avijit
    Kantwa, S. R.
    RANGE MANAGEMENT AND AGROFORESTRY, 2023, 44 (01) : 118 - 125
  • [29] Comparative assessment of empirical and hybrid machine learning models for estimating daily reference evapotranspiration in sub-humid and semi-arid climates
    Acharki, Siham
    Raza, Ali
    Vishwakarma, Dinesh Kumar
    Amharref, Mina
    Bernoussi, Abdes Samed
    Singh, Sudhir Kumar
    Al-Ansari, Nadhir
    Dewidar, Ahmed Z.
    Al-Othman, Ahmed A.
    Mattar, Mohamed A.
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [30] Comparison of machine learning techniques and spatial distribution of daily reference evapotranspiration in Türkiye
    Demet Yildirim
    Erdem Küçüktopcu
    Bilal Cemek
    Halis Simsek
    Applied Water Science, 2023, 13