Data splitting strategies for improving data driven models for reference evapotranspiration estimation among similar stations

被引:35
|
作者
Shiri, Jalal [1 ]
Marti, Pau [2 ]
Karimi, Sepideh [1 ]
Landeras, Gorka [3 ]
机构
[1] Univ Tabriz, Fac Agr, Water Engn Dept, Tabriz, Iran
[2] Univ Illes Balears, Dept Biol, Area Engn Agroforestal, Carretera Valldemossa Km 7-5, Palma De Mallorca 07022, Spain
[3] AB Basque Country Res Inst Agr Dev, NEIKER, Alava, Basque Country, Spain
关键词
Data driven models; Evapotranspiration; Ancillary inputs; Gene expression programming; ARTIFICIAL NEURAL-NETWORKS;
D O I
10.1016/j.compag.2019.03.030
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
In the last years, different heuristic data driven models have been proposed to estimate reference evapotranspiration (ETo) with high performance accuracy as an alternative to empirical and physically-based approaches. However, these models, despite their complexity and soundness, rely on finite data series, like the empirical approaches, and their actual practical validity highly depend on the data management adopted in their development and assessment, in particular on the data splitting adopted. A major issue for ensuring a sound assessment of the heuristic model performance is the definition of a suitable criterion for splitting the data series in training and testing data. The present study evaluates new different data set splitting strategies based on the adoption of ancillary external inputs for enhancing the performance of the Gene Expression Programming- based models for estimating ETo. All models are assessed using k-fold validation considering annual test sizes. The results show that it is preferable to incorporate the external target variable as input to feed the new model, rather than to incorporate the original external input variables of the model. Regarding the external performance of the models, it is crucial to select a suitable training station for each testing station for providing accurate enough estimations. This way, the applicability of such approaches is not limited to local emergency models, but it allows estimating ETo elsewhere without the need of training previously a local model using local targets. Finally, it is important to select properly which station/s will provide external ancillary ETo inputs to the training process, because otherwise they introduce noise to the model and decrease their generalizability.
引用
收藏
页码:70 / 81
页数:12
相关论文
共 50 条
  • [1] Estimation of reference evapotranspiration using data driven techniques under limited data conditions
    Pandey P.K.
    Nyori T.
    Pandey V.
    Modeling Earth Systems and Environment, 2017, 3 (4) : 1449 - 1461
  • [2] Comparison and improvement of estimation models for the reference evapotranspiration using temperature data
    Li L.
    Qiu R.
    Liu C.
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2021, 37 (24): : 123 - 130
  • [3] Evaluation of two models using CERES data for reference evapotranspiration estimation
    Carmona, F.
    Holzman, M.
    Rivas, R.
    Degano, M. F.
    Kruse, E.
    Bayala, M.
    REVISTA DE TELEDETECCION, 2018, (51): : 87 - 98
  • [4] Estimation of reference evapotranspiration using machine learning models with limited data
    Ayaz, Adeeba
    Rajesh, Maddu
    Singh, Shailesh Kumar
    Rehana, Shaik
    AIMS GEOSCIENCES, 2021, 7 (03): : 268 - 290
  • [5] Application of data driven models in estimating daily reference evapotranspiration in a coastal region
    Sattari, Mohammad Taghi
    Apaydin, Halit
    INTERNATIONAL JOURNAL OF SUSTAINABLE AGRICULTURAL MANAGEMENT AND INFORMATICS, 2024, 10 (03) : 296 - 326
  • [6] Data from NASA Power and surface weather stations under different climates on reference evapotranspiration estimation
    Rosa, Stefanie Lais Kreutz
    de Souza, Jorge Luiz Moretti
    dos Santos, Aline Aparecida
    PESQUISA AGROPECUARIA BRASILEIRA, 2023, 58
  • [7] Assessing integrity of weather data for reference evapotranspiration estimation
    Allen, RG
    JOURNAL OF IRRIGATION AND DRAINAGE ENGINEERING-ASCE, 1996, 122 (02): : 97 - 106
  • [8] Reference evapotranspiration estimation without local climatic data
    Marti, Pau
    Gonzalez-Altozano, Pablo
    Gasque, Maria
    IRRIGATION SCIENCE, 2011, 29 (06) : 479 - 495
  • [9] Reference evapotranspiration estimation without local climatic data
    Pau Martí
    Pablo González-Altozano
    María Gasque
    Irrigation Science, 2011, 29 : 479 - 495
  • [10] Use of average data of 181 synoptic stations for estimation of reference crop evapotranspiration by temperature-based methods
    Mohammad Valipour
    Water Resources Management, 2014, 28 : 4237 - 4255