Which one is more important in daily runoff forecasting using data driven models: Input data, model type, preprocessing or data length?

被引:29
|
作者
Moosavi, Vahid [1 ]
Fard, Zeinab Gheisoori [1 ]
Vafakhah, Mehdi [1 ]
机构
[1] Tarbiat Modares Univ, Fac Nat Resources & Marine Sci, Dept Watershed Management Engn, Tehran, Iran
关键词
Artificial intelligence; Data driven model; Optimization; Signal processing; Taguchi method; SUPPORT VECTOR MACHINE; NEURAL-NETWORK MODELS; PREDICT SCOUR DEPTH; ABUTMENT SCOUR; PART; DECOMPOSITION; WATER; OPTIMIZATION; SENSITIVITY; SELECTION;
D O I
10.1016/j.jhydrol.2022.127429
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Rainfall-runoff modeling is of great importance in hydrological sciences. Several different models have been developed for runoff modeling in three main categories i.e. physically-based, conceptual and empirical models. Data driven models are of the most widely used models in runoff modeling besides process based models. Different studies have been done to assess the performance of various models and the effect of input datasets, data length and disparate signal processing methods on the modeling performance. However, each of these studies has examined one of these factors separately and didn't assess the effect of these factors on the accuracy of runoff forecasting. Therefore, assessing the importance of each of the mentioned factors as well as determining the optimum structure that produces the best accuracy is still challenging. The main aim of this study was to determine the importance and the optimal combination of these factors in daily runoff modeling. In order to achieve this goal, Taguchi method was used. First, five levels were defined for each of the abovementioned factors. Five different input data combinations, five data driven models i.e. Adaptive Neuro-Fuzzy Inference System (ANFIS), Support Vector Regression (SVR), Group Method of Data Handling (GMDH), Random Forest (RF) and Partial Least Square Regression (PLS), four different signal processing methods i.e. normalization, wavelet, ensemble empirical mode decomposition (EEMD) and singular spectrum analysis (SSA) as well as no pre-processing condition, and five data lengths i.e. 2, 5, 10, 15 and 20 years were considered. The L-25 Taguchi orthogonal array was selected accordingly. The required 25 tests were implemented according to the L-25 Taguchi orthogonal array in three different basins to achieve more generalizable results. The results were then used in Taguchi analysis in order to attain the optimal combination of the levels of the mentioned factors and the importance of these factors in accurate prediction of runoff. Results showed that the hybrid wavelet-GMDH model with a complete dataset as input and 20-year data length provides the highest accuracy. It was also shown that the order of mentioned factors in terms of their importance and effect on runoff prediction accuracy is as follow: input dataset, data length, preprocessing and model type. GMDH and SVR had the best performance and wavelet and EEMD signal processing methods had the highest effect on the data driven models performance.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Forecasting Daily Fire Radiative Energy Using Data Driven Methods and Machine Learning Techniques
    Thapa, Laura H.
    Saide, Pablo E.
    Bortnik, Jacob
    Berman, Melinda T.
    da Silva, Arlindo
    Peterson, David A.
    Li, Fangjun
    Kondragunta, Shobha
    Ahmadov, Ravan
    James, Eric
    Romero-Alvarez, Johana
    Ye, Xinxin
    Soja, Amber
    Wiggins, Elizabeth
    Gargulinski, Emily
    JOURNAL OF GEOPHYSICAL RESEARCH-ATMOSPHERES, 2024, 129 (16)
  • [32] Forecasting daily pollen concentrations using data-driven modeling methods in Thessaloniki, Greece
    Voukantsis, Dimitris
    Niska, Harri
    Karatzas, Kostas
    Riga, Marina
    Damialis, Athanasios
    Vokou, Despoina
    ATMOSPHERIC ENVIRONMENT, 2010, 44 (39) : 5101 - 5111
  • [33] Improving Forecasting Ability of GITM Using Data-Driven Model Refinement
    Ponder, Brandon M.
    Ridley, Aaron J.
    Goel, Ankit
    Bernstein, D. S.
    SPACE WEATHER-THE INTERNATIONAL JOURNAL OF RESEARCH AND APPLICATIONS, 2023, 21 (03):
  • [34] Ensemble data-driven rainfall-runoff modeling using multi-source satellite and gauge rainfall data input fusion
    Vahid Nourani
    Hüseyin Gökçekuş
    Tagesse Gichamo
    Earth Science Informatics, 2021, 14 : 1787 - 1808
  • [35] Ensemble data-driven rainfall-runoff modeling using multi-source satellite and gauge rainfall data input fusion
    Nourani, Vahid
    Gokcekus, Huseyin
    Gichamo, Tagesse
    EARTH SCIENCE INFORMATICS, 2021, 14 (04) : 1787 - 1808
  • [36] River Stage Forecasting Using Wavelet Packet Decomposition and Data-driven Models
    Seo, Youngmin
    Kim, Sungwon
    12TH INTERNATIONAL CONFERENCE ON HYDROINFORMATICS (HIC 2016) - SMART WATER FOR THE FUTURE, 2016, 154 : 1225 - 1230
  • [37] Data-driven battery degradation prediction: Forecasting voltage-capacity curves using one-cycle data
    Tian, Jinpeng
    Xiong, Rui
    Shen, Weixiang
    Lu, Jiahuan
    ECOMAT, 2022, 4 (05)
  • [38] Modeling daily soil temperature using data-driven models and spatial distribution
    Sungwon Kim
    Vijay P. Singh
    Theoretical and Applied Climatology, 2014, 118 : 465 - 479
  • [39] Modeling daily soil temperature using data-driven models and spatial distribution
    Kim, Sungwon
    Singh, Vijay P.
    THEORETICAL AND APPLIED CLIMATOLOGY, 2014, 118 (03) : 465 - 479
  • [40] Multi-model approach applied to meteorological data for solar radiation forecasting using data-driven approaches
    Neeraj
    Gupta P.
    Tomar A.
    Optik, 2023, 286