Which one is more important in daily runoff forecasting using data driven models: Input data, model type, preprocessing or data length?

Cited by: 29
Authors
Moosavi, Vahid [1]
Fard, Zeinab Gheisoori [1]
Vafakhah, Mehdi [1]
Affiliations
[1] Tarbiat Modares Univ, Fac Nat Resources & Marine Sci, Dept Watershed Management Engn, Tehran, Iran
Keywords
Artificial intelligence; Data driven model; Optimization; Signal processing; Taguchi method; SUPPORT VECTOR MACHINE; NEURAL-NETWORK MODELS; PREDICT SCOUR DEPTH; ABUTMENT SCOUR; PART; DECOMPOSITION; WATER; OPTIMIZATION; SENSITIVITY; SELECTION;
DOI
10.1016/j.jhydrol.2022.127429
Chinese Library Classification number
TU [Building Science];
Discipline classification code
0813;
Abstract
Rainfall-runoff modeling is of great importance in hydrological sciences. Many models have been developed for runoff modeling, falling into three main categories: physically based, conceptual, and empirical models. Alongside process-based models, data-driven models are among the most widely used in runoff modeling. Various studies have assessed the performance of different models and the effect of input datasets, data length, and signal processing methods on modeling performance. However, each of these studies examined one factor in isolation and did not assess the combined effect of these factors on the accuracy of runoff forecasting. Therefore, assessing the relative importance of each factor and determining the optimum structure that produces the best accuracy remain challenging. The main aim of this study was to determine the importance and the optimal combination of these factors in daily runoff modeling. To this end, the Taguchi method was used. First, five levels were defined for each of the four factors: five input data combinations; five data-driven models, i.e., Adaptive Neuro-Fuzzy Inference System (ANFIS), Support Vector Regression (SVR), Group Method of Data Handling (GMDH), Random Forest (RF), and Partial Least Squares Regression (PLS); four signal processing methods, i.e., normalization, wavelet, ensemble empirical mode decomposition (EEMD), and singular spectrum analysis (SSA), plus a no-preprocessing condition; and five data lengths, i.e., 2, 5, 10, 15, and 20 years. The L-25 Taguchi orthogonal array was selected accordingly, and the required 25 tests were carried out in three different basins to obtain more generalizable results.
The results were then used in a Taguchi analysis to obtain the optimal combination of factor levels and the importance of each factor for accurate runoff prediction. The results showed that the hybrid wavelet-GMDH model with the complete dataset as input and a 20-year data length provides the highest accuracy. The order of the factors in terms of their importance and effect on runoff prediction accuracy is as follows: input dataset, data length, preprocessing, and model type. GMDH and SVR performed best among the models, and the wavelet and EEMD signal processing methods had the greatest effect on the performance of the data-driven models.
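The design described in the abstract (four factors at five levels each, 25 runs) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the abstract does not give the actual array layout or the analysis statistic, so the array is built with one standard mod-5 construction, the factor ranking uses the range of level means (delta), and the scores are synthetic numbers invented here, not results from the paper.

```python
from collections import Counter
from itertools import combinations, product

# Factor names follow the abstract; levels are coded 0..4 (hypothetical coding).
FACTORS = ["input_data", "model_type", "preprocessing", "data_length"]
N_LEVELS = 5


def l25_array():
    """Build a 25-run, 4-factor, 5-level orthogonal array (mod-5 construction)."""
    return [(i, j, (i + j) % N_LEVELS, (i + 2 * j) % N_LEVELS)
            for i, j in product(range(N_LEVELS), repeat=2)]


def is_orthogonal(runs):
    """Check that every pair of columns contains each level pair exactly once."""
    for a, b in combinations(range(len(runs[0])), 2):
        counts = Counter((run[a], run[b]) for run in runs)
        if len(counts) != N_LEVELS ** 2 or set(counts.values()) != {1}:
            return False
    return True


def main_effects(runs, scores):
    """Return the mean score per level and the max-min range (delta) per factor."""
    effects = {}
    for col, name in enumerate(FACTORS):
        means = []
        for level in range(N_LEVELS):
            vals = [s for run, s in zip(runs, scores) if run[col] == level]
            means.append(sum(vals) / len(vals))
        effects[name] = {"means": means, "delta": max(means) - min(means)}
    return effects


def rank_factors(effects):
    """Order factors from largest to smallest delta."""
    return sorted(effects, key=lambda name: effects[name]["delta"], reverse=True)


# Synthetic accuracy scores: an additive model with made-up weights per factor.
# These numbers are invented for illustration and are NOT from the paper.
WEIGHTS = (0.08, 0.01, 0.03, 0.05)
runs = l25_array()
scores = [0.5 + sum(w * lvl for w, lvl in zip(WEIGHTS, run)) for run in runs]

effects = main_effects(runs, scores)
ranking = rank_factors(effects)
best_levels = {name: max(range(N_LEVELS), key=lambda l: effects[name]["means"][l])
               for name in FACTORS}

print("orthogonal:", is_orthogonal(runs))
print("ranking:", ranking)
print("best levels:", best_levels)
```

Because each level of a factor appears with every level of every other factor equally often, the level means isolate each factor's contribution, which is why 25 runs can stand in for the 625 runs of a full factorial design. A real analysis would replace the synthetic scores with a measured accuracy metric, possibly converted to a signal-to-noise ratio and averaged over the three basins.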
Pages: 13
Related papers
50 in total
  • [1] Daily Runoff Forecasting Model Based on ANN and Data Preprocessing Techniques
    Wang, Yun
    Guo, Shenglian
    Xiong, Lihua
    Liu, Pan
    Liu, Dedi
    WATER, 2015, 7 (08) : 4144 - 4160
  • [2] Improving ANN model performance in runoff forecasting by adding soil moisture input and using data preprocessing techniques
    Ba, Huanhuan
    Guo, Shenglian
    Wang, Yun
    Hong, Xingjun
    Zhong, Yixuan
    Liu, Zhangjun
    HYDROLOGY RESEARCH, 2018, 49 (03) : 744 - 760
  • [3] Sensitivity of monthly rainfall-runoff models to input errors and data length
    Xu, C.-Y.
    Vandewiele, G.L.
    Hydrological Sciences Journal, 1994, 39 (02) : 157 - 176
  • [4] SENSITIVITY OF MONTHLY RAINFALL-RUNOFF MODELS TO INPUT ERRORS AND DATA LENGTH
    XU, CY
    VANDEWIELE, GL
    HYDROLOGICAL SCIENCES JOURNAL-JOURNAL DES SCIENCES HYDROLOGIQUES, 1994, 39 (02) : 157 - 176
  • [5] Enhancing daily runoff forecasting in hydropower basins with a voting ensemble model using historical data
    Le, Ngoc Anh
    Thanh, Phong Nguyen
    Pham, Nhat Truong
    Huy, Le Quoc
    Mai, Son T.
    Do, Duc Dung
    Nguyen, Huy Anh
    Anh, Duong Tran
    HYDROLOGICAL SCIENCES JOURNAL, 2025,
  • [6] A comparative study of data-driven models for runoff, sediment, and nitrate forecasting
    Zamani, Mohammad G.
    Nikoo, Mohammad Reza
    Rastad, Dana
    Nematollahi, Banafsheh
    JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2023, 341
  • [7] A Survey on Data-Driven Runoff Forecasting Models Based on Neural Networks
    Sheng, Ziyu
    Wen, Shiping
    Feng, Zhong-kai
    Gong, Jiaqi
    Shi, Kaibo
    Guo, Zhenyuan
    Yang, Yin
    Huang, Tingwen
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2023, 7 (04) : 1083 - 1097
  • [8] Daily runoff forecasting based on data-augmented neural network model
    Bi, Xiao-ying
    Li, Bo
    Lu, Wen-long
    Zhou, Xin-zhi
    JOURNAL OF HYDROINFORMATICS, 2020, 22 (04) : 900 - 915
  • [9] Long Lead Runoff Simulation Using Data Driven Models
    Karamouz, M.
    Fallahi, M.
    Nazif, S.
    Farahani, M. Rahimi
    INTERNATIONAL JOURNAL OF CIVIL ENGINEERING, 2012, 10 (04) : 328 - 336
  • [10] Forecasting of PV Power Generation using weather input data-preprocessing techniques
    Malvoni, Maria
    De Giorgi, Maria Grazia
    Congedo, Paolo Maria
    ATI 2017 - 72ND CONFERENCE OF THE ITALIAN THERMAL MACHINES ENGINEERING ASSOCIATION, 2017, 126 : 651 - 658