Improving the prediction accuracy of river inflow using two data pre-processing techniques coupled with data-driven model

Cited by: 5
Authors
Nazir, Hafiza Mamona [1]
Hussain, Ijaz [1]
Faisal, Muhammad [2, 3]
Elashkar, Elsayed Elsherbini [4]
Shoukry, Alaa Mohamd [4, 5]
Affiliations
[1] Quaid I Azam Univ, Dept Stat, Islamabad, Pakistan
[2] Univ Bradford, Fac Hlth Studies, Bradford, W Yorkshire, England
[3] Bradford Teaching Hosp NHS Fdn Trust, Bradford Inst Hlth Res, Bradford, W Yorkshire, England
[4] King Saud Univ, Arriyadh Community Coll, Riyadh, Saudi Arabia
[5] KSA Workers Univ, Ksa, Egypt
Source
PEERJ | 2019 / Volume 7
Keywords
Data-driven models; Variational Mode Decomposition; Ensemble Empirical Mode Decomposition; Empirical Mode Decomposition; ARTIFICIAL NEURAL-NETWORKS; SINGULAR SPECTRUM ANALYSIS; WAVELET TRANSFORM; DECOMPOSITION; ENSEMBLE; INTELLIGENCE; MACHINE;
DOI
10.7717/peerj.8043
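Chinese Library Classification (CLC)
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification codes
07 ; 0710 ; 09 ;
Abstract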
River inflow prediction plays an important role in water resources management and power-generating systems. However, the noise and multi-scale nature of river inflow data add an extra layer of complexity to building an accurate predictive model. To overcome this issue, we propose a hybrid model based on Variational Mode Decomposition (VMD) combined with a singular spectrum analysis (SSA) denoising technique. First, SSA is applied to denoise the river inflow data. Second, VMD, a signal processing technique, is employed to decompose the denoised river inflow data into multiple intrinsic mode functions (IMFs), each with a relative frequency scale. Third, the Empirical Bayes Threshold (EBT) is applied to the non-linear IMFs to smooth them. Fourth, prediction models for the denoised and decomposed IMFs are established by learning their feature values with a Support Vector Machine (SVM). Finally, the ensemble prediction is obtained by adding the predicted IMFs. The proposed model is demonstrated using daily river inflow data from four river stations of the Indus River Basin (IRB) system, which is the largest water system in Pakistan. To illustrate the superiority of the proposed approach, the SSA-VMD-EBT-SVM hybrid model was compared with SSA-VMD-SVM and VMD-SVM, with the Empirical Mode Decomposition (EMD) based models EMD-SVM and SSA-EMD-SVM, and with the Ensemble EMD (EEMD) based models EEMD-SVM and SSA-EEMD-SVM. We found that the proposed hybrid SSA-VMD-EBT-SVM model outperformed the others on the following performance measures: the Nash-Sutcliffe Efficiency (NSE), Mean Absolute Percentage Error (MAPE) and Root Mean Square Error (RMSE). Therefore, the SSA-VMD-EBT-SVM model can be used for water resources management and power-generating systems that rely on non-linear time series data.
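The three performance measures named in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formulas used in the paper, so the code below assumes the common textbook definitions of RMSE, MAPE (expressed in percent) and NSE. The function names and the sample values are hypothetical and are not taken from the paper's data.

```python
import math

def rmse(obs, pred):
    """Root Mean Square Error (assumed standard definition)."""
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(obs, pred)) / len(obs))

def mape(obs, pred):
    """Mean Absolute Percentage Error in percent (assumes no zero observations)."""
    return 100.0 * sum(abs((o - p) / o) for o, p in zip(obs, pred)) / len(obs)

def nse(obs, pred):
    """Nash-Sutcliffe Efficiency: 1 minus the ratio of the squared prediction
    error to the squared deviation of the observations from their mean."""
    mean_obs = sum(obs) / len(obs)
    num = sum((o - p) ** 2 for o, p in zip(obs, pred))
    den = sum((o - mean_obs) ** 2 for o in obs)
    return 1.0 - num / den

# Hypothetical inflow values, for illustration only.
observed = [10.0, 20.0, 30.0, 40.0]
predicted = [12.0, 18.0, 33.0, 39.0]

print(rmse(observed, predicted))
print(mape(observed, predicted))
print(nse(observed, predicted))
```

Under these definitions, lower RMSE and MAPE indicate a better fit, while an NSE closer to 1 indicates a better fit; an NSE of exactly 1 corresponds to a perfect prediction.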
Pages: 25