Improving the prediction accuracy of river inflow using two data pre-processing techniques coupled with data-driven model

Cited by: 5
Authors
Nazir, Hafiza Mamona [1]
Hussain, Ijaz [1]
Faisal, Muhammad [2, 3]
Elashkar, Elsayed Elsherbini [4]
Shoukry, Alaa Mohamd [4, 5]
Affiliations
[1] Quaid I Azam Univ, Dept Stat, Islamabad, Pakistan
[2] Univ Bradford, Fac Hlth Studies, Bradford, W Yorkshire, England
[3] Bradford Teaching Hosp NHS Fdn Trust, Bradford Inst Hlth Res, Bradford, W Yorkshire, England
[4] King Saud Univ, Arriyadh Community Coll, Riyadh, Saudi Arabia
[5] KSA Workers Univ, Ksa, Egypt
Source
PEERJ | 2019 / Vol. 7
Keywords
Data-driven models; Variational Mode Decomposition; Ensemble Empirical Mode Decomposition; Empirical Mode Decomposition; ARTIFICIAL NEURAL-NETWORKS; SINGULAR SPECTRUM ANALYSIS; WAVELET TRANSFORM; DECOMPOSITION; ENSEMBLE; INTELLIGENCE; MACHINE;
DOI
10.7717/peerj.8043
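Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code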
07 ; 0710 ; 09 ;
Abstract
River inflow prediction plays an important role in water resources management and power-generating systems. However, the noise and multi-scale nature of river inflow data add an extra layer of complexity to building an accurate predictive model. To overcome this issue, we propose a hybrid model that couples Variational Mode Decomposition (VMD) with a Singular Spectrum Analysis (SSA) denoising technique. First, SSA is applied to denoise the river inflow data. Second, VMD, a signal processing technique, is employed to decompose the denoised river inflow data into multiple intrinsic mode functions (IMFs), each with a relative frequency scale. Third, Empirical Bayes Threshold (EBT) is applied to the non-linear IMFs to smooth them. Fourth, prediction models for the denoised and decomposed IMFs are built with the Support Vector Machine (SVM). Finally, the ensemble prediction is obtained by summing the predicted IMFs. The proposed model is demonstrated using daily river inflow data from four river stations of the Indus River Basin (IRB) system, which is the largest water system in Pakistan. To illustrate the advantage of the proposed approach, the SSA-VMD-EBT-SVM hybrid model was compared with SSA-VMD-SVM, VMD-SVM, the Empirical Mode Decomposition (EMD) based models EMD-SVM and SSA-EMD-SVM, and the Ensemble EMD (EEMD) based models EEMD-SVM and SSA-EEMD-SVM. We found that the proposed hybrid SSA-VMD-EBT-SVM model outperformed the others on the following performance measures: the Nash-Sutcliffe Efficiency (NSE), Mean Absolute Percentage Error (MAPE) and Root Mean Square Error (RMSE). Therefore, the SSA-VMD-EBT-SVM model can be used for water resources management and power-generating systems that rely on non-linear time series data.
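The final ensemble step and the three performance measures named in the abstract can be illustrated with a minimal Python sketch. The function names and the toy numbers are assumptions made for illustration and do not come from the paper. The metric formulas are the standard textbook definitions, which may differ in detail from the authors' implementation.

```python
import math
from typing import List, Sequence


def ensemble_forecast(component_forecasts: Sequence[Sequence[float]]) -> List[float]:
    """Sum per-component (e.g. per-IMF) forecasts element-wise."""
    lengths = {len(c) for c in component_forecasts}
    if len(lengths) != 1:
        raise ValueError("all component forecasts must have the same length")
    return [sum(values) for values in zip(*component_forecasts)]


def rmse(observed: Sequence[float], predicted: Sequence[float]) -> float:
    """Root Mean Square Error."""
    n = len(observed)
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)


def mape(observed: Sequence[float], predicted: Sequence[float]) -> float:
    """Mean Absolute Percentage Error, in percent (undefined if any observation is 0)."""
    n = len(observed)
    return 100.0 * sum(abs((o - p) / o) for o, p in zip(observed, predicted)) / n


def nse(observed: Sequence[float], predicted: Sequence[float]) -> float:
    """Nash-Sutcliffe Efficiency: 1 is a perfect fit, 0 matches the observed mean."""
    mean_obs = sum(observed) / len(observed)
    residual = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    variance = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - residual / variance


if __name__ == "__main__":
    # Toy values only (hypothetical): two component forecasts and four observations.
    observed = [10.0, 20.0, 30.0, 40.0]
    components = [[8.0, 15.0, 26.0, 33.0], [3.0, 4.0, 5.0, 6.0]]
    forecast = ensemble_forecast(components)
    print("forecast:", forecast)
    print("RMSE:", rmse(observed, forecast))
    print("MAPE (%):", mape(observed, forecast))
    print("NSE:", nse(observed, forecast))
```

Lower RMSE and MAPE and an NSE closer to 1 indicate a better fit, which is the sense in which one hybrid model can be said to outperform another.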
Pages: 25
Related papers
50 in total
  • [21] Visualization Techniques on the Examination Timetabling Pre-processing Data
    Thomas, J. Joshua
    Khader, Ahamad Tajudin
    Belaton, Bahari
    PROCEEDINGS OF THE 2009 SIXTH INTERNATIONAL CONFERENCE ON COMPUTER GRAPHICS, IMAGING AND VISUALIZATION, 2009, : 454 - 458
  • [22] Data Pre-processing Techniques for Publication Performance Analysis
    Zulkepli, Fatin Shahirah
    Ibrahin, Roliana
    Saeed, Faisal
    RECENT TRENDS IN INFORMATION AND COMMUNICATION TECHNOLOGY, 2018, 5 : 59 - 65
  • [23] Survey of Pre-processing Techniques for Mining Big Data
    Hariharakrishnan, Jayaram
    Mohanavalli, S.
    Srividya
    Kumar, Sundhara K. B.
    2017 INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION AND SIGNAL PROCESSING (ICCCSP), 2017, : 77 - 81
  • [24] Data Pre-Processing by Genetic Algorithms for Bankruptcy Prediction
    Tsai, Chih-Fong
    Chou, Jui-Sheng
    2011 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM), 2011, : 1780 - 1783
  • [25] Quantifying the Uncertainties in Data-Driven Models for Reservoir Inflow Prediction
    Zhang, Xiaoli
    Wang, Haixia
    Peng, Anbang
    Wang, Wenchuan
    Li, Baojian
    Huang, Xudong
    WATER RESOURCES MANAGEMENT, 2020, 34 (04) : 1479 - 1493
  • [26] Spatiotemporal analysis and prediction of water quality in Pearl River, China, using multivariate statistical techniques and data-driven model
    HaoNan Ding
    Xiaojun Niu
    Dongqing Zhang
    Mengyu Lv
    Yang Zhang
    Zhang Lin
    Mingli Fu
    Environmental Science and Pollution Research, 2023, 30 : 63036 - 63051
  • [27] Quantifying the Uncertainties in Data-Driven Models for Reservoir Inflow Prediction
    Xiaoli Zhang
    Haixia Wang
    Anbang Peng
    Wenchuan Wang
    Baojian Li
    Xudong Huang
    Water Resources Management, 2020, 34 : 1479 - 1493
  • [28] Spatiotemporal analysis and prediction of water quality in Pearl River, China, using multivariate statistical techniques and data-driven model
    Ding, HaoNan
    Niu, Xiaojun
    Zhang, Dongqing
    Lv, Mengyu
    Zhang, Yang
    Lin, Zhang
    Fu, Mingli
    ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2023, 30 (22) : 63036 - 63051
  • [29] EVALUATION OF THE IMPACT OF THE PRE-PROCESSING OF DATA ON THE EFFECTIVENESS AND ACCURACY OF SVM
    Cisty, Milan
    Bezak, Juraj
    Bajtek, Zbynek
    GEOCONFERENCE ON WATER RESOURCES, FOREST, MARINE AND OCEAN ECOSYSTEMS, 2013, : 141 - 147
  • [30] A survey on pre-processing and post-processing techniques in data mining
    Tomar, Divya
    Agarwal, Sonali
    International Journal of Database Theory and Application, 2014, 7 (04): : 99 - 128