Hybrid Machine Learning Ensemble Techniques for Modeling Dissolved Oxygen Concentration

被引:57
|
作者
Abba, Sani Isah [1 ]
Linh, Nguyen Thi Thuy [2 ,3 ]
Abdullahi, Jazuli [4 ]
Ali, Shaban Ismael Albrka [4 ,5 ]
Pham, Quoc Bao [6 ,7 ]
Abdulkadir, Rabiu Aliyu [8 ]
Costache, Romulus [9 ,10 ]
Nam, Van Thai [11 ]
Anh, Duong Tran [11 ]
机构
[1] Yusuf Maitama Sule Univ, Dept Phys Planning Dev, Kano 700221, Nigeria
[2] Duy Tan Univ, Inst Res & Dev, Danang 550000, Vietnam
[3] Duy Tan Univ, Fac Environm & Chem Engn, Danang 550000, Vietnam
[4] Near East Univ, Dept Civil Engn, Fac Engn, CY-99138 Nicosia, Cyprus
[5] Univ Kebangsaan Malaysia, Sustainable Urban Transport Res Ctr SUTRA, Bangi 43600, Malaysia
[6] Ton Duc Thang Univ, Environm Qual Atmospher Sci & Climate Change Res, Ho Chi Minh City 700000, Vietnam
[7] Ton Duc Thang Univ, Fac Environm & Labour Safety, Ho Chi Minh City 700000, Vietnam
[8] Kano Univ Sci & Technol, Dept Elect Engn, Wudil 713101, Nigeria
[9] Univ Bucharest, Res Inst, Bucharest 050663, Romania
[10] Natl Inst Hydrol & Water Management, Bucharest 013686, Romania
[11] Ho Chi Minh City Univ Technol HUTECH, Ho Chi Minh City 700000, Vietnam
关键词
Predictive models; Biological system modeling; Rivers; Neural networks; Adaptation models; Data models; Dissolved oxygen; artificial intelligence; ensemble techniques; hybrid random forest ensemble; Kinta River; ARTIFICIAL NEURAL-NETWORK; WATER-QUALITY PARAMETERS; TIME-SERIES; PREDICTION; RIVER; INTELLIGENCE; REGRESSION; PERFORMANCE; RELIABILITY; STATION;
D O I
10.1109/ACCESS.2020.3017743
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The reliable prediction of dissolved oxygen concentration (DO) is significantly crucial for protecting the health of the aquatic ecosystem. The current research employed four different single AI-based models, namely long short-term memory neural network (LSTM), extreme learning machine (ELM), Hammerstein-Weiner (HW) and general regression neural network (GRNN) for modeling the DO concentration of Kinta River, Malaysia using available water quality (WQ) parameters. Afterwards, the first scenario used four different ensemble techniques (ET). Two linear, i.e. simple averaging ensemble (SAE) and weighted averaging ensemble (WAE) and two nonlinear namely; backpropagation neural network ensemble (BPNN-E) and HW ensemble (HW-E). The second scenario employed a hybrid random forest (RF) ensemble in order to enhance the prediction accuracy of the single models. The WQ parameters were subjected to a different pre-analysis test to ascertain their stability. The four-model combinations are generated using the nonlinear sensitivity input selection approach. The modeling performance was assessed using the statistical measures of Nash-Sutcliffe coefficient efficiency (NSE), Willmott's index of agreement (WI), root mean square error (RMSE), mean absolute error (MAE) and mean square error (MSE) and correlation coefficient (CC). The results of the single AI-based models demonstrated that HW (M3) served as the best model for predicting DO concentration. For ensemble results, BPNN-E (WI = 0.9764) was superior to the other three ET with average decreased of more than 2% with regards to MAE. Investigation on the hybrid RF ensemble demonstrated the reliable accuracy for all the hybrid models with better predictive skill shown by the HW-RF (CC = 0.981) ensemble. The overall results verified the promising impact of HW-M3, ET and hybrid RF ensemble for the prediction of the DO concentration in the Kinta River, Malaysia.
引用
收藏
页码:157218 / 157237
页数:20
相关论文
共 50 条
  • [31] Machine Learning-based Dissolved Oxygen Prediction Modeling and Evaluation in the Yangtze River Estuary
    Li, Xiao-Ying
    Wang, Hua
    Wang, Yi-Qing
    Zhang, Liang-Jing
    Wu, Yi
    [J]. Huanjing Kexue/Environmental Science, 2024, 45 (12): : 7123 - 7133
  • [32] A Heterogeneous Ensemble Network Using Machine Learning Techniques
    Rashid, Tarik
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2009, 9 (08): : 335 - 339
  • [33] CLASSIFICATION OF DIABETES USING ENSEMBLE MACHINE LEARNING TECHNIQUES
    Ashisha, G.R.
    Mary, X Anitha
    Raja, J Mahimai
    [J]. Scalable Computing, 2024, 25 (04): : 3172 - 3180
  • [34] Game State Prediction with Ensemble of Machine Learning Techniques
    Woh, Sange-Myeong
    Lee, Jee-Hyong
    [J]. 2018 JOINT 10TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS (SCIS) AND 19TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (ISIS), 2018, : 89 - 92
  • [35] A Hybrid Model for the Prediction of Air Pollutants Concentration, Based on Statistical and Machine Learning Techniques
    Minutti-Martinez, Carlos
    Arellano-Vazquez, Magali
    Zamora-Machado, Marlene
    [J]. ADVANCES IN SOFT COMPUTING (MICAI 2021), PT II, 2021, 13068 : 252 - 264
  • [36] Drug design by machine learning: Ensemble learning for QSAR modeling
    Liu, Y
    [J]. ICMLA 2005: FOURTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2005, : 187 - 193
  • [37] Machine learning techniques for mortality modeling
    Deprez P.
    Shevchenko P.V.
    Wüthrich M.V.
    [J]. European Actuarial Journal, 2017, 7 (2) : 337 - 352
  • [38] Feedforward neural network modeling and control for dissolved oxygen concentration
    Han, Guang
    Qiao, Jun-Fei
    Bo, Ying-Chun
    [J]. Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2013, 30 (05): : 585 - 591
  • [39] Hybrid Ensemble Machine Learning for Complex and Dynamic Data
    Krawczyk, Bartosz
    Trawinski, Bogdan
    [J]. NEW GENERATION COMPUTING, 2015, 33 (04) : 341 - 344
  • [40] Special Issue on Hybrid and Ensemble Methods in Machine Learning
    Oscar Cordón
    Przemysław Kazienko
    Bogdan Trawiński
    [J]. New Generation Computing, 2011, 29 : 241 - 244