Improving prediction of water quality indices using novel hybrid machine -learning algorithms

被引:213
|
作者
Duie Tien Bui [1 ,2 ]
Khosravi, Khabat [3 ]
Tiefenbacher, John [4 ]
Nguyen, Hoang [5 ]
Kazakis, Nerantzis [6 ]
机构
[1] Ton Duc Thang Univ, Geog Informat Sci Res Grp, Ho Chi Minh City, Vietnam
[2] Ton Duc Thang Univ, Fac Environm & Labour Safety, Ho Chi Minh City, Vietnam
[3] Univ Guelph, Sch Engn, Guelph, ON, Canada
[4] Texas State Univ, Dept Geog, San Marcos, TX 78666 USA
[5] Duy Tan Univ, Inst Res & Dev, Da Nang 550000, Vietnam
[6] Aristotle Univ Thessaloniki, Dept Geol, Lab Engn Geol & Hydrogeol, Thessaloniki 54124, Greece
关键词
Water quality index; Prediction; Data mining; Novel hybrid algorithms; SCATTER PLOT; MODELS; RIVER; SUPPORT; STREAMFLOW; INFERENCE;
D O I
10.1016/j.scitotenv.2020.137612
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
River water quality assessment is one of the most important tasks to enhance water resources management plans. A water quality index (WQI) considers several water quality variables simultaneously. Traditionally WQI calculations consume time and are often fraught with errors during derivations of sub-indices. In this study, 4 standalone (random forest (RF), M5P, random tree (RT), and reduced error pruning tree (REPT)) and 12 hybrid data-mining algorithms (combinations of standalones with bagging (BA), CV parameter selection (CVPS) and randomizable filtered classification (RFC)) were used to create Iran WQI (IRWQI(sc)) predictions. Six years (2012 to 2018) ofmonthly data fromtwowater quality monitoring stationswithin the Talar catchmentwere compiled. Using Pearson correlation coefficients, 10 different input combinations were constructed. The data were divided into two groups (ratio 70:30) for model building (training dataset) andmodel validation (testing dataset) using a 10-fold cross-validation technique. The models were evaluated using several statistical and visual evaluation metrics. Result show that fecal coliform (FC) and total solids (TS) had the greatest and least effect on the prediction of IRWQIsc. The best input combinations varied among the algorithms; generally variables with very lowcorrelations displayed weaker performance. Hybrid algorithms improved the prediction power of several of the standalone models, but not all. Hybrid BA-RT outperformed the other models (R-2 = 0.941, RMSE = 2.71, MAE = 1.87, NSE = 0.941, PBIAS = 0.500). PBIAS indicated that all algorithms, with the exceptions of RT, BART and CVPS-REPT, overestimated WQI values. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:15
相关论文
共 50 条
  • [2] Prediction of groundwater quality indices using machine learning algorithms
    Raheja, Hemant
    Goel, Arun
    Pal, Mahesh
    [J]. WATER PRACTICE AND TECHNOLOGY, 2022, 17 (01) : 336 - 351
  • [3] Improving river water quality prediction with hybrid machine learning and temporal analysis
    del Castillo, Alberto Fernandez
    Garibay, Marycarmen Verduzco
    Diaz-Vazquez, Diego
    Yebra-Montes, Carlos
    Brown, Lee E.
    Johnson, Andrew
    Garcia-Gonzalez, Alejandro
    Gradilla-Hernandez, Misael Sebastian
    [J]. ECOLOGICAL INFORMATICS, 2024, 82
  • [4] Water Quality Index (WQI) Prediction Using Machine Learning Algorithms
    Kularbphettong, Kunyanuth
    Waraporn, Phanu
    Raksuntorn, Nareenart
    Vivhivanives, Rujijan
    Sangsuwon, Chanyapat
    Boonseng, Chongrag
    [J]. 2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023, 2023, : 383 - 387
  • [5] Machine learning algorithms for efficient water quality prediction
    Azrour, Mourade
    Mabrouki, Jamal
    Fattah, Ghizlane
    Guezzaz, Azedine
    Aziz, Faissal
    [J]. MODELING EARTH SYSTEMS AND ENVIRONMENT, 2022, 8 (02) : 2793 - 2801
  • [6] Machine learning algorithms for efficient water quality prediction
    Mourade Azrour
    Jamal Mabrouki
    Ghizlane Fattah
    Azedine Guezzaz
    Faissal Aziz
    [J]. Modeling Earth Systems and Environment, 2022, 8 : 2793 - 2801
  • [7] Prediction on water quality of a lake in Chennai, India using machine learning algorithms
    Prasad, D. Venkata Vara
    Venkataramana, Lokeswari Y.
    Kumar, P. Senthil
    Prasannamedha, G.
    Soumya, K.
    Poornema, A. J.
    [J]. DESALINATION AND WATER TREATMENT, 2021, 218 : 44 - 51
  • [8] Efficient Prediction of Water Quality Index (WQI) Using Machine Learning Algorithms
    Md. Mehedi Hassan
    Md. Mahedi Hassan
    Laboni Akter
    Md. Mushfiqur Rahman
    Sadika Zaman
    Khan Md. Hasib
    Nusrat Jahan
    Raisun Nasa Smrity
    Jerin Farhana
    M. Raihan
    Swarnali Mollick
    [J]. Human-Centric Intelligent Systems, 2021, 1 (3-4): : 86 - 97
  • [9] Groundwater Quality Assessment and Irrigation Water Quality Index Prediction Using Machine Learning Algorithms
    Hussein, Enas E.
    Derdour, Abdessamed
    Zerouali, Bilel
    Almaliki, Abdulrazak
    Wong, Yong Jie
    los Santos, Manuel Ballesta-de
    Ngoc, Pham Minh
    Hashim, Mofreh A.
    Elbeltagi, Ahmed
    [J]. WATER, 2024, 16 (02)
  • [10] Identifying the Most Discriminative Parameter for Water Quality Prediction Using Machine Learning Algorithms
    Chatterjee, Tapan
    Gogoi, Usha Rani
    Samanta, Animesh
    Chatterjee, Ayan
    Singh, Mritunjay Kumar
    Pasupuleti, Srinivas
    [J]. WATER, 2024, 16 (03)