Performance of Machine Learning, Artificial Neural Network (ANN), and stacked ensemble models in predicting Water Quality Index (WQI) from surface water quality parameters, climatic and land use data

被引:0
|
作者
Satish, Nagalapalli [1 ]
Anmala, Jagadeesh [1 ]
Varma, Murari R. R. [1 ]
Rajitha, K. [1 ]
机构
[1] Birla Inst Technol & Sci Pilani, Dept Civil Engn, Hyderabad Campus, Hyderabad 500078, Telangana, India
关键词
Water Quality Index; Machine Learning; Stacked Artificial Neural Networks; Land Use and Land Cover; Climatic factors; RIVER; INDICATORS; BASIN; TOOL;
D O I
10.1016/j.psep.2024.10.054
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Assessing water quality is essential for managing freshwater resources, safeguarding ecosystems, and guaranteeing public health. Traditional water quality assessment methods suffer from seasonal sampling, multi- parameter requirements, and labor-intensive sampling processes, which are major constraints for the frequent monitoring of vast river basins. To overcome this issue, the study modeled the remote sensing-based climatic and land use parameters with Principal Component Analysis (PCA) to leverage Artificial Neural Networks (ANN) and machine learning (ML) algorithms to predict the Water Quality Index (WQI). The Weighted Arithmetic Water Quality Index (WAWQI) method was used to calculate the WQI of the Godavari River Basin for the available 19 stream water quality parameters (SWQPs). Further, PCA was applied to reduce the dimensionality of the parameters from 19 to 6. These results led to the development of two modeling methods to predict the WQI. In the first method, the correlation-based model was developed to predict WQI by evaluating six SWQPs. The second method, the causal-effect model, uses land use and meteorological factors to determine WQI using causality. Using advanced AutoML techniques, the initial pool of 40 ML models was meticulously evaluated and refined, culminating in the selection of the top three exemplary models such as Extreme Gradient Boosting (XGB), Extra Trees (ET), and Random Forest (RF). In both methods, XGB models show better prediction, with the coefficient of determination (R2) value of 0.95 during training and 0.83 during testing in method one. Whereas in the second method, R2 of 0.93 in training and 0.80 in testing are obtained. Further, XGB, ET, and ANN outputs were stacked with each model to enhance these results in both methods. Among these three stacked models, the stacked ANN_ML model performed better compared to stacked XGB_ML and stacked ET_ML. In the first method, the stacked ANN_ML model predicts R2 values of 0.95 and 0.91 for training and testing. In the second method, 0.95 and 0.90 for training and testing are obtained using stacked ANN_ML model. These findings emphasize the stacked model prediction ability to capture nonlinear relationships in the parameters and the novel approach of land use and climate parameters based WQI prediction, which replace the laborious, time-consuming SWQP measurements.
引用
收藏
页码:177 / 195
页数:19
相关论文
共 50 条
  • [41] Assessing drinking water quality based on physical, chemical and microbial parameters in the Red Sea State, Sudan using a combination of water quality index and artificial neural network model
    Ismael, Mohamedelfatieh
    Mokhtar, Ali
    Farooq, Muhammad
    Lu, Xin
    GROUNDWATER FOR SUSTAINABLE DEVELOPMENT, 2021, 14
  • [42] Scaling an Artificial Neural Network-Based Water Quality Index Model from Small to Large Catchments
    Aalipour, Mehdi
    St'astny, Bohumil
    Horky, Filip
    Amiri, Bahman Jabbarian
    WATER, 2022, 14 (06)
  • [43] Machine learning method for quick identification of water quality index (WQI) based on Sentinel-2 MSI data: Ebinur Lake case study
    Li, Xiaohang
    Ding, Jianli
    Ilyas, Nurmemet
    WATER SUPPLY, 2021, 21 (03) : 1291 - 1312
  • [44] Quality assessment and prediction of municipal drinking water using water quality index and artificial neural network: A case study of Wuhan, central China, from 2013 to 2019
    Xia, Lu
    Han, Qing
    Shang, Lv
    Wang, Yao
    Li, Xinying
    Zhang, Jia
    Yang, Tingting
    Liu, Junling
    Liu, Li
    SCIENCE OF THE TOTAL ENVIRONMENT, 2022, 844
  • [45] Incorporation of information entropy theory, artificial neural network, and soft computing models in the development of integrated industrial water quality index
    Johnbosco C. Egbueri
    Environmental Monitoring and Assessment, 2022, 194
  • [46] Incorporation of information entropy theory, artificial neural network, and soft computing models in the development of integrated industrial water quality index
    Egbueri, Johnbosco C.
    ENVIRONMENTAL MONITORING AND ASSESSMENT, 2022, 194 (10)
  • [47] EVALUATING THE PERFORMANCE OF MACHINE LEARNING APPROACHES IN PREDICTING ALBANIAN SHKUMBINI RIVER'S WATERS USING WATER QUALITY INDEX MODEL
    Basha, Lule
    Shyti, Bederiana
    Bekteshi, Lirim
    JOURNAL OF ENVIRONMENTAL ENGINEERING AND LANDSCAPE MANAGEMENT, 2024, 32 (02) : 117 - 127
  • [48] The use of feed-forward back propagation and cascade correlation for the neural network prediction of surface water quality parameters
    Moussa S. Elbisy
    Hatem M. Ali
    M. A. Abd-Elall
    Turki M. Alaboud
    Water Resources, 2014, 41 : 709 - 718
  • [49] The Use of Feed-Forward Back Propagation and Cascade Correlation for the Neural Network Prediction of Surface Water Quality Parameters
    Elbisy, Moussa S.
    Ali, Hatem M.
    Abd-Elall, M. A.
    Alaboud, Turki M.
    WATER RESOURCES, 2014, 41 (06) : 709 - 718
  • [50] Use of artificial neural network for predicting effluent quality parameters and enabling wastewater reuse for climate change resilience - A case from Jordan
    Al-Ghazawi, Ziad
    Alawneh, Rami
    JOURNAL OF WATER PROCESS ENGINEERING, 2021, 44