Performance of Machine Learning, Artificial Neural Network (ANN), and stacked ensemble models in predicting Water Quality Index (WQI) from surface water quality parameters, climatic and land use data

Cited by: 0
Authors
Satish, Nagalapalli [1]
Anmala, Jagadeesh [1]
Varma, Murari R. R. [1]
Rajitha, K. [1]
Affiliations
[1] Birla Inst Technol & Sci Pilani, Dept Civil Engn, Hyderabad Campus, Hyderabad 500078, Telangana, India
Keywords
Water Quality Index; Machine Learning; Stacked Artificial Neural Networks; Land Use and Land Cover; Climatic factors; RIVER; INDICATORS; BASIN; TOOL
DOI
10.1016/j.psep.2024.10.054
Chinese Library Classification number
X [Environmental Science, Safety Science]
Subject classification code
08; 0830
Abstract
Assessing water quality is essential for managing freshwater resources, safeguarding ecosystems, and protecting public health. Traditional water quality assessment relies on seasonal sampling, many measured parameters, and labor-intensive sampling, all of which constrain frequent monitoring of large river basins. To address these constraints, this study combines remote sensing-based climatic and land use parameters with Principal Component Analysis (PCA), Artificial Neural Networks (ANN), and machine learning (ML) algorithms to predict the Water Quality Index (WQI). The Weighted Arithmetic Water Quality Index (WAWQI) method was used to calculate the WQI of the Godavari River Basin from the 19 available stream water quality parameters (SWQPs). PCA was then applied to reduce the number of parameters from 19 to 6. These results led to two modeling methods for predicting the WQI. The first, a correlation-based model, predicts WQI from the six SWQPs. The second, a causal-effect model, predicts WQI from land use and meteorological factors. Using AutoML, an initial pool of 40 ML models was evaluated and narrowed to the top three: Extreme Gradient Boosting (XGB), Extra Trees (ET), and Random Forest (RF). In both methods, XGB gave the best predictions of the three, with a coefficient of determination (R²) of 0.95 in training and 0.83 in testing for the first method, and 0.93 in training and 0.80 in testing for the second. To improve on these results, the XGB, ET, and ANN outputs were stacked with each model in both methods. Of the three stacked models, stacked ANN_ML outperformed stacked XGB_ML and stacked ET_ML. In the first method, the stacked ANN_ML model achieved R² values of 0.95 in training and 0.91 in testing; in the second method, it achieved 0.95 and 0.90. These findings highlight the ability of stacked models to capture nonlinear relationships among the parameters, and the novelty of predicting WQI from land use and climate parameters, which can replace laborious, time-consuming SWQP measurements.
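The abstract names the WAWQI method but does not give its formula. A minimal sketch of the commonly cited weighted arithmetic form follows, in which each parameter gets a quality rating q_i = 100 (V_i - V_ideal) / (S_i - V_ideal) and a unit weight w_i = K / S_i with K = 1 / sum(1 / S_i), and WQI = sum(w_i q_i) / sum(w_i). The function name `wawqi`, the three parameters, and the limit and ideal values in `STANDARDS` are illustrative assumptions, not the 19 SWQPs or the standards used in the paper.

```python
from typing import Dict, Tuple

# Hypothetical standards: name -> (permissible limit S_i, ideal value V_ideal).
# These values are placeholders for illustration only.
STANDARDS: Dict[str, Tuple[float, float]] = {
    "pH": (8.5, 7.0),
    "DO": (5.0, 14.6),   # dissolved oxygen, mg/L
    "BOD": (5.0, 0.0),   # biochemical oxygen demand, mg/L
}


def wawqi(measured: Dict[str, float],
          standards: Dict[str, Tuple[float, float]]) -> float:
    """Weighted arithmetic WQI over the parameters present in `measured`."""
    # Proportionality constant K, so the unit weights sum to 1.
    k = 1.0 / sum(1.0 / standards[name][0] for name in measured)
    weighted_sum = 0.0
    weight_total = 0.0
    for name, value in measured.items():
        limit, ideal = standards[name]
        weight = k / limit                                  # unit weight w_i
        rating = 100.0 * (value - ideal) / (limit - ideal)  # quality rating q_i
        weighted_sum += weight * rating
        weight_total += weight
    return weighted_sum / weight_total


if __name__ == "__main__":
    sample = {"pH": 7.8, "DO": 6.2, "BOD": 3.1}  # made-up measurements
    print(f"WQI = {wawqi(sample, STANDARDS):.1f}")
```

Under this form, a sample with every parameter at its ideal value scores 0 and a sample with every parameter at its permissible limit scores 100, so higher values indicate poorer water quality.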
Pages: 177 - 195
Page count: 19