Performance of Machine Learning, Artificial Neural Network (ANN), and stacked ensemble models in predicting Water Quality Index (WQI) from surface water quality parameters, climatic and land use data

被引:0
|
作者
Satish, Nagalapalli [1 ]
Anmala, Jagadeesh [1 ]
Varma, Murari R. R. [1 ]
Rajitha, K. [1 ]
机构
[1] Birla Inst Technol & Sci Pilani, Dept Civil Engn, Hyderabad Campus, Hyderabad 500078, Telangana, India
关键词
Water Quality Index; Machine Learning; Stacked Artificial Neural Networks; Land Use and Land Cover; Climatic factors; RIVER; INDICATORS; BASIN; TOOL;
D O I
10.1016/j.psep.2024.10.054
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Assessing water quality is essential for managing freshwater resources, safeguarding ecosystems, and guaranteeing public health. Traditional water quality assessment methods suffer from seasonal sampling, multi- parameter requirements, and labor-intensive sampling processes, which are major constraints for the frequent monitoring of vast river basins. To overcome this issue, the study modeled the remote sensing-based climatic and land use parameters with Principal Component Analysis (PCA) to leverage Artificial Neural Networks (ANN) and machine learning (ML) algorithms to predict the Water Quality Index (WQI). The Weighted Arithmetic Water Quality Index (WAWQI) method was used to calculate the WQI of the Godavari River Basin for the available 19 stream water quality parameters (SWQPs). Further, PCA was applied to reduce the dimensionality of the parameters from 19 to 6. These results led to the development of two modeling methods to predict the WQI. In the first method, the correlation-based model was developed to predict WQI by evaluating six SWQPs. The second method, the causal-effect model, uses land use and meteorological factors to determine WQI using causality. Using advanced AutoML techniques, the initial pool of 40 ML models was meticulously evaluated and refined, culminating in the selection of the top three exemplary models such as Extreme Gradient Boosting (XGB), Extra Trees (ET), and Random Forest (RF). In both methods, XGB models show better prediction, with the coefficient of determination (R2) value of 0.95 during training and 0.83 during testing in method one. Whereas in the second method, R2 of 0.93 in training and 0.80 in testing are obtained. Further, XGB, ET, and ANN outputs were stacked with each model to enhance these results in both methods. Among these three stacked models, the stacked ANN_ML model performed better compared to stacked XGB_ML and stacked ET_ML. In the first method, the stacked ANN_ML model predicts R2 values of 0.95 and 0.91 for training and testing. In the second method, 0.95 and 0.90 for training and testing are obtained using stacked ANN_ML model. These findings emphasize the stacked model prediction ability to capture nonlinear relationships in the parameters and the novel approach of land use and climate parameters based WQI prediction, which replace the laborious, time-consuming SWQP measurements.
引用
收藏
页码:177 / 195
页数:19
相关论文
共 50 条
  • [21] Performance comparison of various machine learning models for predicting water quality parameters in the Chebika Zone of Central Tunisia
    Abdelhedi, Mohamed
    Gabtni, Hakim
    EARTH SCIENCE INFORMATICS, 2024, 17 (05) : 4245 - 4259
  • [22] Water Quality Assessment with Artificial Neural Network Models: Performance Comparison Between SMN, MLP and PS-ANN Methodologies
    Isik, Hakan
    Akkan, Tamer
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2025, 50 (01) : 369 - 387
  • [23] Use of artificial neural networks for predicting optimal alum doses and treated water quality parameters
    Maier, HR
    Morgan, N
    Chow, CWK
    ENVIRONMENTAL MODELLING & SOFTWARE, 2004, 19 (05) : 485 - 494
  • [24] Using Machine Learning Models for Predicting the Water Quality Index in the La Buong River, Vietnam
    Khoi, Dao Nguyen
    Quan, Nguyen Trong
    Linh, Do Quang
    Nhi, Pham Thi Thao
    Thuy, Nguyen Thi Diem
    WATER, 2022, 14 (10)
  • [25] Comparative analysis of surface water quality prediction performance and identification of key water parameters using different machine learning models based on big data
    Chen, Kangyang
    Chen, Hexia
    Zhou, Chuanlong
    Huang, Yichao
    Qi, Xiangyang
    Shen, Ruqin
    Liu, Fengrui
    Zuo, Min
    Zou, Xinyi
    Wang, Jinfeng
    Zhang, Yan
    Chen, Da
    Chen, Xingguo
    Deng, Yongfeng
    Ren, Hongqiang
    WATER RESEARCH, 2020, 171 (171)
  • [26] Application of an unsupervised artificial neural network technique to multivariant surface water quality data
    Cinar, Oezer
    Merdun, Hasan
    ECOLOGICAL RESEARCH, 2009, 24 (01) : 163 - 173
  • [27] Predicting the Quality of Treated Water from an Activated Carbon Based Water Purifier Using Artificial Neural Network
    Roy, Ritabrata
    ASIAN JOURNAL OF WATER ENVIRONMENT AND POLLUTION, 2019, 16 (03) : 125 - 128
  • [28] Linear regression and artificial neural network models for predicting abrasive water jet marble drilling quality
    Hammouda, Mouna
    Ghienne, Martin
    Dion, Jean-Luc
    Ben Yahia, Noureddine
    ADVANCES IN MECHANICAL ENGINEERING, 2022, 14 (09)
  • [29] Comparative analysis of machine learning models for predicting water quality index in Dhaka's rivers of Bangladesh
    Nishat, Mosaraf Hosan
    Khan, Md. Habibur Rahman Bejoy
    Ahmed, Tahmeed
    Hossain, Syed Nahin
    Ahsan, Amimul
    El-Sergany, M. M.
    Shafiquzzaman, Md.
    Imteaz, Monzur Alam
    Alresheedi, Mohammad T.
    ENVIRONMENTAL SCIENCES EUROPE, 2025, 37 (01)
  • [30] APPLICATION OF ARTIFICIAL NEURAL NETWORK TECHNIQUES FOR PREDICTING THE WATER QUALITY INDEX IN THE PARAKAI LAKE, TAMIL NADU, INDIA
    Vasanthi, Sahaya S.
    Kumar, Adish S.
    APPLIED ECOLOGY AND ENVIRONMENTAL RESEARCH, 2019, 17 (02): : 1947 - 1958