Comparative evaluation of machine learning models for groundwater quality assessment

Cited by: 66
Authors
Bedi, Shine [1]
Samal, Ashok [1]
Ray, Chittaranjan [2]
Snow, Daniel [3]
Affiliations
[1] Univ Nebraska, Comp Sci & Engn, Lincoln, NE 68588 USA
[2] Univ Nebraska, Nebraska Water Ctr, Lincoln, NE USA
[3] Univ Nebraska, Water Sci Lab, Lincoln, NE USA
Keywords
Artificial neural networks (ANN); Support vector machines (SVM); XGBoost; Data imbalance; Feature importance; Groundwater quality; ARTIFICIAL NEURAL-NETWORKS; SUPPORT VECTOR MACHINES; IMBALANCED DATA; WATER-QUALITY; UNITED-STATES; NITRATE; PREDICTION; CLASSIFICATION; CONTAMINATION; SIMULATION;
DOI
10.1007/s10661-020-08695-3
Chinese Library Classification number
X [Environmental science, safety science];
Discipline classification code
08 ; 0830 ;
Abstract
Contamination from pesticides and nitrate in groundwater is a significant threat to water quality in general and agriculturally intensive regions in particular. Three widely used machine learning models, namely, artificial neural networks (ANN), support vector machines (SVM), and extreme gradient boosting (XGB), were evaluated for their efficacy in predicting contamination levels using sparse data with non-linear relationships. The predictive ability of the models was assessed using a dataset consisting of 303 wells across 12 Midwestern states in the USA. Multiple hydrogeologic, water quality, and land use features were chosen as the independent variables, and classes were based on measured concentration ranges of nitrate and pesticide. This study evaluates the classification performance of the models for two, three, and four class scenarios and compares them with the corresponding regression models. The study also examines the issue of class imbalance and tests the efficacy of three class imbalance mitigation techniques: oversampling, weighting, and oversampling and weighting, for all the scenarios. The models' performance is reported using multiple metrics, both insensitive to class imbalance (accuracy) and sensitive to class imbalance (F1 score and MCC). Finally, the study assesses the importance of features using game-theoretic Shapley values to rank features consistently and offer model interpretability.
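The abstract's distinction between accuracy and the imbalance-aware metrics (F1 score and MCC) can be illustrated with a minimal sketch. The labels below are hypothetical toy data, not the study's 303-well dataset, and the helper names are assumptions introduced only for this illustration; the study's own evaluation code is not shown in this record.

```python
import math

def confusion_counts(y_true, y_pred, positive=1):
    """Return (tp, tn, fp, fn) for a binary problem."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    return tp, tn, fp, fn

def accuracy(tp, tn, fp, fn):
    return (tp + tn) / (tp + tn + fp + fn)

def f1_score(tp, tn, fp, fn):
    # F1 = 2TP / (2TP + FP + FN); defined as 0 when there are no positives at all
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0

def mcc(tp, tn, fp, fn):
    # Matthews correlation coefficient; defined as 0 when the denominator vanishes
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0

# Hypothetical imbalanced labels: 90 "low" wells (0) and 10 "high" wells (1).
y_true = [0] * 90 + [1] * 10

# Predictor A always predicts the majority class.
y_majority = [0] * 100

# Predictor B finds 8 of the 10 positives at the cost of 10 false positives.
y_mixed = [0] * 80 + [1] * 10 + [1] * 8 + [0] * 2

for name, y_pred in [("majority", y_majority), ("mixed", y_mixed)]:
    counts = confusion_counts(y_true, y_pred)
    print(name, round(accuracy(*counts), 3),
          round(f1_score(*counts), 3), round(mcc(*counts), 3))
```

In this toy case the majority-class predictor scores higher on accuracy than the mixed predictor, yet its F1 score and MCC are both zero because it never identifies a minority-class well.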
Pages: 23
Related papers
50 in total
  • [1] Comparative evaluation of machine learning models for groundwater quality assessment
    Shine Bedi
    Ashok Samal
    Chittaranjan Ray
    Daniel Snow
    [J]. Environmental Monitoring and Assessment, 2020, 192
  • [2] Comparative Assessment of Machine Learning Models for Groundwater Quality Prediction Using Various Parameters
    Majid Niazkar
    Reza Piraei
    Mohammad Reza Goodarzi
    Mohammad Javad Abedi
    [J]. Environmental Processes, 2025, 12 (1)
  • [3] Application of machine learning models in groundwater quality assessment and prediction: progress and challenges
    Yanpeng Huang
    Chao Wang
    Yuanhao Wang
    Guangfeng Lyu
    Sijie Lin
    Weijiang Liu
    Haobo Niu
    Qing Hu
    [J]. Frontiers of Environmental Science & Engineering, 2024, 18
  • [4] Application of machine learning models in groundwater quality assessment and prediction: progress and challenges
    Huang, Yanpeng
    Wang, Chao
    Wang, Yuanhao
    Lyu, Guangfeng
    Lin, Sijie
    Liu, Weijiang
    Niu, Haobo
    Hu, Qing
    [J]. FRONTIERS OF ENVIRONMENTAL SCIENCE & ENGINEERING, 2024, 18 (03)
  • [5] Groundwater Quality Assessment Using machine learning
    Mullasseri, Sileesh
    Mishra, Ravi
    Singh, Archana
    Chandra, G. Sharath
    Jhariya, D. C.
    Mishra, Shwetakshi
    Jadav, Ravindra
    Hans, Aradhana L.
    Buch, Khuban
    [J]. CURRENT SCIENCE, 2021, 121 (05) : 606 - 607
  • [6] Evaluation of machine learning algorithms for groundwater quality modeling
    Sahour, Soheil
    Khanbeyki, Matin
    Gholami, Vahid
    Sahour, Hossein
    Kahvazade, Irene
    Karimi, Hadi
    [J]. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2023, 30 (16) : 46004 - 46021
  • [7] Evaluation of machine learning algorithms for groundwater quality modeling
    Soheil Sahour
    Matin Khanbeyki
    Vahid Gholami
    Hossein Sahour
    Irene Kahvazade
    Hadi Karimi
    [J]. Environmental Science and Pollution Research, 2023, 30 : 46004 - 46021
  • [8] Machine Learning For Groundwater Quality Classification: A Step Towards Economic and Sustainable Groundwater Quality Assessment Process
    Zegaar, Aymen
    Ounoki, Samira
    Telli, Abdelmoutia
    [J]. WATER RESOURCES MANAGEMENT, 2024, 38 (02) : 621 - 637
  • [9] Machine Learning For Groundwater Quality Classification: A Step Towards Economic and Sustainable Groundwater Quality Assessment Process
    Aymen Zegaar
    Samira Ounoki
    Abdelmoutia Telli
    [J]. Water Resources Management, 2024, 38 : 621 - 637
  • [10] Comparative Assessment of Individual and Ensemble Machine Learning Models for Efficient Analysis of River Water Quality
    Alqahtani, Abdulaziz
    Shah, Muhammad Izhar
    Aldrees, Ali
    Javed, Muhammad Faisal
    [J]. SUSTAINABILITY, 2022, 14 (03)