Water quality prediction using machine learning models based on grid search method

被引:24
|
作者
Shams, Mahmoud Y. [1 ]
Elshewey, Ahmed M. [2 ]
El-kenawy, El-Sayed M. [3 ]
Ibrahim, Abdelhameed [4 ]
Talaat, Fatma M. [1 ,5 ]
Tarek, Zahraa [6 ]
机构
[1] Kafrelsheikh Univ, Fac Artificial Intelligence, Kafrelsheikh 33516, Egypt
[2] Suez Univ, Fac Comp & Informat, Comp Sci Dept, Suez, Egypt
[3] Delta Higher Inst Engn & Technol, Dept Commun & Elect, Mansoura 35111, Egypt
[4] Mansoura Univ, Fac Engn, Comp Engn & Control Syst Dept, Mansoura 35516, Egypt
[5] New Mansoura Univ, Fac Comp Sci & Engn, Mansoura 35712, Egypt
[6] Mansoura Univ, Fac Comp & Informat, Comp Sci Dept, Mansoura 35561, Egypt
关键词
Water quality; Machine learning models; Grid search; Water quality index; Water quality classification; RIVER; IDENTIFICATION; NETWORKS; SYSTEM; INDEX;
D O I
10.1007/s11042-023-16737-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Water quality is very dominant for humans, animals, plants, industries, and the environment. In the last decades, the quality of water has been impacted by contamination and pollution. In this paper, the challenge is to anticipate Water Quality Index (WQI) and Water Quality Classification (WQC), such that WQI is a vital indicator for water validity. In this study, parameters optimization and tuning are utilized to improve the accuracy of several machine learning models, where the machine learning techniques are utilized for the process of predicting WQI and WQC. Grid search is a vital method used for optimizing and tuning the parameters for four classification models and also, for optimizing and tuning the parameters for four regression models. Random forest (RF) model, Extreme Gradient Boosting (Xgboost) model, Gradient Boosting (GB) model, and Adaptive Boosting (AdaBoost) model are used as classification models for predicting WQC. K-nearest neighbor (KNN) regressor model, decision tree (DT) regressor model, support vector regressor (SVR) model, and multi-layer perceptron (MLP) regressor model are used as regression models for predicting WQI. In addition, preprocessing step including, data imputation (mean imputation) and data normalization were performed to fit the data and make it convenient for any further processing. The dataset used in this study includes 7 features and 1991 instances. To examine the efficacy of the classification approaches, five assessment metrics were computed: accuracy, recall, precision, Matthews's Correlation Coefficient (MCC), and F1 score. To assess the effectiveness of the regression models, four assessment metrics were computed: Mean Absolute Error (MAE), Median Absolute Error (MedAE), Mean Square Error (MSE), and coefficient of determination (R2). In terms of classification, the testing findings showed that the GB model produced the best results, with an accuracy of 99.50% when predicting WQC values. According to the experimental results, the MLP regressor model outperformed other models in regression and achieved an R2 value of 99.8% while predicting WQI values.
引用
收藏
页码:35307 / 35334
页数:28
相关论文
共 50 条
  • [1] Water quality prediction using machine learning models based on grid search method
    Mahmoud Y. Shams
    Ahmed M. Elshewey
    El-Sayed M. El-kenawy
    Abdelhameed Ibrahim
    Fatma M. Talaat
    Zahraa Tarek
    [J]. Multimedia Tools and Applications, 2024, 83 : 35307 - 35334
  • [2] Prediction of baking quality using machine learning based intelligent models
    Isleroglu, Hilal
    Beyhan, Selami
    [J]. HEAT AND MASS TRANSFER, 2020, 56 (07) : 2045 - 2055
  • [3] Prediction of baking quality using machine learning based intelligent models
    Hilal Isleroglu
    Selami Beyhan
    [J]. Heat and Mass Transfer, 2020, 56 : 2045 - 2055
  • [4] Prediction of irrigation water quality indices based on machine learning and regression models
    Mokhtar, Ali
    Elbeltagi, Ahmed
    Gyasi-Agyei, Yeboah
    Al-Ansari, Nadhir
    Abdel-Fattah, Mohamed K.
    [J]. APPLIED WATER SCIENCE, 2022, 12 (04)
  • [5] Prediction of irrigation water quality indices based on machine learning and regression models
    Ali Mokhtar
    Ahmed Elbeltagi
    Yeboah Gyasi-Agyei
    Nadhir Al-Ansari
    Mohamed K. Abdel-Fattah
    [J]. Applied Water Science, 2022, 12
  • [6] Water quality prediction using machine learning methods
    Haghiabi, Amir Hamzeh
    Nasrolahi, Ali Heidar
    Parsaie, Abbas
    [J]. WATER QUALITY RESEARCH JOURNAL OF CANADA, 2018, 53 (01): : 3 - 13
  • [7] Water quality prediction based on sparse dataset using enhanced machine learning
    Huang, Sheng
    Xia, Jun
    Wang, Yueling
    Lei, Jiarui
    Wang, Gangsheng
    [J]. ENVIRONMENTAL SCIENCE AND ECOTECHNOLOGY, 2024, 20
  • [8] Air Quality Prediction System Using Machine Learning Models
    Chaturvedi, Pooja
    [J]. WATER AIR AND SOIL POLLUTION, 2024, 235 (09):
  • [9] Grid search in hyperparameter optimization of machine learning models for prediction of HIV/AIDS test results
    Belete, Daniel Mesafint
    Huchaiah, Manjaiah D.
    [J]. International Journal of Computers and Applications, 2022, 44 (09) : 875 - 886
  • [10] Efficient water quality prediction models based on machine learning algorithms for Nainital Lake, Uttarakhand
    Koranga, Manisha
    Pant, Pushpa
    Kumar, Tarun
    Pant, Durgesh
    Bhatt, Ashutosh Kumar
    Pant, R. P.
    [J]. MATERIALS TODAY-PROCEEDINGS, 2022, 57 : 1706 - 1712