Predicting river water quality index using data mining techniques

被引:0
|
作者
Richa Babbar
Sakshi Babbar
机构
[1] Thapar University,Department of Civil Engineering
[2] GD Goenka University,School of Engineering
来源
关键词
Water quality parameters; Water quality index; Overall Index of Pollution; -fold cross-validation; Data mining classifiers;
D O I
暂无
中图分类号
学科分类号
摘要
This paper demonstrates the application of data mining techniques to predict river water quality index. The usefulness of these techniques lies in the automated extraction of novel knowledge from the data to improve decision-making. The popular classification techniques, namely k-nearest neighbor, decision trees, Naive Bayes, artificial neural networks, rule-based and support vector machines were used to develop the predictive environment to classify water quality into understandable terms based on the Overall Index of Pollution. Experimentation was conducted on two types of data sets: synthetic and real. A repeated k-fold cross-validation procedure was followed to design the learning and testing frameworks of the predictive environment. Based on the validation results, it was found that the error rate in defining the true water quality class was 20 and 28%, 11 and 24%, 1 and 38% and 10 and 20% for the k-nearest neighbor, Naive Bayes, artificial neural network and rule-based classifiers for synthetic and real data sets, respectively. The decision tree and support vector machines classifiers were found to be the best predictive models with 0% error rates during automated extraction of the water quality class. This study reveals that data mining techniques have the potential to quickly predict water quality class, provided data given are a true representation of the domain knowledge.
引用
收藏
相关论文
共 50 条
  • [21] Predicting Hypoglycemia in Diabetic Patients Using Data Mining Techniques
    Eljil, Khouloud Safi
    Qadah, Ghassan
    Pasquier, Michel
    [J]. 2013 9TH INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION TECHNOLOGY (IIT), 2013,
  • [22] Predicting School Failure and Dropout by Using Data Mining Techniques
    Marquez-Vera, Carlos
    Romero Morales, Cristobal
    Ventura Soto, Sebastian
    [J]. IEEE REVISTA IBEROAMERICANA DE TECNOLOGIAS DEL APRENDIZAJE-IEEE RITA, 2013, 8 (01): : 7 - 14
  • [23] Software quality prediction using data mining techniques
    Merzah, Bayadaa M.
    [J]. 2019 International Conference on Information and Communications Technology, ICOIACT 2019, 2019, : 394 - 397
  • [24] Assessing the impact of land use and land cover on river water quality using water quality index and remote sensing techniques
    Gani, Md Ataul
    Sajib, Abdul Majed
    Siddik, Md Abubakkor
    Moniruzzaman, Md
    [J]. ENVIRONMENTAL MONITORING AND ASSESSMENT, 2023, 195 (04)
  • [25] Evaluation of the Swat River, Northern Pakistan, water quality using multivariate statistical techniques and water quality index (WQI) model
    Shah Jehan
    Ihsan Ullah
    Sardar Khan
    Said Muhammad
    Seema Anjum Khattak
    Tariq Khan
    [J]. Environmental Science and Pollution Research, 2020, 27 : 38545 - 38558
  • [26] Evaluation of the Swat River, Northern Pakistan, water quality using multivariate statistical techniques and water quality index (WQI) model
    Jehan, Shah
    Ullah, Ihsan
    Khan, Sardar
    Muhammad, Said
    Khattak, Seema Anjum
    Khan, Tariq
    [J]. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2020, 27 (31) : 38545 - 38558
  • [27] Assessing the impact of land use and land cover on river water quality using water quality index and remote sensing techniques
    Md Ataul Gani
    Abdul Majed Sajib
    Md Abubakkor Siddik
    [J]. Environmental Monitoring and Assessment, 2023, 195
  • [28] Surface water quality classification using data mining approaches: Irrigation along the Aladag River
    Sattari, Mohammad Taghi
    Feizi, Hajar
    Colak, Muslume Sevba
    Ozturk, Ahmet
    Ozturk, Fazli
    Apaydin, Halit
    [J]. IRRIGATION AND DRAINAGE, 2021, 70 (05) : 1227 - 1246
  • [29] Induction of Model Trees for Predicting BOD in River Water: A Data Mining Perspective
    Mangai, J. Alamelu
    Gulyani, Bharat B.
    [J]. ADVANCES IN DATA MINING: APPLICATIONS AND THEORETICAL ASPECTS, 2016, 9728 : 1 - 13
  • [30] Measuring data quality of geoscience datasets using data mining techniques
    Cai, Cuo
    Xie, Kunqing
    [J]. Data Science Journal, 2007, 6 (SUPPL.)