Predicting river water quality index using data mining techniques

被引:0
|
作者
Richa Babbar
Sakshi Babbar
机构
[1] Thapar University,Department of Civil Engineering
[2] GD Goenka University,School of Engineering
来源
关键词
Water quality parameters; Water quality index; Overall Index of Pollution; -fold cross-validation; Data mining classifiers;
D O I
暂无
中图分类号
学科分类号
摘要
This paper demonstrates the application of data mining techniques to predict river water quality index. The usefulness of these techniques lies in the automated extraction of novel knowledge from the data to improve decision-making. The popular classification techniques, namely k-nearest neighbor, decision trees, Naive Bayes, artificial neural networks, rule-based and support vector machines were used to develop the predictive environment to classify water quality into understandable terms based on the Overall Index of Pollution. Experimentation was conducted on two types of data sets: synthetic and real. A repeated k-fold cross-validation procedure was followed to design the learning and testing frameworks of the predictive environment. Based on the validation results, it was found that the error rate in defining the true water quality class was 20 and 28%, 11 and 24%, 1 and 38% and 10 and 20% for the k-nearest neighbor, Naive Bayes, artificial neural network and rule-based classifiers for synthetic and real data sets, respectively. The decision tree and support vector machines classifiers were found to be the best predictive models with 0% error rates during automated extraction of the water quality class. This study reveals that data mining techniques have the potential to quickly predict water quality class, provided data given are a true representation of the domain knowledge.
引用
收藏
相关论文
共 50 条
  • [31] Evaluation of Aydughmush River water quality using the National Sanitation Foundation Water Quality Index (NSFWQI), River Pollution Index (RPI), and Forestry Water Quality Index (FWQI)
    Hoseinzadeh, Edris
    Khorsandi, Hassan
    Wei, Chiang
    Alipour, Mahdi
    [J]. DESALINATION AND WATER TREATMENT, 2015, 54 (11) : 2994 - 3002
  • [32] A Review on Predicting Student's Performance using Data Mining Techniques
    Shahiri, Amirah Mohamed
    Husain, Wahidah
    Rashid, Nur'aini Abdul
    [J]. THIRD INFORMATION SYSTEMS INTERNATIONAL CONFERENCE 2015, 2015, 72 : 414 - 422
  • [33] Explaining and predicting workplace accidents using data-mining techniques
    Rivas, T.
    Paz, M.
    Martin, J. E.
    Matias, J. M.
    Garcia, J. F.
    Taboada, J.
    [J]. RELIABILITY ENGINEERING & SYSTEM SAFETY, 2011, 96 (07) : 739 - 747
  • [34] Predicting Serious Outcomes in Syncope Patients Using Data Mining Techniques
    Mansouri, Ardeshir
    Ordikhani, Mohammad
    Abadeh, Mohammad Saniee
    Tajdini, Masih
    [J]. 2019 9TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE 2019), 2019, : 409 - 413
  • [35] Predicting the Course Knowledge Level of Students using Data Mining Techniques
    Parkavi, A.
    Lakshmi, K.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES AND MANAGEMENT FOR COMPUTING, COMMUNICATION, CONTROLS, ENERGY AND MATERIALS (ICSTM), 2017, : 128 - 133
  • [36] Predicting students' performance in English and Mathematics using data mining techniques
    Bin Roslan, Muhammad Haziq
    Chen, Chwen Jen
    [J]. EDUCATION AND INFORMATION TECHNOLOGIES, 2023, 28 (02) : 1427 - 1453
  • [37] Predicting Instructor Performance Using Data Mining Techniques in Higher Education
    Agaoglu, Mustafa
    [J]. IEEE ACCESS, 2016, 4 : 2379 - 2387
  • [38] Predicting Chronic Kidney Failure Disease Using Data Mining Techniques
    Boukenze, Basma
    Haqiq, Abdelkrim
    Mousannif, Hajar
    [J]. ADVANCES IN UBIQUITOUS NETWORKING 2, 2017, 397 : 701 - 712
  • [39] Predicting Learner Performance Using Data-Mining Techniques and Ontology
    Abd El-Rady, Alla
    Shehab, Mohamed
    El Fakharany, Essam
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2016, 2017, 533 : 660 - 669
  • [40] Predicting students’ performance in English and Mathematics using data mining techniques
    Muhammad Haziq Bin Roslan
    Chwen Jen Chen
    [J]. Education and Information Technologies, 2023, 28 : 1427 - 1453