Predicting river water quality index using data mining techniques

Cited by: 0
Authors
Richa Babbar
Sakshi Babbar
Affiliations
[1] Thapar University, Department of Civil Engineering
[2] GD Goenka University, School of Engineering
Keywords
Water quality parameters; Water quality index; Overall Index of Pollution; k-fold cross-validation; Data mining classifiers
DOI: not available
Abstract
This paper demonstrates the application of data mining techniques to predict the river water quality index. The usefulness of these techniques lies in the automated extraction of novel knowledge from the data to improve decision-making. Popular classification techniques, namely k-nearest neighbor, decision trees, Naive Bayes, artificial neural networks, rule-based classifiers and support vector machines, were used to develop a predictive environment that classifies water quality into understandable terms based on the Overall Index of Pollution. Experimentation was conducted on two types of data sets: synthetic and real. A repeated k-fold cross-validation procedure was followed to design the learning and testing frameworks of the predictive environment. Based on the validation results, the error rates in identifying the true water quality class on the synthetic and real data sets, respectively, were 20% and 28% for k-nearest neighbor, 11% and 24% for Naive Bayes, 1% and 38% for the artificial neural network, and 10% and 20% for the rule-based classifier. The decision tree and support vector machine classifiers were found to be the best predictive models, with 0% error rates during automated extraction of the water quality class. This study reveals that data mining techniques have the potential to quickly predict water quality class, provided that the data given are a true representation of the domain knowledge.
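The repeated k-fold cross-validation procedure named in the abstract can be illustrated with a minimal sketch in plain Python. This is not the authors' code or data: the toy generator `make_toy_data`, its two features, the labelling rule, the class names "good" and "poor", and the choices `k=5`, `repeats=3` and three neighbors are all assumptions made for illustration, and a simple k-nearest-neighbor vote stands in for the classifiers the paper compares.

```python
import random
from collections import Counter


def repeated_kfold(n_samples, k, repeats, seed=0):
    """Yield (train_idx, test_idx) for `repeats` independently shuffled k-fold splits."""
    rng = random.Random(seed)
    for _ in range(repeats):
        idx = list(range(n_samples))
        rng.shuffle(idx)
        # Deal the shuffled indices into k folds of near-equal size.
        folds = [idx[i::k] for i in range(k)]
        for i in range(k):
            test = folds[i]
            train = [j for f in folds[:i] + folds[i + 1:] for j in f]
            yield train, test


def knn_predict(train_X, train_y, x, n_neighbors=3):
    """Majority vote among the n_neighbors closest training rows (squared Euclidean)."""
    dists = sorted(
        (sum((a - b) ** 2 for a, b in zip(row, x)), label)
        for row, label in zip(train_X, train_y)
    )
    votes = Counter(label for _, label in dists[:n_neighbors])
    return votes.most_common(1)[0][0]


def make_toy_data(n, seed=0):
    """Hypothetical two-feature data; the labelling rule is invented, not the OIP."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        f1 = rng.uniform(0, 10)  # placeholder feature 1
        f2 = rng.uniform(0, 30)  # placeholder feature 2
        X.append((f1, f2))
        y.append("good" if f2 < 2 * f1 else "poor")
    return X, y


def cv_error_rate(X, y, k=5, repeats=3, seed=0):
    """Fraction of held-out samples misclassified, pooled over all folds and repeats."""
    errors = total = 0
    for train, test in repeated_kfold(len(X), k, repeats, seed):
        train_X = [X[i] for i in train]
        train_y = [y[i] for i in train]
        for i in test:
            errors += knn_predict(train_X, train_y, X[i]) != y[i]
            total += 1
    return errors / total


if __name__ == "__main__":
    X, y = make_toy_data(120)
    print(f"pooled error rate: {cv_error_rate(X, y):.3f}")
```

Within each repeat, every sample is held out exactly once, so the pooled error rate averages over `repeats` full passes through the data. Repeating with fresh shuffles reduces the dependence of the estimate on any single partition.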