An Approach for Predicting River Water Quality Using Data Mining Technique

Cited by: 2
Authors
Gulyani, Bharat B. [1 ]
Mangai, J. Alamelu [1 ]
Fathima, Arshia [2 ]
Affiliations
[1] BITS Pilani, Dubai Int Acad City, U Arab Emirates
[2] Univ Calif Berkeley, Berkeley, CA 94720 USA
Keywords
Biochemical oxygen demand (BOD); Data mining; Support vector machines; Multiple regression; Correlation coefficient;
DOI
10.1007/978-3-319-20910-4_17
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Water contains many chemical, physical, and biological impurities, some benign and others toxic. Water quality is defined in terms of physical, chemical, and biological parameters, and it must be ascertained before the water is put to its intended use, whether potable, agricultural, or industrial. Various analytical methods are used to determine water quality parameters such as DO, COD, BOD, pH, TDS, salinity, chlorophyll-a, and coliform count, as well as organic contaminants such as pesticides. The list of potential water contaminants is too long to test for in its entirety, and such testing can be costly and time-consuming. This paper applies a data mining technique to build a model that predicts a widely used gross water quality parameter, biochemical oxygen demand (BOD). BOD is a measure of the amount of dissolved oxygen consumed by microbial oxidation of organic matter in wastewater. The standard method for measuring BOD is a 5-day process. Correct results require sample dilution, constant pH and nutrient content, a temperature of 20 °C, and storage in the dark. High levels of nitrogen compounds yield false BOD results. Winkler titration, which is also used to measure BOD, is a chemical-intensive process. An automatic prediction model for BOD has therefore been sought for accurate, cost-effective, and time-saving measurement. Based on available BOD measurement data, this paper describes the development of a BOD prediction model using a data mining technique, namely support vector machines (SVM). The model obtained a correlation coefficient of 0.9471 and an RMSE of 0.5019 on river water quality data. Its performance was also compared with that of two other models, an artificial neural network (ANN) and regression by discretization. Simulation results show that the proposed model performs better than the other two in terms of both correlation coefficient and RMSE.
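The two evaluation metrics named in the abstract, the correlation coefficient and RMSE, can be illustrated with a minimal sketch. This assumes the correlation coefficient is the Pearson coefficient between measured and predicted values, which the abstract does not state explicitly. The function names and the sample BOD values are hypothetical and are not taken from the paper.

```python
import math


def pearson_r(actual, predicted):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(actual)
    mean_a = sum(actual) / n
    mean_p = sum(predicted) / n
    # Sum of products of deviations from the means (unnormalized covariance)
    cov = sum((a - mean_a) * (p - mean_p) for a, p in zip(actual, predicted))
    # Sums of squared deviations (unnormalized variances)
    var_a = sum((a - mean_a) ** 2 for a in actual)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov / math.sqrt(var_a * var_p)


def rmse(actual, predicted):
    """Root mean squared error between two equal-length sequences."""
    n = len(actual)
    return math.sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / n)


# Hypothetical BOD values in mg/L, made up for illustration only
measured_bod = [2.0, 3.0, 4.0, 5.0]
predicted_bod = [2.5, 2.5, 4.5, 4.5]

r_value = pearson_r(measured_bod, predicted_bod)
rmse_value = rmse(measured_bod, predicted_bod)
print(f"correlation coefficient = {r_value:.4f}, RMSE = {rmse_value:.4f}")
```

A correlation coefficient closer to 1 and an RMSE closer to 0 indicate better agreement between predicted and measured BOD, which is the sense in which the abstract ranks the three models.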
Pages: 233-243
Number of pages: 11
Related papers
50 in total
  • [21] Predicting Academic Performance of Students Using a Hybrid Data Mining Approach
    Francis, Bindhia K.
    Babu, Suvanam Sasidhar
    [J]. JOURNAL OF MEDICAL SYSTEMS, 2019, 43 (06)
  • [23] Statistical and Data Mining Techniques for Understanding Water Quality Profiles in a Mining-Affected River Basin
    Simmonds, Jose
    Gomez, Juan A.
    Ledezma, Agapito
    [J]. INTERNATIONAL JOURNAL OF AGRICULTURAL AND ENVIRONMENTAL INFORMATION SYSTEMS, 2018, 9 (02) : 1 - 19
  • [24] Effects of Mining Activities on River Water Quality
    Richter, Pavel
    Pecharova, Emilie
    [J]. POLISH JOURNAL OF ENVIRONMENTAL STUDIES, 2013, 22 (04) : 1269 - 1276
  • [25] Quality Data for Data Mining and Data Mining for Quality Data: A Constraint Based Approach in XML
    Shahriar, Md. Sumon
    Anam, Sarawat
    [J]. 2008 SECOND INTERNATIONAL CONFERENCE ON FUTURE GENERATION COMMUNICATION AND NETWORKING SYMPOSIA, VOLS 1-5, PROCEEDINGS, 2008, : 142 - +
  • [26] A DIFFERENT APPROACH TO THE MONITORING OF THE QUALITY OF DRINKING WATER WITH DATA MINING TOOLS
    Camur, Derya
    Altin, Ahmet
    Topbas, Murat
    Ilter, Huseyin
    [J]. FRESENIUS ENVIRONMENTAL BULLETIN, 2022, 31 (1A) : 1188 - 1200
  • [27] Predicting assembly quality of complex structures using data mining - Predicting with decision tree algorithm
    Ponomareva, Ekaterina S.
    Wang, Kesheng
    Lien, Terje K.
    [J]. Knowledge Enterprise: Intelligent Strategies in Product Design, Manufacturing, and Management, 2006, 207 : 263 - 268
  • [28] Power Quality Data Mining Using Hybrid Feature Extraction Technique
    Sivaramakrishnan, Vidhya
    Mahadevan, Balaji
    Vijayarajan, Kamaraj
    [J]. SMART SENSORS MEASUREMENT AND INSTRUMENTATION, CISCON 2021, 2023, 957 : 491 - 502
  • [29] Predicting Career Using Data Mining
    Arafath, Md. Yeasin
    Saifuzzaman, Mohd.
    Ahmed, Sumaiya
    Hossain, Syed Akhter
    [J]. 2018 INTERNATIONAL CONFERENCE ON COMPUTING, POWER AND COMMUNICATION TECHNOLOGIES (GUCON), 2018, : 889 - 894
  • [30] Groundwater quality assessment using geospatial technique based water quality index (WQI) approach in a coal mining region of India
    Kumar A.
    Krishna A.P.
    [J]. Arabian Journal of Geosciences, 2021, 14 (12)