Evaluating statistical model performance in water quality prediction

被引:85
|
作者
Avila, Rodelyn [1 ,2 ]
Horn, Beverley [2 ]
Moriarty, Elaine [2 ]
Hodson, Roger [3 ]
Moltchanova, Elena [1 ]
机构
[1] Univ Canterbury, Sch Math & Stat, Private Bag 4800, Christchurch 8140, New Zealand
[2] ESR, Inst Environm Sci & Res, POB 29181, Christchurch 8540, New Zealand
[3] Environm Southland, Private Bag 90116, Invercargill 9840, New Zealand
关键词
Water quality prediction; E; coli; Statistical models; Bayesian networks; ESCHERICHIA-COLI; SURVIVAL; HEALTH; MPN;
D O I
10.1016/j.jenvman.2017.11.049
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Exposure to contaminated water while swimming or boating or participating in other recreational activities can cause gastrointestinal and respiratory disease. It is not uncommon for water bodies to experience rapid fluctuations in water quality, and it is therefore vital to be able to predict them accurately and in time so as to minimise population's exposure to pathogenic organisms. E. coli is commonly used as an indicator to measure water quality in freshwater, and higher counts of E. coil are associated with increased risk to illness. In this case study, we compare the performance of a wide range of statistical models in prediction of water quality via E. coli levels for the weekly data collected over the summer months from 2006 to 2014 at the recreational site on the Oreti river in Wallacetown, New Zealand. The models include naive model, multiple linear regression, dynamic regression, regression tree, Markov chain, classification tree, random forests, multinomial logistic regression, discriminant analysis and Bayesian network. The results show that Bayesian network was superior to all the other models. Overall, it had a leave-one-out and k-fold cross validation error rate of 21%, while predicting the majority of instances of E. coli levels classified as unsafe by the Microbiological Water Quality Guidelines for Marine and Freshwater Recreational Areas 2003, New Zealand. Because Bayesian networks are also flexible in handling missing data and outliers and allow for continuous updating in real time, we have found them to be a promising tool, and in the future, plan to extend the analysis beyond the current case study site. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:910 / 919
页数:10
相关论文
共 50 条
  • [11] Evaluating the performance of a multispecies statistical catch-at-age model
    Curti, Kiersten L.
    Collie, Jeremy S.
    Legault, Christopher M.
    Link, Jason S.
    CANADIAN JOURNAL OF FISHERIES AND AQUATIC SCIENCES, 2013, 70 (03) : 470 - 484
  • [12] A hybrid regression model for water quality prediction
    Chakraborty, Tanujit
    Chakraborty, Ashis Kumar
    Mansoor, Zubia
    OPSEARCH, 2019, 56 (04) : 1167 - 1178
  • [13] A hybrid regression model for water quality prediction
    Tanujit Chakraborty
    Ashis Kumar Chakraborty
    Zubia Mansoor
    OPSEARCH, 2019, 56 : 1167 - 1178
  • [14] Role of statistical remote sensing for Inland water quality parameters prediction
    Abdelmalik, K. W.
    EGYPTIAN JOURNAL OF REMOTE SENSING AND SPACE SCIENCES, 2018, 21 (02): : 193 - 200
  • [15] Making and Evaluating a Statistical Prediction Model for the Absolute Risk of Prostate Cancer Recurrence
    Kattan, Michael W.
    Gerds, Thomas A.
    CANCER, 2011, 117 (22) : 5026 - 5028
  • [16] Evaluating Surface Water Quality in a Coastal Province of Vietnamese Mekong Delta Using Water Quality Index and Statistical Methods
    Giao, Nguyen Thanh
    Ly, Nguyen Hong Thao
    POLISH JOURNAL OF ENVIRONMENTAL STUDIES, 2023, 32 (03): : 2113 - 2124
  • [17] Evaluating mobile ad hoc networks: A performance index and statistical model
    Ajbar, Ikhlas
    Perkins, Dmitri
    PROCEEDINGS - 16TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS, VOLS 1-3, 2007, : 284 - 290
  • [18] A Statistical Performance Prediction Model for OpenCL Kernels on NVIDIA GPUs
    Karami, Ali
    Mirsoleimani, Sayyed Ali
    Khunjush, Farshad
    2013 17TH CSI INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND DIGITAL SYSTEMS (CADS 2013), 2013, : 15 - 22
  • [19] Water quality prediction in a reservoir: Linguistic model approach for interval prediction
    Jin-Il Park
    Nahm-Chung Jung
    Keun-Chang Kwak
    Myung-Geun Chun
    International Journal of Control, Automation and Systems, 2010, 8 : 868 - 874
  • [20] Water Quality Prediction in a Reservoir: Linguistic Model Approach for Interval Prediction
    Park, Jin-Il
    Jung, Nahm-Chung
    Kwak, Keun-Chang
    Chun, Myung-Geun
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2010, 8 (04) : 868 - 874