Evaluating statistical model performance in water quality prediction

Cited by: 85
Authors
Avila, Rodelyn [1, 2]
Horn, Beverley [2]
Moriarty, Elaine [2]
Hodson, Roger [3]
Moltchanova, Elena [1]
Affiliations
[1] Univ Canterbury, Sch Math & Stat, Private Bag 4800, Christchurch 8140, New Zealand
[2] ESR, Inst Environm Sci & Res, POB 29181, Christchurch 8540, New Zealand
[3] Environm Southland, Private Bag 90116, Invercargill 9840, New Zealand
Keywords
Water quality prediction; E. coli; Statistical models; Bayesian networks; ESCHERICHIA-COLI; SURVIVAL; HEALTH; MPN
DOI
10.1016/j.jenvman.2017.11.049
Chinese Library Classification (CLC) number
X [Environmental science, safety science]
Discipline classification code
08; 0830
Abstract
Exposure to contaminated water while swimming, boating or participating in other recreational activities can cause gastrointestinal and respiratory disease. It is not uncommon for water bodies to experience rapid fluctuations in water quality, and it is therefore vital to be able to predict them accurately and in good time so as to minimise the population's exposure to pathogenic organisms. E. coli is commonly used as an indicator to measure water quality in freshwater, and higher counts of E. coli are associated with increased risk of illness. In this case study, we compare the performance of a wide range of statistical models in predicting water quality via E. coli levels for the weekly data collected over the summer months from 2006 to 2014 at the recreational site on the Oreti river in Wallacetown, New Zealand. The models include a naive model, multiple linear regression, dynamic regression, regression tree, Markov chain, classification tree, random forests, multinomial logistic regression, discriminant analysis and Bayesian network. The results show that the Bayesian network was superior to all the other models. Overall, it had a leave-one-out and k-fold cross-validation error rate of 21%, while predicting the majority of instances of E. coli levels classified as unsafe by the Microbiological Water Quality Guidelines for Marine and Freshwater Recreational Areas 2003, New Zealand. Because Bayesian networks are also flexible in handling missing data and outliers and allow for continuous updating in real time, we have found them to be a promising tool and, in the future, plan to extend the analysis beyond the current case study site. © 2017 Elsevier Ltd. All rights reserved.
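The abstract reports leave-one-out and k-fold cross-validation error rates. As a rough illustration of how such an error rate is computed, the following minimal sketch evaluates a naive majority-class model on made-up category labels. The data, the function names and the interleaved fold assignment are all assumptions for illustration; this is not the authors' code, data or fold scheme.

```python
from collections import Counter


def majority_class(labels):
    """Naive model: always predict the most frequent training label."""
    return Counter(labels).most_common(1)[0][0]


def kfold_error_rate(labels, k):
    """Misclassification rate of the naive model under k-fold cross-validation."""
    n = len(labels)
    errors = 0
    for fold in range(k):
        # Hypothetical fold assignment: every k-th observation forms one fold.
        test_idx = set(range(fold, n, k))
        train = [y for i, y in enumerate(labels) if i not in test_idx]
        prediction = majority_class(train)
        errors += sum(labels[i] != prediction for i in test_idx)
    return errors / n


# Made-up weekly water-quality categories (NOT the paper's data).
labels = ["safe"] * 7 + ["unsafe"] * 3

loo_error = kfold_error_rate(labels, k=len(labels))  # leave-one-out is k = n
five_fold_error = kfold_error_rate(labels, k=5)
print(loo_error, five_fold_error)
```

Leave-one-out is simply the special case where the number of folds equals the number of observations, which is why one function covers both. A model such as the Bayesian network described in the abstract would replace `majority_class` with its own fit-and-predict step.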
Pages: 910-919
Number of pages: 10