Statistical methods and pitfalls in environmental data analysis

被引:19
|
作者
Rong, Y [1 ]
机构
[1] Calif Reg Water Qual Control Board, Los Angeles, CA 90013 USA
关键词
normal distribution; log-normal distribution; percentile; confidence interval; correlation coefficient; regression; ANOVA; groundwater monitoring;
D O I
10.1006/enfo.2000.0022
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This payer reviews four commonly used statistical methods for environmental data analysis and discusses potential pitfalls associated with application of these methods through real case study data. The four statistical methods are percentile and confidence interval, correlation coefficient, regression analysis, and analysis of variance (ANOVA). The potential pitfall for estimation of percentile and confidence interval includes the automatic assumption of a normal distribution to environmental data, which so often show a log-normal distribution. The potential pitfall for correlation coefficient includes the use of a wide range of data points in which the maximum in value may trivialize other smaller data points and consequently skew the correlation coefficient. The potential pitfall for regression analysis includes the propagation of uncertainties of input variables to the regression model prediction, which may be even more uncertain. The potential pitfall for ANOVA includes the acceptance of a hypothesis as a weak argument to imply a strong conclusion. As demonstrated in this paper, we may draw very different conclusions based on statistical analysis if the pitfalls are not identified. Reminder and enlightenment obtained from the pitfalls are given at the end of this article. (C) 2000 AEHS.
引用
收藏
页码:213 / 220
页数:8
相关论文
共 50 条
  • [31] ANALYSIS OF PSYCHIATRIC DATA BY MULTIVARIATE STATISTICAL METHODS
    BENTE, D
    ENGELMEI.MP
    HEINRICH, K
    HIPPIUS, H
    SCHMITT, W
    ARZNEIMITTEL-FORSCHUNG, 1969, 19 (3A): : 405 - &
  • [32] Statistical methods for water data analysis and processing
    Lobanova, HV
    ADVANCES IN WATER SUPPLY MANAGEMENT, 2003, : 3 - 12
  • [33] Statistical methods for the analysis of NRS pain data
    Goulet, J.
    Buta, E.
    Carroll, C.
    Brandt, C.
    JOURNAL OF PAIN, 2015, 16 (04): : S7 - S7
  • [34] Statistical and Econometric Methods for Transportation Data Analysis
    Kevin Cullinane
    Maritime Economics & Logistics, 2004, 6 (2) : 187 - 189
  • [35] Statistical methods for the analysis of adverse event data
    Kieser, Meinhard
    PHARMACEUTICAL STATISTICS, 2016, 15 (04) : 290 - 291
  • [36] STATISTICAL METHODS FOR ANALYSIS OF EARNINGS DATA - KLEVMARKEN,A
    RAO, CR
    SANKHYA-THE INDIAN JOURNAL OF STATISTICS SERIES B, 1973, 35 (JUN): : 278 - 278
  • [37] METHODS OF WEIGHTING HATCHING DATA IN STATISTICAL ANALYSIS
    BATEN, WD
    HENDERSON, EW
    POULTRY SCIENCE, 1958, 37 (04) : 781 - 785
  • [38] A Survey of Statistical Methods for Microbiome Data Analysis
    Lutz, Kevin C.
    Jiang, Shuang
    Neugent, Michael L.
    De Nisco, Nicole J.
    Zhan, Xiaowei
    Li, Qiwei
    FRONTIERS IN APPLIED MATHEMATICS AND STATISTICS, 2022, 8
  • [39] Selected statistical methods of data analysis for multivariate functional data
    Gorecki, Tomasz
    Krzysko, Miroslaw
    Waszak, Lukasz
    Wolynski, Waldemar
    STATISTICAL PAPERS, 2018, 59 (01) : 153 - 182
  • [40] Analysis of Neurosurgery Data Using Statistical and Data Mining Methods
    Berka, Petr
    Jablonsky, Josef
    Marek, Lubos
    Vrabec, Michal
    ADVANCES IN ARTIFICIAL INTELLIGENCE AND ITS APPLICATIONS, MICAI 2015, PT II, 2015, 9414 : 310 - 321