Statistical methods and pitfalls in environmental data analysis

被引:19
|
作者
Rong, Y [1 ]
机构
[1] Calif Reg Water Qual Control Board, Los Angeles, CA 90013 USA
关键词
normal distribution; log-normal distribution; percentile; confidence interval; correlation coefficient; regression; ANOVA; groundwater monitoring;
D O I
10.1006/enfo.2000.0022
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This payer reviews four commonly used statistical methods for environmental data analysis and discusses potential pitfalls associated with application of these methods through real case study data. The four statistical methods are percentile and confidence interval, correlation coefficient, regression analysis, and analysis of variance (ANOVA). The potential pitfall for estimation of percentile and confidence interval includes the automatic assumption of a normal distribution to environmental data, which so often show a log-normal distribution. The potential pitfall for correlation coefficient includes the use of a wide range of data points in which the maximum in value may trivialize other smaller data points and consequently skew the correlation coefficient. The potential pitfall for regression analysis includes the propagation of uncertainties of input variables to the regression model prediction, which may be even more uncertain. The potential pitfall for ANOVA includes the acceptance of a hypothesis as a weak argument to imply a strong conclusion. As demonstrated in this paper, we may draw very different conclusions based on statistical analysis if the pitfalls are not identified. Reminder and enlightenment obtained from the pitfalls are given at the end of this article. (C) 2000 AEHS.
引用
收藏
页码:213 / 220
页数:8
相关论文
共 50 条
  • [1] Analysis and commentary on "statistical methods and pitfalls in environmental data analysis" by Yue Rong
    Sutherland, RA
    ENVIRONMENTAL FORENSICS, 2001, 2 (04) : 265 - 274
  • [2] Response to Dr Ross Sutherland's comments on the article "Statistical methods and pitfalls in environmental data analysis"
    Rong, Y
    ENVIRONMENTAL FORENSICS, 2001, 2 (04) : 275 - 275
  • [3] Pitfalls in statistical methods
    Mario Petretta
    Alberto Cuocolo
    Journal of Nuclear Cardiology, 2012, 19 : 818 - 818
  • [4] Pitfalls in statistical methods
    Petretta, Mario
    Cuocolo, Alberto
    JOURNAL OF NUCLEAR CARDIOLOGY, 2012, 19 (04) : 818 - 818
  • [5] Pitfalls in statistical methods
    Fei Gao
    David Machin
    Journal of Nuclear Cardiology, 2013, 20 : 650 - 651
  • [6] Pitfalls in statistical methods
    Gao, Fei
    Machin, David
    JOURNAL OF NUCLEAR CARDIOLOGY, 2013, 20 (04) : 650 - 651
  • [7] Pitfalls in statistical methods Reply
    Gibbons, Raymond J.
    Hodge, David O.
    JOURNAL OF NUCLEAR CARDIOLOGY, 2012, 19 (04) : 819 - 819
  • [8] Pitfalls in the statistical analysis of microbiome amplicon sequencing data
    Boshuizen, Hendriek C.
    te Beest, Dennis E.
    MOLECULAR ECOLOGY RESOURCES, 2023, 23 (03) : 539 - 548
  • [9] Statistical methods of data analysis
    Galanis, P.
    ARCHIVES OF HELLENIC MEDICINE, 2009, 26 (05): : 699 - 711
  • [10] Multivariate statistical analysis of environmental data
    Brzezinska, Justyna
    Rybicka, Aneta
    Pelka, Marcin
    12TH PROFESSOR ALEKSANDER ZELIAS INTERNATIONAL CONFERENCE ON MODELLING AND FORECASTING OF SOCIO-ECONOMIC PHENOMENA, 2018, 1 : 40 - 49