Statistical methods and pitfalls in environmental data analysis

Cited by: 19
Author
Rong, Y [1]
Affiliation
[1] California Regional Water Quality Control Board, Los Angeles, CA 90013, USA
Keywords
normal distribution; log-normal distribution; percentile; confidence interval; correlation coefficient; regression; ANOVA; groundwater monitoring
DOI
10.1006/enfo.2000.0022
Chinese Library Classification number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
This paper reviews four statistical methods commonly used in environmental data analysis and uses real case-study data to discuss the potential pitfalls in applying them. The four methods are percentile and confidence interval estimation, the correlation coefficient, regression analysis, and analysis of variance (ANOVA). For percentiles and confidence intervals, the pitfall is automatically assuming a normal distribution for environmental data, which often follow a log-normal distribution. For the correlation coefficient, the pitfall is using data that span a wide range, in which the largest values may trivialize the smaller data points and consequently skew the coefficient. For regression analysis, the pitfall is the propagation of uncertainties in the input variables to the model prediction, which may be even more uncertain. For ANOVA, the pitfall is treating the acceptance of a hypothesis, which is a weak argument, as if it implied a strong conclusion. As the paper demonstrates, statistical analysis may lead to very different conclusions if these pitfalls are not identified. Lessons drawn from the pitfalls are given at the end of the article. (C) 2000 AEHS.
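The first two pitfalls named in the abstract can be illustrated with a minimal Python sketch. The concentration values and the paired data below are hypothetical, chosen only for illustration; they are not the case-study data from the paper, and the helper names (`percentile_normal`, `percentile_lognormal`, `pearson_r`) are assumptions introduced here. The sketch compares percentile estimates under a normal versus a log-normal assumption, then shows how one very large pair can dominate a Pearson correlation coefficient.

```python
import math
from statistics import NormalDist, mean, stdev

# Hypothetical right-skewed concentrations (illustrative only, not from the paper).
conc = [1.2, 0.8, 2.5, 1.9, 3.1, 0.6, 4.4, 1.1, 12.0, 2.2, 0.9, 30.0]

def percentile_normal(data, p):
    """Quantile p (0 < p < 1) assuming the raw data are normally distributed."""
    return NormalDist(mean(data), stdev(data)).inv_cdf(p)

def percentile_lognormal(data, p):
    """Quantile p (0 < p < 1) assuming log(data) is normally distributed."""
    logs = [math.log(x) for x in data]
    return math.exp(NormalDist(mean(logs), stdev(logs)).inv_cdf(p))

def pearson_r(x, y):
    """Pearson correlation coefficient computed from its definition."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

# Pitfall 1: the two distributional assumptions give different percentile estimates.
p05_normal = percentile_normal(conc, 0.05)
p05_lognormal = percentile_lognormal(conc, 0.05)
p95_normal = percentile_normal(conc, 0.95)
p95_lognormal = percentile_lognormal(conc, 0.95)

# Pitfall 2: weakly related small values, then the same values plus one large pair.
x_small = [1.0, 2.0, 3.0, 4.0, 5.0]
y_small = [3.0, 1.0, 4.0, 2.0, 3.5]
r_without = pearson_r(x_small, y_small)
r_with = pearson_r(x_small + [100.0], y_small + [90.0])

print(f"5th percentile:  normal={p05_normal:.2f}, log-normal={p05_lognormal:.2f}")
print(f"95th percentile: normal={p95_normal:.2f}, log-normal={p95_lognormal:.2f}")
print(f"Pearson r: without large pair={r_without:.2f}, with large pair={r_with:.2f}")
```

With these made-up values, the normal assumption yields a negative 5th percentile, which is not physically meaningful for a concentration, while the log-normal estimate stays positive. The single large pair also pushes the correlation coefficient close to 1 even though the smaller points are only weakly related.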
Pages: 213-220
Number of pages: 8