FINDING CAUSES OF OUTLIERS IN MULTIVARIATE ENVIRONMENTAL DATA

被引:8
|
作者
GARNER, FC
STAPANIAN, MA
FITZGERALD, KE
机构
关键词
MULTIVARIATE KURTOSIS; GENERALIZED DISTANCE; MULTIVARIATE OUTLIERS;
D O I
10.1002/cem.1180050311
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multivariate outliers in environmental data sets are often caused by atypical measurement error in a single variable. From a quality assurance perspective it is important to identify these variables efficiently so that corrective actions may be performed. We demonstrate a procedure for using two multivariate tests to identify which variable 'caused' each outlier. The procedure is tested with simulated data sets that have the same correlation structure as selected water chemistry variables from a survey of lakes in the Western United States. The success rates are evaluated for three of the variables for sample sizes of 50 and 100, significance levels of 0.01 and 0.05 and various amounts of mean shift. The procedure works best for highly correlated variables.
引用
收藏
页码:241 / 248
页数:8
相关论文
共 50 条
  • [1] FINDING SUSPECTED CAUSES OF MEASUREMENT ERROR IN MULTIVARIATE ENVIRONMENTAL DATA
    STAPANIAN, MA
    GARNER, FC
    FITZGERALD, KE
    FLATMAN, GT
    NOCERINO, JM
    [J]. JOURNAL OF CHEMOMETRICS, 1993, 7 (03) : 165 - 176
  • [2] Finding multivariate outliers with FastPCS
    Vakili, Kaveh
    Schmitt, Eric
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 69 : 54 - 66
  • [3] Finding multivariate outliers in fMRI time-series data
    Magnotti, John F.
    Billor, Nedret
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2014, 53 : 115 - 124
  • [4] Finding an unknown number of multivariate outliers
    Riani, Marco
    Atkinson, Anthony C.
    Cerioli, Andrea
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2009, 71 : 447 - 466
  • [5] ON THE DETECTION OF MULTIVARIATE DATA OUTLIERS AND REGRESSION OUTLIERS
    LAZRAQ, A
    CLEROUX, R
    [J]. DATA ANALYSIS, LEARNING SYMBOLIC AND NUMERIC KNOWLEDGE, 1989, : 133 - 140
  • [6] Correlation of Outliers in Multivariate Data
    Kaszuba, Bartosz
    [J]. DATA ANALYSIS, MACHINE LEARNING AND KNOWLEDGE DISCOVERY, 2014, : 265 - 272
  • [7] Finding the Outliers in Scanpath Data
    Burch, Michael
    Kumar, Ayush
    Mueller, Klaus
    Kervezee, Titus
    Nuijten, Wouter
    Oostenbach, Rens
    Peeters, Lucas
    Smit, Gijs
    [J]. ETRA 2019: 2019 ACM SYMPOSIUM ON EYE TRACKING RESEARCH & APPLICATIONS, 2019,
  • [8] Identification of outliers in multivariate data
    Rocke, DM
    Woodruff, DL
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1996, 91 (435) : 1047 - 1061
  • [9] PROPAGATION OF OUTLIERS IN MULTIVARIATE DATA
    Alqallaf, Fatemah
    Van Aelst, Stefan
    Yohai, Victor J.
    Zamar, Ruben H.
    [J]. ANNALS OF STATISTICS, 2009, 37 (01): : 311 - 331
  • [10] Statistical Method for Finding Outliers in Multivariate Data using a Boxplot and Multiple Linear Regression
    Thanwiset, Theeraphat
    Srisodaphol, Wuttichai
    [J]. SAINS MALAYSIANA, 2023, 52 (09): : 2725 - 2732