FINDING CAUSES OF OUTLIERS IN MULTIVARIATE ENVIRONMENTAL DATA

被引:8
|
作者
GARNER, FC
STAPANIAN, MA
FITZGERALD, KE
机构
关键词
MULTIVARIATE KURTOSIS; GENERALIZED DISTANCE; MULTIVARIATE OUTLIERS;
D O I
10.1002/cem.1180050311
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multivariate outliers in environmental data sets are often caused by atypical measurement error in a single variable. From a quality assurance perspective it is important to identify these variables efficiently so that corrective actions may be performed. We demonstrate a procedure for using two multivariate tests to identify which variable 'caused' each outlier. The procedure is tested with simulated data sets that have the same correlation structure as selected water chemistry variables from a survey of lakes in the Western United States. The success rates are evaluated for three of the variables for sample sizes of 50 and 100, significance levels of 0.01 and 0.05 and various amounts of mean shift. The procedure works best for highly correlated variables.
引用
收藏
页码:241 / 248
页数:8
相关论文
共 50 条
  • [31] A comparative study of methods to handle outliers in multivariate data analysis
    Grentzelos, Christos
    Caroni, Chrysseis
    Barranco-Chamorro, Inmaculada
    COMPUTATIONAL AND MATHEMATICAL METHODS, 2021, 3 (03)
  • [32] COMPARISON OF DIFFERENT TECHNIQUES FOR DETECTION OF OUTLIERS IN CASE OF MULTIVARIATE DATA
    Iqbal, Muhammad Zafar
    Habib, Samra
    Khan, Muhammad Imran
    Kashif, Muhammad
    PAKISTAN JOURNAL OF AGRICULTURAL SCIENCES, 2020, 57 (03): : 865 - 869
  • [33] Detection of multivariate outliers in business survey data with incomplete information
    Todorov, Valentin
    Templ, Matthias
    Filzmoser, Peter
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2011, 5 (01) : 37 - 56
  • [34] FINDING OUTLIERS THAT MATTER
    ANDREWS, DF
    PREGIBON, D
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1978, 40 (01): : 85 - 93
  • [35] Detecting outliers in multivariate data and visualization-R scripts
    Kim, Sung-Soo
    KOREAN JOURNAL OF APPLIED STATISTICS, 2018, 31 (04) : 517 - 528
  • [36] Eigenstructure-Based Angle for Detecting Outliers in Multivariate Data
    Aziz, Nazrina
    SAINS MALAYSIANA, 2014, 43 (12): : 1973 - 1977
  • [37] Detection of multivariate outliers in business survey data with incomplete information
    Valentin Todorov
    Matthias Templ
    Peter Filzmoser
    Advances in Data Analysis and Classification, 2011, 5 : 37 - 56
  • [39] The Comparison of Algorithms for the Automatic Detection of Outliers in Environmental Data
    Campulova, Martina
    Grochova, Ladislava Issever
    Michalek, Jaroslav
    INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS (ICNAAM-2018), 2019, 2116
  • [40] Detection of outliers in multivariate data: A method based on clustering and robust estimators
    Santos-Pereira, CM
    Pires, AM
    COMPSTAT 2002: PROCEEDINGS IN COMPUTATIONAL STATISTICS, 2002, : 291 - 296