The BACON Approach for Rank-Deficient Data

Authors
Kondylis, Athanassios [1]
Hadi, Ali S. [2, 3]
Werner, Mark [4]
Affiliations
[1] Philip Morris Int ACR, Neuchatel, Switzerland
[2] Amer Univ Cairo, Dept Math & Actuarial Sci, Cairo, Egypt
[3] Cornell Univ, Dept Stat Sci, Ithaca, NY 14853 USA
[4] Univ Georgia, Dept Stat, Athens, GA 30602 USA
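Keywords
High-dimensional data; Mahalanobis distance; Outlier detection; Spatial median
DOI
10.18187/pjsor.v8i3.514
Chinese Library Classification (CLC) number
O21 [Probability and Mathematical Statistics]; C8 [Statistics]
Subject classification codes
020208; 070103; 0714
Abstract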
Rank-deficient data are not uncommon in practice. They result from highly collinear variables and/or high-dimensional data. A special case of the latter occurs when the number of recorded variables exceeds the number of observations. The use of the BACON algorithm for outlier detection in multivariate data is extended here to include rank-deficient data. We present two approaches to identifying outliers in rank-deficient data, both based on the original BACON algorithm. The first algorithm projects the data onto a robust subspace of reduced dimension, while the second applies a ridge-type regularization to the covariance matrix. Both algorithms are tested on real as well as simulated data sets, with good results in terms of their effectiveness in outlier detection. They are also examined in terms of computational efficiency and found to be very fast, with particularly good scaling properties for increasing dimension.
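To illustrate why rank deficiency matters for Mahalanobis-type distances, the following is a minimal sketch, not the authors' algorithm. It only shows that the sample covariance is singular when variables outnumber observations, and that adding a ridge term makes the distance computable. The function name `ridge_mahalanobis` and the `ridge` value are assumptions made for illustration; the sketch uses the ordinary mean and covariance and omits the robust centering and the iterative basic-subset steps of BACON.

```python
import numpy as np


def ridge_mahalanobis(X, ridge=1.0):
    """Squared Mahalanobis-type distances using a ridge-regularized covariance.

    `ridge` is a hypothetical tuning constant chosen for illustration only.
    """
    X = np.asarray(X, dtype=float)
    n, p = X.shape
    Xc = X - X.mean(axis=0)              # center with the (non-robust) mean
    S = Xc.T @ Xc / (n - 1)              # sample covariance, rank at most n - 1
    S_ridge = S + ridge * np.eye(p)      # positive definite whenever ridge > 0
    sol = np.linalg.solve(S_ridge, Xc.T)  # avoids forming an explicit inverse
    return np.einsum("ij,ji->i", Xc, sol)  # row-wise quadratic forms


rng = np.random.default_rng(0)
X = rng.normal(size=(20, 50))            # 20 observations, 50 variables

# The unregularized covariance cannot be inverted: its rank is below p.
rank = np.linalg.matrix_rank(np.cov(X, rowvar=False))

# With the ridge term, every observation gets a finite distance.
d2 = ridge_mahalanobis(X, ridge=1.0)
```

A robust procedure would recompute the center and covariance from an outlier-free subset of observations rather than from all rows, which is the part this sketch leaves out.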
Pages: 359-379
Number of pages: 21