Multivariate location and scatter matrix estimation under cellwise and casewise contamination

被引:10
|
作者
Leung, Andy [1 ]
Yohai, Victor [2 ]
Zamar, Ruben [1 ]
机构
[1] Univ British Columbia, Dept Stat, 3182-2207 Main Mall, Vancouver, BC V6T 1Z4, Canada
[2] Univ Buenos Aires, Fac Ciencias Exactas & Nat, Dept Matemat, Ciudad Univ,Pabellon 1, RA-1426 Buenos Aires, DF, Argentina
基金
加拿大自然科学与工程研究理事会;
关键词
Multivariate location and scatter; Robust estimation; Cellwise outliers; Componentwise contamination; ROBUST ESTIMATION; OUTLIER DETECTION; HIGH DIMENSION;
D O I
10.1016/j.csda.2017.02.007
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Real data may contain both cellwise outliers and casewise outliers. There is a vast literature on robust estimation for casewise outliers, but only a scant literature for cellwise outliers and almost none for both types of outliers. Estimation of multivariate location and scatter matrix is a corner stone in multivariate data analysis. A two-step approach was recently proposed to perform robust estimation of multivariate location and scatter matrix in the presence of cellwise and casewise outliers. In the first step a univariate filter was applied to remove cellwise outliers. In the second step a generalized S-estimator was used to downweight casewise outliers. This proposal can be further improved in three main directions. First, through the introduction of a consistent bivariate filter to be used in combination with the univariate filter in the first step. Second, through the proposal of a new fast subsampling procedure to generate starting points for the generalized S-estimator in the second step. Third, through the use of a non-monotonic weight function for the generalized S-estimator to better handle casewise outliers in high dimension. A simulation study and a real data example show that, unlike the original two-step procedure, the modified two-step approach performs and scales well in high dimension. Moreover, they show that the modified procedure outperforms the original one and other state-of-the-art robust procedures under cellwise and casewise data contamination. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:59 / 76
页数:18
相关论文
共 50 条
  • [1] Robust estimation of multivariate location and scatter in the presence of cellwise and casewise contamination
    Agostinelli, Claudio
    Leung, Andy
    Yohai, Victor J.
    Zamar, Ruben H.
    [J]. TEST, 2015, 24 (03) : 441 - 461
  • [2] Comments on: Robust estimation of multivariate location and scatter in the presence of cellwise and casewise contamination
    Welsch, Roy E.
    [J]. TEST, 2015, 24 (03) : 482 - 483
  • [3] Comments on: Robust estimation of multivariate location and scatter in the presence of cellwise and casewise contamination
    Roy E. Welsch
    [J]. TEST, 2015, 24 : 482 - 483
  • [4] Rejoinder on: Robust estimation of multivariate location and scatter in the presence of cellwise and casewise contamination
    Claudio Agostinelli
    Andy Leung
    Victor J. Yohai
    Ruben H. Zamar
    [J]. TEST, 2015, 24 : 484 - 488
  • [5] Robust regression estimation and inference in the presence of cellwise and casewise contamination
    Leung, Andy
    Zhang, Hongyang
    Zamar, Ruben
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2016, 99 : 1 - 11
  • [6] High-dimensional robust precision matrix estimation: Cellwise corruption under ε-contamination
    Loh, Po-Ling
    Tan, Xin Lu
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2018, 12 (01): : 1429 - 1467
  • [7] Robust estimation of precision matrices under cellwise contamination
    Tarr, G.
    Mueller, S.
    Weber, N. C.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2016, 93 : 404 - 420
  • [8] Weighted likelihood estimation of multivariate location and scatter
    Agostinelli, Claudio
    Greco, Luca
    [J]. TEST, 2019, 28 (03) : 756 - 784
  • [9] Weighted likelihood estimation of multivariate location and scatter
    Claudio Agostinelli
    Luca Greco
    [J]. TEST, 2019, 28 : 756 - 784
  • [10] Robust and efficient estimation of multivariate scatter and location
    Maronna, Ricardo A.
    Yohai, Victor J.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2017, 109 : 64 - 75