Robust Estimation of Multivariate Location and Scatter in the Presence of Missing Data

被引:16
|
作者
Danilov, Mike [1 ]
Yohai, Victor J. [2 ]
Zamar, Ruben H. [3 ]
机构
[1] Google, Mountain View, CA 94043 USA
[2] Univ Buenos Aires, Fac Ciencias Exactas & Nat, Dept Matemat, RA-1428 Buenos Aires, DF, Argentina
[3] Univ British Columbia, Dept Stat, Vancouver, BC V6T 1Z2, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Consistent; Elliptical distribution; EM algorithm; Fixed point equation; LIKELIHOOD; IMPUTATION; COVARIANCE; SHAPE;
D O I
10.1080/01621459.2012.699792
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Two main issues regarding data quality are data contamination (outliers) and data completion (missing data). These two problems have attracted much attention and research but surprisingly, they are seldom considered together. Popular robust methods such as S-estimators of multivariate location and scatter offer protection against outliers but cannot deal with missing data, except for the obviously inefficient approach of deleting all incomplete cases. We generalize the definition of S-estimators of multivariate location and scatter to simultaneously deal with missing data and outliers. We show that the proposed estimators are strongly consistent under elliptical models when data are missing completely at random. We derive an algorithm similar to the Expectation-Maximization algorithm for computing the proposed estimators. This algorithm is initialized by an extension for missing data of the minimum volume ellipsoid. We assess the performance of our proposal by Monte Carlo simulation and give some real data examples. This article has supplementary material online.
引用
收藏
页码:1178 / 1186
页数:9
相关论文
共 50 条
  • [1] Robust estimation of multivariate location and scatter in the presence of cellwise and casewise contamination
    Agostinelli, Claudio
    Leung, Andy
    Yohai, Victor J.
    Zamar, Ruben H.
    [J]. TEST, 2015, 24 (03) : 441 - 461
  • [2] Robust and efficient estimation of multivariate scatter and location
    Maronna, Ricardo A.
    Yohai, Victor J.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2017, 109 : 64 - 75
  • [3] Rejoinder on: Robust estimation of multivariate location and scatter in the presence of cellwise and casewise contamination
    Claudio Agostinelli
    Andy Leung
    Victor J. Yohai
    Ruben H. Zamar
    [J]. TEST, 2015, 24 : 484 - 488
  • [4] Comments on: Robust estimation of multivariate location and scatter in the presence of cellwise and casewise contamination
    Roy E. Welsch
    [J]. TEST, 2015, 24 : 482 - 483
  • [5] Comments on: Robust estimation of multivariate location and scatter in the presence of cellwise and casewise contamination
    Welsch, Roy E.
    [J]. TEST, 2015, 24 (03) : 482 - 483
  • [6] Optimization techniques for robust multivariate location and scatter estimation
    Chatzinakos, C.
    Pitsoulis, L.
    Zioutas, G.
    [J]. JOURNAL OF COMBINATORIAL OPTIMIZATION, 2016, 31 (04) : 1443 - 1460
  • [7] Optimization techniques for robust multivariate location and scatter estimation
    C. Chatzinakos
    L. Pitsoulis
    G. Zioutas
    [J]. Journal of Combinatorial Optimization, 2016, 31 : 1443 - 1460
  • [8] Robust location estimation with missing data
    Sued, Mariela
    Yohai, Victor J.
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2013, 41 (01): : 111 - 132
  • [9] ROBUST M-ESTIMATORS OF MULTIVARIATE LOCATION AND SCATTER IN THE PRESENCE OF ASYMMETRY
    WIENS, DP
    ZHENG, Z
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 1986, 14 (02): : 161 - 176
  • [10] D-optimal design methods for robust estimation of multivariate location and scatter
    Poston, WL
    Wegman, EJ
    Solka, JL
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 1998, 73 (1-2) : 205 - 213