Faster Algorithms for High-Dimensional Robust Covariance Estimation

被引:0
|
作者
Cheng, Yu [1 ]
Diakonikolas, Ilias [2 ]
Ge, Rong [1 ]
Woodruff, David P. [3 ]
机构
[1] Duke Univ, Durham, NC 27706 USA
[2] Univ Southern Calif, Los Angeles, CA USA
[3] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
来源
关键词
COMPLEXITY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study the problem of estimating the covariance matrix of a high-dimensional distribution when a small constant fraction of the samples can be arbitrarily corrupted. Recent work gave the first polynomial time algorithms for this problem with near-optimal error guarantees for several natural structured distributions. Our main contribution is to develop faster algorithms for this problem whose running time nearly matches that of computing the empirical covariance. Given N = (Omega) over tilde (d(2)/epsilon(2)) samples from a d-dimensional Gaussian distribution, an epsilon-fraction of which may be arbitrarily corrupted, our algorithm runs in time (O) over tilde (d(3.26))= poly(epsilon) and approximates the unknown covariance matrix to optimal error up to a logarithmic factor. Previous robust algorithms with comparable error guarantees all have runtimes (Omega) over tilde (d(2 omega)) when epsilon = Omega(1), where omega is the exponent of matrix multiplication. We also provide evidence that improving the running time of our algorithm may require new algorithmic techniques.
引用
收藏
页数:31
相关论文
共 50 条
  • [31] COMPUTATIONALLY EFFICIENT ALGORITHMS FOR HIGH-DIMENSIONAL ROBUST ESTIMATORS
    MOUNT, DM
    NETANYAHU, NS
    CVGIP-GRAPHICAL MODELS AND IMAGE PROCESSING, 1994, 56 (04): : 289 - 303
  • [32] High-Dimensional Covariance Estimation From a Small Number of Samples
    Vishny, David
    Morzfeld, Matthias
    Gwirtz, Kyle
    Bach, Eviatar
    Dunbar, Oliver R. A.
    Hodyss, Daniel
    JOURNAL OF ADVANCES IN MODELING EARTH SYSTEMS, 2024, 16 (09)
  • [33] HIGH-DIMENSIONAL COVARIANCE MATRIX ESTIMATION IN APPROXIMATE FACTOR MODELS
    Fan, Jianqing
    Liao, Yuan
    Mincheva, Martina
    ANNALS OF STATISTICS, 2011, 39 (06): : 3320 - 3356
  • [34] TEST FOR BANDEDNESS OF HIGH-DIMENSIONAL COVARIANCE MATRICES AND BANDWIDTH ESTIMATION
    Qiu, Yumou
    Chen, Song Xi
    ANNALS OF STATISTICS, 2012, 40 (03): : 1285 - 1314
  • [35] A BLOCKING AND REGULARIZATION APPROACH TO HIGH-DIMENSIONAL REALIZED COVARIANCE ESTIMATION
    Hautsch, Nikolaus
    Kyj, Lada M.
    Oomen, Roel C. A.
    JOURNAL OF APPLIED ECONOMETRICS, 2012, 27 (04) : 625 - 645
  • [36] High-dimensional Covariance Estimation Based On Gaussian Graphical Models
    Zhou, Shuheng
    Ruetimann, Philipp
    Xu, Min
    Buehlmann, Peter
    JOURNAL OF MACHINE LEARNING RESEARCH, 2011, 12 : 2975 - 3026
  • [37] ROBUST DEPENDENCE MODELING FOR HIGH-DIMENSIONAL COVARIANCE MATRICES WITH FINANCIAL APPLICATIONS
    Zhu, Zhe
    Welsch, Roy E.
    ANNALS OF APPLIED STATISTICS, 2018, 12 (02): : 1228 - 1249
  • [38] Detecting outliers in high-dimensional neuroimaging datasets with robust covariance estimators
    Fritsch, Virgile
    Varoquaux, Gael
    Thyreau, Benjamin
    Poline, Jean-Baptiste
    Thirion, Bertrand
    MEDICAL IMAGE ANALYSIS, 2012, 16 (07) : 1359 - 1370
  • [39] On Coupling Robust Estimation with Regularization for High-Dimensional Data
    Kalina, Jan
    Hlinka, Jaroslav
    DATA SCIENCE: INNOVATIVE DEVELOPMENTS IN DATA ANALYSIS AND CLUSTERING, 2017, : 15 - 27
  • [40] A Robust High-Dimensional Estimation of Multinomial Mixture Models
    Sabbaghi, Azam
    Eskandari, Farzad
    Navabpoor, Hamid Reza
    JOURNAL OF STATISTICAL THEORY AND APPLICATIONS, 2021, 20 (01): : 21 - 32