Fast covariance estimation for high-dimensional functional data

被引:49
|
作者
Xiao, Luo [1 ]
Zipunnikov, Vadim [1 ]
Ruppert, David [2 ,3 ]
Crainiceanu, Ciprian [1 ]
机构
[1] Johns Hopkins Univ, Dept Biostat, Baltimore, MD 21205 USA
[2] Cornell Univ, Dept Stat Sci, Ithaca, NY USA
[3] Cornell Univ, Sch Operat Res & Informat Engn, Ithaca, NY USA
关键词
FACE; fPCA; Penalized splines; Sandwich smoother; Smoothing; Singular value decomposition; PRINCIPAL-COMPONENTS-ANALYSIS; NONPARAMETRIC-ESTIMATION; REGRESSION; SPLINES;
D O I
10.1007/s11222-014-9485-x
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We propose two fast covariance smoothing methods and associated software that scale up linearly with the number of observations per function. Most available methods and software cannot smooth covariance matrices of dimension J > 500; a recently introduced sandwich smoother is an exception but is not adapted to smooth covariance matrices of large dimensions, such as J = 10,000. We introduce two new methods that circumvent those problems: (1) a fast implementation of the sandwich smoother for covariance smoothing; and (2) a two-step procedure that first obtains the singular value decomposition of the data matrix and then smoothes the eigenvectors. These new approaches are at least an order of magnitude faster in high dimensions and drastically reduce computer memory requirements. The new approaches provide instantaneous (a few seconds) smoothing for matrices of dimension J = 10,000 and very fast (< 10 min) smoothing for J = 100,000. R functions, simulations, and data analysis provide ready to use, reproducible, and scalable tools for practical data analysis of noisy high-dimensional functional data.
引用
收藏
页码:409 / 421
页数:13
相关论文
共 50 条
  • [1] Fast covariance estimation for high-dimensional functional data
    Luo Xiao
    Vadim Zipunnikov
    David Ruppert
    Ciprian Crainiceanu
    [J]. Statistics and Computing, 2016, 26 : 409 - 421
  • [2] Fast and Positive Definite Estimation of Large Covariance Matrix for High-Dimensional Data Analysis
    Wen, Fei
    Chu, Lei
    Ying, Rendong
    Liu, Peilin
    [J]. IEEE TRANSACTIONS ON BIG DATA, 2021, 7 (03) : 603 - 609
  • [3] High-dimensional covariance matrix estimation
    Lam, Clifford
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2020, 12 (02)
  • [4] Adaptive banding covariance estimation for high-dimensional multivariate longitudinal data
    Qian, Fang
    Zhang, Weiping
    Chen, Yu
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2021, 49 (03): : 906 - 938
  • [5] Ridge estimation of inverse covariance matrices from high-dimensional data
    van Wieringen, Wessel N.
    Peeters, Carel F. W.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2016, 103 : 284 - 303
  • [6] A new robust covariance matrix estimation for high-dimensional microbiome data
    Wang, Jiyang
    Liang, Wanfeng
    Li, Lijie
    Wu, Yue
    Ma, Xiaoyan
    [J]. AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2024, 66 (02) : 281 - 295
  • [7] Optimal covariance matrix estimation for high-dimensional noise in high-frequency data
    Chang, Jinyuan
    Hu, Qiao
    Liu, Cheng
    Tang, Cheng Yong
    [J]. JOURNAL OF ECONOMETRICS, 2024, 239 (02)
  • [8] Fast covariance estimation for sparse functional data
    Xiao, Luo
    Li, Cai
    Checkley, William
    Crainiceanu, Ciprian
    [J]. STATISTICS AND COMPUTING, 2018, 28 (03) : 511 - 522
  • [9] Minimax optimal estimation of high-dimensional sparse covariance matrices with missing data
    Qi, Xinyu
    Wang, Jinru
    Zeng, Xiaochen
    [J]. INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2022, 20 (06)
  • [10] Fast covariance estimation for sparse functional data
    Luo Xiao
    Cai Li
    William Checkley
    Ciprian Crainiceanu
    [J]. Statistics and Computing, 2018, 28 : 511 - 522