Fast covariance estimation for high-dimensional functional data

被引:49
|
作者
Xiao, Luo [1 ]
Zipunnikov, Vadim [1 ]
Ruppert, David [2 ,3 ]
Crainiceanu, Ciprian [1 ]
机构
[1] Johns Hopkins Univ, Dept Biostat, Baltimore, MD 21205 USA
[2] Cornell Univ, Dept Stat Sci, Ithaca, NY USA
[3] Cornell Univ, Sch Operat Res & Informat Engn, Ithaca, NY USA
关键词
FACE; fPCA; Penalized splines; Sandwich smoother; Smoothing; Singular value decomposition; PRINCIPAL-COMPONENTS-ANALYSIS; NONPARAMETRIC-ESTIMATION; REGRESSION; SPLINES;
D O I
10.1007/s11222-014-9485-x
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We propose two fast covariance smoothing methods and associated software that scale up linearly with the number of observations per function. Most available methods and software cannot smooth covariance matrices of dimension J > 500; a recently introduced sandwich smoother is an exception but is not adapted to smooth covariance matrices of large dimensions, such as J = 10,000. We introduce two new methods that circumvent those problems: (1) a fast implementation of the sandwich smoother for covariance smoothing; and (2) a two-step procedure that first obtains the singular value decomposition of the data matrix and then smoothes the eigenvectors. These new approaches are at least an order of magnitude faster in high dimensions and drastically reduce computer memory requirements. The new approaches provide instantaneous (a few seconds) smoothing for matrices of dimension J = 10,000 and very fast (< 10 min) smoothing for J = 100,000. R functions, simulations, and data analysis provide ready to use, reproducible, and scalable tools for practical data analysis of noisy high-dimensional functional data.
引用
收藏
页码:409 / 421
页数:13
相关论文
共 50 条
  • [41] Test on the linear combinations of covariance matrices in high-dimensional data
    Zhidong Bai
    Jiang Hu
    Chen Wang
    Chao Zhang
    [J]. Statistical Papers, 2021, 62 : 701 - 719
  • [42] Homogeneity tests of covariance matrices with high-dimensional longitudinal data
    Zhong, Ping-Shou
    Li, Runze
    Santo, Shawn
    [J]. BIOMETRIKA, 2019, 106 (03) : 619 - 634
  • [43] Homogeneity test of several covariance matrices with high-dimensional data
    Qayed, Abdullah
    Han, Dong
    [J]. JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2021, 31 (04) : 523 - 540
  • [44] The minimum weighted covariance determinant estimator for high-dimensional data
    Jan Kalina
    Jan Tichavský
    [J]. Advances in Data Analysis and Classification, 2022, 16 : 977 - 999
  • [45] Test on the linear combinations of covariance matrices in high-dimensional data
    Bai, Zhidong
    Hu, Jiang
    Wang, Chen
    Zhang, Chao
    [J]. STATISTICAL PAPERS, 2021, 62 (02) : 701 - 719
  • [46] A Comparative Study of Covariance Matrix Estimators in High-Dimensional Data
    Lee, DongHyuk
    Lee, Jae Won
    [J]. KOREAN JOURNAL OF APPLIED STATISTICS, 2013, 26 (05) : 747 - 758
  • [47] The minimum weighted covariance determinant estimator for high-dimensional data
    Kalina, Jan
    Tichavsky, Jan
    [J]. ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2022, 16 (04) : 977 - 999
  • [48] Fast Poisson estimation with high-dimensional fixed effects
    Correia, Sergio
    Guimaraes, Paulo
    Zylkin, Tom
    [J]. STATA JOURNAL, 2020, 20 (01): : 95 - 115
  • [49] Estimation of high-dimensional integrated covariance matrix based on noisy high-frequency data with multiple observations
    Wang, Moming
    Xia, Ningning
    [J]. STATISTICS & PROBABILITY LETTERS, 2021, 170
  • [50] Lower bound estimation for a family of high-dimensional sparse covariance matrices
    Li, Huimin
    Liu, Youming
    [J]. INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2024, 22 (02)