Functional data analysis for sparse longitudinal data

被引:1032
|
作者
Yao, F [1 ]
Müller, HG
Wang, JL
机构
[1] Colorado State Univ, Dept Stat, Ft Collins, CO 80523 USA
[2] Univ Calif Davis, Dept Stat, Davis, CA 95616 USA
基金
美国国家科学基金会;
关键词
asymptotics; conditioning; confidence band; measurement error; principal components; simultaneous inferences; smoothing;
D O I
10.1198/016214504000001745
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose a nonparametric method to perform functional principal components analysis for the case of sparse longitudinal data. The method aims at irregularly spaced longitudinal data, where the number of repeated measurements available per subject is small. In contrast, classical functional data analysis requires a large number of regularly spaced measurements per subject. We assume that the repeated measurements are located randomly with a random number of repetitions for each subject and are determined by an underlying smooth random (subject-specific) trajectory plus measurement errors. Basic elements of our approach are the parsimonious estimation of the covariance structure and mean function of the trajectories, and the estimation of the variance of the measurement errors. The eigenfunction basis is estimated from the data, and functional principal components score estimates are obtained by a conditioning step. This conditional estimation method is conceptually simple and straightforward to implement. A key step is the derivation of asymptotic consistency and distribution results under mild conditions, using tools from functional analysis. Functional data analysis for sparse longitudinal data enables prediction of individual smooth trajectories even if only one or few measurements are available for a subject. Asymptotic pointwise and simultaneous confidence bands are obtained for predicted individual trajectories, based on asymptotic distributions, for simultaneous bands under the assumption of a finite number of components. Model selection techniques, such as the Akaike information criterion, are used to choose the model dimension corresponding to the number of eigenfunctions in the model. The methods are illustrated with a simulation study, longitudinal CD4 data for a sample of AIDS patients, and time-course gene expression data for the yeast cell cycle.
引用
收藏
页码:577 / 590
页数:14
相关论文
共 50 条
  • [31] Likelihood Ratio Tests for Dependent Data with Applications to Longitudinal and Functional Data Analysis
    Staicu, Ana-Maria
    Li, Yingxing
    Crainiceanu, Ciprian M.
    Ruppert, David
    [J]. SCANDINAVIAN JOURNAL OF STATISTICS, 2014, 41 (04) : 932 - 949
  • [32] Temporally-Constrained Group Sparse Learning for Longitudinal Data Analysis
    Zhang, Daoqiang
    Liu, Jun
    Shen, Dinggang
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI 2012, PT III, 2012, 7512 : 264 - 271
  • [33] Sparse logistic functional principal component analysis for binary data
    Rou Zhong
    Shishi Liu
    Haocheng Li
    Jingxiao Zhang
    [J]. Statistics and Computing, 2023, 33
  • [34] Sparse logistic functional principal component analysis for binary data
    Zhong, Rou
    Liu, Shishi
    Li, Haocheng
    Zhang, Jingxiao
    [J]. STATISTICS AND COMPUTING, 2023, 33 (01)
  • [35] Emerging issues in longitudinal and functional data analysis - Introduction
    Davidian, M
    Lin, XH
    Wang, JL
    [J]. STATISTICA SINICA, 2004, 14 (03) : 613 - 614
  • [36] Functional principal component analysis for longitudinal and survival data
    Yao, Fang
    [J]. STATISTICA SINICA, 2007, 17 (03) : 965 - 983
  • [37] Regression Analysis of Asynchronous Longitudinal Functional and Scalar Data
    Li, Ting
    Li, Tengfei
    Zhu, Zhongyi
    Zhu, Hongtu
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (539) : 1228 - 1242
  • [38] SPARSE VARYING COEFFICIENT MODELS FOR LONGITUDINAL DATA
    Noh, Hoh Suk
    Park, Byeong U.
    [J]. STATISTICA SINICA, 2010, 20 (03) : 1183 - 1202
  • [39] Modeling sparse longitudinal data in early neurodevelopment
    Chen, Yaqing
    Dubey, Paromita
    Muller, Hans-Georg
    Bruchhage, Muriel
    Wang, Jane-Ling
    Deoni, Sean
    [J]. NEUROIMAGE, 2021, 237
  • [40] Modeling sparse longitudinal data on Riemannian manifolds
    Dai, Xiongtao
    Lin, Zhenhua
    Muller, Hans-Georg
    [J]. BIOMETRICS, 2021, 77 (04) : 1328 - 1341