Functional L-Optimality Subsampling for Functional Generalized Linear Models with Massive Data

被引:0
|
作者
Liu, Hua [1 ]
You, Jinhong [2 ]
Cao, Jiguo [3 ]
机构
[1] Xi An Jiao Tong Univ, Sch Econ & Finance, Xian 710049, Shaanxi, Peoples R China
[2] Shanghai Univ Finance & Econ, Sch Stat & Management, Shanghai 200433, Peoples R China
[3] Simon Fraser Univ, Dept Stat & Actuarial Sci, Burnaby, BC V5A 1S6, Canada
基金
中国国家自然科学基金; 中国博士后科学基金; 加拿大自然科学与工程研究理事会;
关键词
Efficient computation algorithm; Functional data analysis; Kidney transplant; Large-scale data; Penalized B-spline; VARYING COEFFICIENT MODEL; ASYMPTOTIC PROPERTIES; ESTIMATORS; REGRESSION; SPLINES;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Massive data bring the big challenges of memory and computation for analysis. These challenges can be tackled by taking subsamples from the full data as a surrogate. For functional data, it is common to collect multiple measurements over their domains, which require even more memory and computation time when the sample size is large. The computation would be much more intensive when statistical inference is required through bootstrap samples. Motivated by analyzing large-scale kidney transplant data, we propose an optimal subsampling method based on the functional L-optimality criterion for functional generalized linear models. To the best of our knowledge, this is the first attempt to propose a subsampling method for functional data analysis. The asymptotic properties of the resultant estimators are also established. The analysis results from extensive simulation studies and from the kidney transplant data show that the functional L-optimality subsampling (FLoS) method is much better than the uniform subsampling approach and can well approximate the results based on the full data while dramatically reducing the computation time and memory.
引用
收藏
页数:41
相关论文
共 50 条
  • [1] Optimal decorrelated score subsampling for generalized linear models with massive data
    Gao, Junzhuo
    Wang, Lei
    Lian, Heng
    SCIENCE CHINA-MATHEMATICS, 2024, 67 (02) : 405 - 430
  • [2] Optimal decorrelated score subsampling for generalized linear models with massive data
    Junzhuo Gao
    Lei Wang
    Heng Lian
    Science China Mathematics, 2024, 67 : 405 - 430
  • [3] Optimal subsampling for double generalized linear models with heterogeneous massive data
    Xiong, Zhengyu
    Jin, Haoyu
    Wu, Liucang
    Yang, Lanjun
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2025,
  • [4] Optimal decorrelated score subsampling for generalized linear models with massive data
    Junzhuo Gao
    Lei Wang
    Heng Lian
    Science China(Mathematics), 2024, 67 (02) : 405 - 430
  • [5] Outcome dependent subsampling divide and conquer in generalized linear models for massive data
    Yin, Jie
    Ding, Jieli
    Yang, Changming
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2025, 237
  • [6] Generalized functional linear models
    Müller, HG
    Stadtmüller, U
    ANNALS OF STATISTICS, 2005, 33 (02): : 774 - 805
  • [7] Generalized linear models with functional predictors
    James, GM
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2002, 64 : 411 - 432
  • [8] Dependent generalized functional linear models
    Jadhav, S.
    Koul, H. L.
    Lu, Q.
    BIOMETRIKA, 2017, 104 (04) : 987 - 994
  • [9] Estimation in generalized linear models for functional data via penalized likelihood
    Cardot, H
    Sarda, P
    JOURNAL OF MULTIVARIATE ANALYSIS, 2005, 92 (01) : 24 - 41
  • [10] Partially Linear Generalized Single Index Models for Functional Data (PLGSIMF)
    Alahiane, Mohamed
    Ouassou, Idir
    Rachdi, Mustapha
    Vieu, Philippe
    STATS, 2021, 4 (04): : 793 - 813