Additive partially linear models for massive heterogeneous data

被引:6
|
作者
Wang, Binhuan [1 ]
Fang, Yixin [2 ]
Lian, Heng [3 ]
Liang, Hua [4 ]
机构
[1] NYU, Dept Populat Hlth, Sch Med, New York, NY 10003 USA
[2] New Jersey Inst Technol, Dept Math Sci, Newark, NJ 07102 USA
[3] City Univ Hong Kong, Dept Math, Hong Kong, Peoples R China
[4] George Washington Univ, Dept Stat, Washington, DC 20052 USA
来源
ELECTRONIC JOURNAL OF STATISTICS | 2019年 / 13卷 / 01期
关键词
Divide-and-conquer; homogeneity; heterogeneity; oracle property; regression splines;
D O I
10.1214/18-EJS1528
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider an additive partially linear framework for modelling massive heterogeneous data. The major goal is to extract multiple common features simultaneously across all sub-populations while exploring heterogeneity of each sub-population. We propose an aggregation type of estimators for the commonality parameters that possess the asymptotic optimal bounds and the asymptotic distributions as if there were no heterogeneity. This oracle result holds when the number of sub-populations does not grow too fast and the tuning parameters are selected carefully. A plugin estimator for the heterogeneity parameter is further constructed, and shown to possess the asymptotic distribution as if the commonality information were available. Furthermore, we develop a heterogeneity test for the linear components and a homogeneity test for the non-linear components accordingly. The performance of the proposed methods is evaluated via simulation studies and an application to the Medicare Provider Utilization and Payment data.
引用
收藏
页码:391 / 431
页数:41
相关论文
共 50 条
  • [1] A PARTIALLY LINEAR FRAMEWORK FOR MASSIVE HETEROGENEOUS DATA
    Zhao, Tianqi
    Cheng, Guang
    Liu, Han
    ANNALS OF STATISTICS, 2016, 44 (04): : 1400 - 1437
  • [2] Subgroup analysis for heterogeneous additive partially linear models and its application to car sales data
    Liu, Lili
    Lin, Lu
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2019, 138 : 239 - 259
  • [3] Partially Linear Additive Models
    Toledo, Camila G.
    Lopes, Joysce S.
    Ferreira, Clecio S.
    SIGMAE, 2024, 13 (01): : 24 - 31
  • [4] Partially Linear Functional Additive Models for Multivariate Functional Data
    Wong, Raymond K. W.
    Li, Yehua
    Zhu, Zhengyuan
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2019, 114 (525) : 406 - 418
  • [5] Sparse Partially Linear Additive Models
    Lou, Yin
    Bien, Jacob
    Caruana, Rich
    Gehrke, Johannes
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2016, 25 (04) : 1026 - 1040
  • [6] Partially Linear Additive Gaussian Graphical Models
    Geng, Sinong
    Yan, Minhao
    Kolar, Mladen
    Koyejo, Oluwasanmi
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [7] Efficient estimation of additive partially linear models
    Li, Q
    INTERNATIONAL ECONOMIC REVIEW, 2000, 41 (04) : 1073 - 1092
  • [8] Estimation of fixed effects panel data partially linear additive regression models
    Ai, Chunrong
    You, Jinhong
    Zhou, Yong
    ECONOMETRICS JOURNAL, 2014, 17 (01): : 83 - 106
  • [9] Estimation and variable selection of quantile partially linear additive models for correlated data
    Zhao, Weihua
    Li, Rui
    Lian, Heng
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2024, 94 (02) : 315 - 345
  • [10] Subgroup detection in the heterogeneous partially linear additive Cox model
    Cai, Tingting
    Hu, Tao
    JOURNAL OF NONPARAMETRIC STATISTICS, 2024,