Forward Variable Selection for Sparse Ultra-High Dimensional Varying Coefficient Models

Cited by: 39
Authors
Cheng, Ming-Yen [1]
Honda, Toshio [2]
Zhang, Jin-Ting [3]
Affiliations
[1] Natl Taiwan Univ, Dept Math, Taipei 106, Taiwan
[2] Hitotsubashi Univ, Grad Sch Econ, Tokyo, Japan
[3] Natl Univ Singapore, Dept Stat & Appl Probabil, Singapore, Singapore
Keywords
BIC; B-spline; Independence screening; Marginal model; Sub-Gaussian error; BAYESIAN INFORMATION CRITERION; ORACLE PROPERTIES; LONGITUDINAL DATA; DANTZIG SELECTOR; ADDITIVE-MODELS; FEATURE SPACE; REGRESSION; LASSO; SHRINKAGE; LIKELIHOOD
DOI
10.1080/01621459.2015.1080708
Chinese Library Classification (CLC) number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714
Abstract
Varying coefficient models have numerous applications in a wide scope of scientific areas. While enjoying nice interpretability, they also allow for flexibility in modeling dynamic impacts of the covariates. But, in the new era of big data, it is challenging to select the relevant variables when the dimensionality is very large. Recently, several works have focused on this important problem based on sparsity assumptions; they are subject to some limitations, however. We introduce an appealing forward selection procedure. It selects important variables sequentially according to a reduction in sum of squares criterion and it employs a Bayesian information criterion (BIC)-based stopping rule. Clearly, it is simple to implement and fast to compute, and possesses many other desirable properties from theoretical and numerical viewpoints. The BIC is a special case of the extended BIC (EBIC) when an extra tuning parameter in the latter vanishes. We establish rigorous screening consistency results when either BIC or EBIC is used as the stopping criterion. The theoretical results depend on some conditions on the eigenvalues related to the design matrices, which can be relaxed in some situations. Results of an extensive simulation study and a real data example are also presented to show the efficacy and usefulness of our procedure. Supplementary materials for this article are available online.
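The procedure described in the abstract (add one variable at a time by largest reduction in the residual sum of squares, stop when BIC/EBIC no longer decreases) can be illustrated with a minimal sketch. This is not the authors' implementation. Several pieces are assumptions made for illustration: a low-order polynomial basis stands in for the B-spline basis, the EBIC is written in one common form, log(RSS/n) + |S| L (log n + 2 eta log p)/n, which reduces to BIC when eta = 0, and all function and variable names (`basis`, `rss`, `ebic`, `forward_select`) are hypothetical.

```python
import numpy as np

def basis(t, L=4):
    # Polynomial basis 1, t, ..., t^(L-1); a stand-in (assumption) for B-splines.
    return np.vander(t, L, increasing=True)

def rss(Y, D):
    # Residual sum of squares from a least-squares fit of Y on the columns of D.
    coef, *_ = np.linalg.lstsq(D, Y, rcond=None)
    resid = Y - D @ coef
    return float(resid @ resid)

def ebic(rss_val, n, size, L, p, eta):
    # Assumed EBIC form; eta = 0 gives the ordinary BIC.
    return np.log(rss_val / n) + size * L * (np.log(n) + 2 * eta * np.log(p)) / n

def forward_select(Y, X, t, L=4, eta=0.0):
    # Model: Y_i = sum_j beta_j(t_i) X_ij + error, with beta_j(t) ~ basis(t) @ gamma_j.
    n, p = X.shape
    B = basis(t, L)
    blocks = X[:, :, None] * B[:, None, :]      # shape (n, p, L): X_j times each basis column
    selected, D = [], np.empty((n, 0))
    current = ebic(float(Y @ Y), n, 0, L, p, eta)  # empty model: RSS is Y'Y
    for _ in range(min(p, n // L - 1)):
        candidates = [j for j in range(p) if j not in selected]
        if not candidates:
            break
        # Pick the candidate whose inclusion gives the smallest RSS.
        scores = {j: rss(Y, np.hstack([D, blocks[:, j, :]])) for j in candidates}
        best = min(scores, key=scores.get)
        new = ebic(scores[best], n, len(selected) + 1, L, p, eta)
        if new >= current:                      # stopping rule: criterion stops decreasing
            break
        selected.append(best)
        D = np.hstack([D, blocks[:, best, :]])
        current = new
    return selected

# Small simulated example: only covariates 0 and 3 have nonzero coefficient functions.
rng = np.random.default_rng(0)
n, p = 300, 20
t = rng.uniform(0.0, 1.0, n)
X = rng.standard_normal((n, p))
Y = (1 + 2 * t) * X[:, 0] + (4 * t**2 - 1) * X[:, 3] + 0.5 * rng.standard_normal(n)
selected = forward_select(Y, X, t, L=4, eta=0.0)
print(sorted(selected))
```

Refitting the full least-squares problem for every candidate keeps the sketch short; for large p one would update the fit incrementally instead.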
Pages: 1209-1221
Page count: 13
Related papers
50 in total
  • [31] Variable selection for ultra-high-dimensional logistic models
    Du, Pang
    Wu, Pan
    Liang, Hua
    [J]. PERSPECTIVES ON BIG DATA ANALYSIS: METHODOLOGIES AND APPLICATIONS, 2014, 622 : 141 - 158
  • [32] Robust and sparse learning of varying coefficient models with high-dimensional features
    Xiong, Wei
    Tian, Maozai
    Tang, Manlai
    Pan, Han
    [J]. JOURNAL OF APPLIED STATISTICS, 2023, 50 (16) : 3312 - 3336
  • [33] Rates of convergence of the adaptive elastic net and the post-selection procedure in ultra-high dimensional sparse models
    Yang, Yuehan
    Yang, Hu
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2021, 50 (01) : 73 - 94
  • [34] Spline estimator for simultaneous variable selection and constant coefficient identification in high-dimensional generalized varying-coefficient models
    Lian, Heng
    Meng, Jie
    Zhao, Kaifeng
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2015, 141 : 81 - 103
  • [35] Variable selection for fixed effects varying coefficient models
    Gao Rong Li
    Heng Lian
    Peng Lai
    Heng Peng
    [J]. Acta Mathematica Sinica, English Series, 2015, 31 : 91 - 110
  • [36] Variable selection for varying coefficient models with measurement errors
    Peixin Zhao
    Liugen Xue
    [J]. Metrika, 2011, 74 : 231 - 245
  • [37] Variable selection of the quantile varying coefficient regression models
    Weihua Zhao
    Riquan Zhang
    Yazhao Lv
    Jicai Liu
    [J]. Journal of the Korean Statistical Society, 2013, 42 : 343 - 358
  • [38] Variable selection of the quantile varying coefficient regression models
    Zhao, Weihua
    Zhang, Riquan
    Lv, Yazhao
    Liu, Jicai
    [J]. JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2013, 42 (03) : 343 - 358
  • [39] Variable bandwidth selection in varying-coefficient models
    Zhang, WY
    Lee, SY
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2000, 74 (01) : 116 - 134
  • [40] Variable selection for varying coefficient models with measurement errors
    Zhao, Peixin
    Xue, Liugen
    [J]. METRIKA, 2011, 74 (02) : 231 - 245