Variable selection for semiparametric regression models with iterated penalisation

被引:2
|
作者
Dai, Ying [1 ]
Ma, Shuangge [1 ]
机构
[1] Yale Univ, Sch Publ Hlth, New Haven, CT 06520 USA
基金
美国国家科学基金会;
关键词
iterated penalisation; variable selection; semiparametric regression; ORACLE PROPERTIES; ADAPTIVE LASSO; LINEAR-MODELS; RISK;
D O I
10.1080/10485252.2012.661054
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Semiparametric regression models with multiple covariates are commonly encountered. When there are covariates that are not associated with a response variable, variable selection may lead to sparser models, more lucid interpretations and more accurate estimation. In this study, we adopt a sieve approach for the estimation of nonparametric covariate effects in semiparametric regression models. We adopt a two-step iterated penalisation approach for variable selection. In the first step, a mixture of Lasso and group Lasso penalties are employed to conduct the first-round variable selection and obtain the initial estimate. In the second step, a mixture of weighted Lasso and weighted group Lasso penalties, with weights constructed using the initial estimate, are employed for variable selection. We show that the proposed iterated approach has the variable selection consistency property, even when the number of unknown parameters diverges with sample size. Numerical studies, including simulation and analysis of a diabetes data set, show satisfactory performance of the proposed approach.
引用
收藏
页码:283 / 298
页数:16
相关论文
共 50 条
  • [1] Variable Selection for Semiparametric Isotonic Regression Models
    Du, Jiang
    Zhang, Zhongzhan
    Xie, Tianfa
    [J]. NONLINEAR MATHEMATICS FOR UNCERTAINTY AND ITS APPLICATIONS, 2011, 100 : 525 - 532
  • [2] Ensemble estimation and variable selection with semiparametric regression models
    Shin, Sunyoung
    Liu, Yufeng
    Cole, Stephen R.
    Fine, Jason P.
    [J]. BIOMETRIKA, 2020, 107 (02) : 433 - 448
  • [3] Penalized estimating functions and variable selection in semiparametric regression models
    Johnson, Brent A.
    Lin, D. Y.
    Zeng, Donglin
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2008, 103 (482) : 672 - 680
  • [4] Variable selection in semiparametric regression modeling
    Li, Runze
    Liang, Hua
    [J]. ANNALS OF STATISTICS, 2008, 36 (01): : 261 - 286
  • [5] On selection of semiparametric spatial regression models
    Wang, Guannan
    Wang, Jue
    [J]. STAT, 2019, 8 (01):
  • [6] Variable Selection for Semiparametric Partially Linear Covariate-Adjusted Regression Models
    Du, Jiang
    Li, Gaorong
    Peng, Heng
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2015, 44 (13) : 2809 - 2826
  • [7] BNSP: an R Package for Fitting Bayesian Semiparametric Regression Models and Variable Selection
    Papageorgiou, Georgios
    [J]. R JOURNAL, 2018, 10 (02): : 526 - 548
  • [8] Variable selection in semiparametric regression models for longitudinal data with informative observation times
    Jazi, Omidali Aghababaei
    Pullenayegum, Eleanor
    [J]. STATISTICS IN MEDICINE, 2022, 41 (17) : 3281 - 3298
  • [9] Variable selection in semiparametric regression analysis for longitudinal data
    Zhao, Peixin
    Xue, Liugen
    [J]. ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2012, 64 (01) : 213 - 231
  • [10] Variable selection in semiparametric regression analysis for longitudinal data
    Peixin Zhao
    Liugen Xue
    [J]. Annals of the Institute of Statistical Mathematics, 2012, 64 : 213 - 231