Variable selection for semiparametric regression models with iterated penalisation

被引:2
|
作者
Dai, Ying [1 ]
Ma, Shuangge [1 ]
机构
[1] Yale Univ, Sch Publ Hlth, New Haven, CT 06520 USA
基金
美国国家科学基金会;
关键词
iterated penalisation; variable selection; semiparametric regression; ORACLE PROPERTIES; ADAPTIVE LASSO; LINEAR-MODELS; RISK;
D O I
10.1080/10485252.2012.661054
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Semiparametric regression models with multiple covariates are commonly encountered. When there are covariates that are not associated with a response variable, variable selection may lead to sparser models, more lucid interpretations and more accurate estimation. In this study, we adopt a sieve approach for the estimation of nonparametric covariate effects in semiparametric regression models. We adopt a two-step iterated penalisation approach for variable selection. In the first step, a mixture of Lasso and group Lasso penalties are employed to conduct the first-round variable selection and obtain the initial estimate. In the second step, a mixture of weighted Lasso and weighted group Lasso penalties, with weights constructed using the initial estimate, are employed for variable selection. We show that the proposed iterated approach has the variable selection consistency property, even when the number of unknown parameters diverges with sample size. Numerical studies, including simulation and analysis of a diabetes data set, show satisfactory performance of the proposed approach.
引用
收藏
页码:283 / 298
页数:16
相关论文
共 50 条
  • [21] ESTIMATION AND VARIABLE SELECTION FOR SEMIPARAMETRIC ADDITIVE PARTIAL LINEAR MODELS
    Liu, Xiang
    Wang, Li
    Liang, Hua
    STATISTICA SINICA, 2011, 21 (03) : 1225 - 1248
  • [22] Variable selection in strong hierarchical semiparametric models for longitudinal data
    Zeng, Xianbin
    Ma, Shuangge
    Qin, Yichen
    Li, Yang
    STATISTICS AND ITS INTERFACE, 2015, 8 (03) : 355 - 365
  • [23] Variable selection for semiparametric varying coefficient partially linear models
    Zhao, Peixin
    Xue, Liugen
    STATISTICS & PROBABILITY LETTERS, 2009, 79 (20) : 2148 - 2157
  • [24] Component selection and variable selection for mixture regression models
    Qi, Xuefei
    Xu, Xingbai
    Feng, Zhenghui
    Peng, Heng
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2025, 206
  • [25] Variable selection for semiparametric errors-in-variables regression model with longitudinal data
    Tian, Ruiqin
    Xue, Liugen
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2014, 84 (08) : 1654 - 1669
  • [26] Simultaneous variable selection and estimation in semiparametric regression of mixed panel count data
    Ge, Lei
    Hu, Tao
    Li, Yang
    BIOMETRICS, 2024, 80 (01)
  • [27] BANDWIDTH SELECTION IN SEMIPARAMETRIC ESTIMATION OF CENSORED LINEAR-REGRESSION MODELS
    HALL, P
    HOROWITZ, JL
    ECONOMETRIC THEORY, 1990, 6 (02) : 123 - 150
  • [28] Variable selection and transformation in linear regression models
    Yeo, IK
    STATISTICS & PROBABILITY LETTERS, 2005, 72 (03) : 219 - 226
  • [29] Simultaneous variable selection for heteroscedastic regression models
    Zhang, Zhongzhan
    Wang, Darong
    PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON FINANCIAL ENGINEERING AND RISK MANAGEMENT 2008, 2008, : 141 - 143
  • [30] Variable selection in Functional Additive Regression Models
    Febrero-Bande, Manuel
    Gonzalez-Manteiga, Wenceslao
    Oviedo de la Fuente, Manuel
    FUNCTIONAL STATISTICS AND RELATED FIELDS, 2017, : 113 - 122