Variable selection and estimation in generalized linear models with the seamless L0 penalty

被引:17
|
作者
Li, Zilin [2 ]
Wang, Sijian [3 ,4 ]
Lin, Xihong [1 ]
机构
[1] Harvard Univ, Dept Biostat, Boston, MA 02115 USA
[2] Tsinghua Univ, Dept Math, Beijing 100084, Peoples R China
[3] Univ Wisconsin, Dept Biostat & Med Informat, Madison, WI USA
[4] Univ Wisconsin, Dept Stat, Madison, WI 53706 USA
基金
中国国家自然科学基金;
关键词
BIC; consistency; coordinate descent algorithm; model selection; oracle property; penalized likelihood methods; SELO penalty; tuning parameter selection; NONCONCAVE PENALIZED LIKELIHOOD; ORACLE PROPERTIES; CROSS-VALIDATION; DIVERGING NUMBER; LASSO; REGRESSION; PARAMETER; CRITERION; RISK;
D O I
10.1002/cjs.11165
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this paper, we propose variable selection and estimation in generalized linear models using the seamless $L_0$ (SELO) penalized likelihood approach. The SELO penalty is a smooth function that very closely resembles the discontinuous $L_0$ penalty. We develop an efficient algorithm to fit the model, and show that the SELO-GLM procedure has the oracle property in the presence of a diverging number of variables. We propose a Bayesian information criterion (BIC) to select the tuning parameter. We show that under some regularity conditions, the proposed SELO-GLM/BIC procedure consistently selects the true model. We perform simulation studies to evaluate the finite sample performance of the proposed methods. Our simulation studies show that the proposed SELO-GLM procedure has a better finite sample performance than several existing methods, especially when the number of variables is large and the signals are weak. We apply the SELO-GLM to analyze a breast cancer genetic dataset to identify the SNPs that are associated with breast cancer risk. The Canadian Journal of Statistics 40: 745769; 2012 (C) 2012 Statistical Society of Canada
引用
收藏
页码:745 / 769
页数:25
相关论文
共 50 条
  • [1] VARIABLE SELECTION AND ESTIMATION WITH THE SEAMLESS-L0 PENALTY
    Dicker, Lee
    Huang, Baosheng
    Lin, Xihong
    [J]. STATISTICA SINICA, 2013, 23 (02) : 929 - 962
  • [2] Variable selection and estimation using a continuous approximation to the L0 penalty
    Wang, Yanxin
    Fan, Qibin
    Zhu, Li
    [J]. ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2018, 70 (01) : 191 - 214
  • [3] Model selection in high-dimensional quantile regression with seamless L0 penalty
    Ciuperca, Gabriela
    [J]. STATISTICS & PROBABILITY LETTERS, 2015, 107 : 313 - 323
  • [4] Simultaneous variable selection and estimation for survival data via the Gaussian seamless-L0 penalty
    Liu, Zili
    Wang, Hong
    [J]. STATISTICS IN MEDICINE, 2024, 43 (08) : 1509 - 1526
  • [5] Efficient Regularized Regression with L0 Penalty for Variable Selection and Network Construction
    Liu, Zhenqiu
    Li, Gang
    [J]. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2016, 2016
  • [6] Variable selection and estimation for multivariate panel count data via the seamless-L0 penalty
    Zhang, Haixiang
    Sun, Jianguo
    Wang, Dehui
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2013, 41 (02): : 368 - 385
  • [7] Variable selection and estimation for accelerated failure time model via seamless-L0 penalty
    Xu, Yin
    Wang, Ning
    [J]. AIMS MATHEMATICS, 2022, 8 (01): : 1195 - 1207
  • [8] Variables selection using L0 penalty
    Zhang, Tonglin
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2024, 190
  • [9] Scalable network estimation with L0 penalty
    Kim, Junghi
    Zhu, Hongtu
    Wang, Xiao
    Do, Kim-Anh
    [J]. STATISTICAL ANALYSIS AND DATA MINING, 2021, 14 (01) : 18 - 30
  • [10] SPARSE VARIABLE NOISY PCA USING l0 PENALTY
    Ulfarsson, M. O.
    Solo, V.
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 3950 - 3953