Penalized robust estimators in sparse logistic regression

被引:4
|
作者
Bianco, Ana M. [1 ,2 ]
Boente, Graciela [3 ,4 ]
Chebi, Gonzalo [3 ,4 ]
机构
[1] Univ Buenos Aires, Fac Ciencias Exactas & Nat, Inst Calculo, Ciudad Univ,Pabellon 2, RA-1428 Buenos Aires, DF, Argentina
[2] Consejo Nacl Invest Cient & Tecn, Ciudad Univ,Pabellon 2, RA-1428 Buenos Aires, DF, Argentina
[3] Univ Buenos Aires, Fac Ciencias Exactas & Nat, Dept Matemat, Ciudad Univ,Pabellon 1, RA-1428 Buenos Aires, DF, Argentina
[4] Consejo Nacl Invest Cient & Tecn, Ciudad Univ,Pabellon 1, RA-1428 Buenos Aires, DF, Argentina
关键词
Logistic regression; Outliers; Penalty functions; Robust estimation; Sparse models; VARIABLE SELECTION; LIKELIHOOD; INFERENCE; LASSO;
D O I
10.1007/s11749-021-00792-w
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Sparse covariates are frequent in classification and regression problems where the task of variable selection is usually of interest. As it is well known, sparse statistical models correspond to situations where there are only a small number of nonzero parameters, and for that reason, they are much easier to interpret than dense ones. In this paper, we focus on the logistic regression model and our aim is to address robust and penalized estimation for the regression parameter. We introduce a family of penalized weighted M-type estimators for the logistic regression parameter that are stable against atypical data. We explore different penalization functions including the so-called Sign penalty. We provide a careful analysis of the estimators convergence rates as well as their variable selection capability and asymptotic distribution for fixed and random penalties. A robust cross-validation criterion is also proposed. Through a numerical study, we compare the finite sample performance of the classical and robust penalized estimators, under different contamination scenarios. The analysis of real datasets enables to investigate the stability of the penalized estimators in the presence of outliers.
引用
收藏
页码:563 / 594
页数:32
相关论文
共 50 条
  • [31] Performance of Some Logistic Ridge Regression Estimators
    Kibria, B. M. Golam
    Mansson, Kristofer
    Shukur, Ghazi
    [J]. COMPUTATIONAL ECONOMICS, 2012, 40 (04) : 401 - 414
  • [32] Variable selection for sparse logistic regression
    Zanhua Yin
    [J]. Metrika, 2020, 83 : 821 - 836
  • [33] Image Annotation by Sparse Logistic Regression
    He, Siqiong
    Jia, Jinzhu
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING-PCM 2010, PT II, 2010, 6298 : 22 - +
  • [34] Sparse data and use of logistic regression
    Siddarth, Prabha
    [J]. EPILEPSIA, 2018, 59 (05) : 1085 - 1086
  • [35] Sparse Logistic Regression with Logical Features
    Zou, Yuan
    Roos, Teemu
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2016, PT I, 2016, 9651 : 316 - 327
  • [36] An aggregation method for sparse logistic regression
    Liu, Zhe
    [J]. INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2017, 17 (01) : 85 - 96
  • [37] Sparse logistic regression on functional data
    Xu, Yunnan
    Du, Pang
    Robertson, John
    Senger, Ryan
    [J]. STATISTICS AND ITS INTERFACE, 2022, 15 (02) : 171 - 179
  • [38] Variable selection for sparse logistic regression
    Yin, Zanhua
    [J]. METRIKA, 2020, 83 (07) : 821 - 836
  • [39] Some New Robust Estimators for Circular Logistic Regression Model with Applications on Meteorological and Ecological Data
    Alshqaq, Shokrya S.
    Ahmadini, Abdullah A.
    Abuzaid, Ali H.
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
  • [40] Logistic Regression with Robust Bootstrapping
    Li, Yawei
    Fauss, Michael
    Zoubir, Abdelhak M.
    [J]. 2019 IEEE 8TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP 2019), 2019, : 346 - 350