FULLY EFFICIENT ROBUST ESTIMATION, OUTLIER DETECTION AND VARIABLE SELECTION VIA PENALIZED REGRESSION

被引:18
|
作者
Kong, Dehan [1 ]
Bondell, Howard D. [2 ]
Wu, Yichao [2 ]
机构
[1] Univ Toronto, Dept Stat Sci, Toronto, ON M5S 3G3, Canada
[2] North Carolina State Univ, Dept Stat, Raleigh, NC 27695 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
Adaptive; breakdown point; least trimmed squares; outliers; penalized regression; robust regression; variable selection; LEAST ANGLE REGRESSION; SQUARES REGRESSION; ORACLE PROPERTIES; MODEL SELECTION; HIGH BREAKDOWN; LASSO; LIKELIHOOD; SHRINKAGE;
D O I
10.5705/ss.202016.0441
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This paper studies the outlier detection and variable selection problem in linear regression. A mean shift parameter is added to the linear model to reflect the effect of outliers, where an outlier has a nonzero shift parameter. We then apply an adaptive regularization to these shift parameters to shrink most of them to zero. Those observations with nonzero mean shift parameter estimates are regarded as outliers. An L1 penalty is added to the regression parameters to select important predictors. We propose an efficient algorithm to solve this jointly penalized optimization problem and use the extended Bayesian information criteria tuning method to select the regularization parameters, since the number of parameters exceeds the sample size. Theoretical results are provided in terms of high breakdown point, full efficiency, as well as outlier detection consistency. We illustrate our method with simulations and data. Our method is extended to high-dimensional problems with dimension much larger than the sample size.
引用
收藏
页码:1031 / 1052
页数:22
相关论文
共 50 条
  • [31] Variable selection via generalized SELO-penalized linear regression models
    SHI Yue-yong
    CAO Yong-xiu
    YU Ji-chang
    JIAO Yu-ling
    Applied Mathematics:A Journal of Chinese Universities, 2018, 33 (02) : 145 - 162
  • [32] Variable Selection via Generalized SELO-Penalized Cox Regression Models
    Shi Yueyong
    Xu Deyi
    Cao Yongxiu
    Jiao Yuling
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2019, 32 (02) : 709 - 736
  • [33] Simultaneous variable selection and outlier detection using a robust genetic algorithm
    Wiegand, Patrick
    Pell, Randy
    Comas, Enric
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2009, 98 (02) : 108 - 114
  • [34] Robust Moderately Clipped LASSO for Simultaneous Outlier Detection and Variable Selection
    Peng, Yang
    Luo, Bin
    Gao, Xiaoli
    SANKHYA-SERIES B-APPLIED AND INTERDISCIPLINARY STATISTICS, 2022, 84 (02): : 694 - 707
  • [35] Variable selection via generalized SELO-penalized linear regression models
    Yue-yong Shi
    Yong-xiu Cao
    Ji-chang Yu
    Yu-ling Jiao
    Applied Mathematics-A Journal of Chinese Universities, 2018, 33 : 145 - 162
  • [36] Variable Selection via Generalized SELO-Penalized Cox Regression Models
    SHI Yueyong
    XU Deyi
    CAO Yongxiu
    JIAO Yuling
    Journal of Systems Science & Complexity, 2019, 32 (02) : 709 - 736
  • [37] Variable selection via generalized SELO-penalized linear regression models
    Shi Yue-yong
    Cao Yong-xiu
    Yu Ji-chang
    Jiao Yu-ling
    APPLIED MATHEMATICS-A JOURNAL OF CHINESE UNIVERSITIES SERIES B, 2018, 33 (02) : 145 - 162
  • [38] Variable Selection via Generalized SELO-Penalized Cox Regression Models
    Yueyong Shi
    Deyi Xu
    Yongxiu Cao
    Yuling Jiao
    Journal of Systems Science and Complexity, 2019, 32 : 709 - 736
  • [39] Robust Moderately Clipped LASSO for Simultaneous Outlier Detection and Variable Selection
    Yang Peng
    Bin Luo
    Xiaoli Gao
    Sankhya B, 2022, 84 : 694 - 707
  • [40] High-dimensional macroeconomic forecasting and variable selection via penalized regression
    Uematsu, Yoshimasa
    Tanaka, Shinya
    ECONOMETRICS JOURNAL, 2019, 22 (01): : 34 - +