FULLY EFFICIENT ROBUST ESTIMATION, OUTLIER DETECTION AND VARIABLE SELECTION VIA PENALIZED REGRESSION

被引:18
|
作者
Kong, Dehan [1 ]
Bondell, Howard D. [2 ]
Wu, Yichao [2 ]
机构
[1] Univ Toronto, Dept Stat Sci, Toronto, ON M5S 3G3, Canada
[2] North Carolina State Univ, Dept Stat, Raleigh, NC 27695 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
Adaptive; breakdown point; least trimmed squares; outliers; penalized regression; robust regression; variable selection; LEAST ANGLE REGRESSION; SQUARES REGRESSION; ORACLE PROPERTIES; MODEL SELECTION; HIGH BREAKDOWN; LASSO; LIKELIHOOD; SHRINKAGE;
D O I
10.5705/ss.202016.0441
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This paper studies the outlier detection and variable selection problem in linear regression. A mean shift parameter is added to the linear model to reflect the effect of outliers, where an outlier has a nonzero shift parameter. We then apply an adaptive regularization to these shift parameters to shrink most of them to zero. Those observations with nonzero mean shift parameter estimates are regarded as outliers. An L1 penalty is added to the regression parameters to select important predictors. We propose an efficient algorithm to solve this jointly penalized optimization problem and use the extended Bayesian information criteria tuning method to select the regularization parameters, since the number of parameters exceeds the sample size. Theoretical results are provided in terms of high breakdown point, full efficiency, as well as outlier detection consistency. We illustrate our method with simulations and data. Our method is extended to high-dimensional problems with dimension much larger than the sample size.
引用
收藏
页码:1031 / 1052
页数:22
相关论文
共 50 条
  • [1] Robust estimation and outlier detection for varying-coefficient models via penalized regression
    Yang, Guangren
    Xiang, Sijia
    Yao, Weixin
    Xu, Lin
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2022, 51 (10) : 5845 - 5856
  • [2] Outlier detection and robust variable selection via the penalized weighted LAD-LASSO method
    Jiang, Yunlu
    Wang, Yan
    Zhang, Jiantao
    Xie, Baojian
    Liao, Jibiao
    Liao, Wenhui
    JOURNAL OF APPLIED STATISTICS, 2021, 48 (02) : 234 - 246
  • [3] Outlier Detection and Robust Variable Selection for Least Angle Regression
    Shahriari, Shirin
    Faria, Susana
    Manuela Goncalves, A.
    Van Aelst, Stefan
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2014, PT III, 2014, 8581 : 512 - +
  • [4] Penalized weighted proportional hazards model for robust variable selection and outlier detection
    Luo, Bin
    Gao, Xiaoli
    Halabi, Susan
    STATISTICS IN MEDICINE, 2022, 41 (17) : 3398 - 3420
  • [5] Efficient and robust estimation of regression and scale parameters, with outlier detection
    Desgagne, Alain
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2021, 155
  • [6] Variable selection via penalized minimum φ-divergence estimation in logistic regression
    Sakate, D. M.
    Kashid, D. N.
    JOURNAL OF APPLIED STATISTICS, 2014, 41 (06) : 1233 - 1246
  • [7] Outlier Detection and Robust Estimation in Nonparametric Regression
    Kong, Dehan
    Bondell, Howard
    Shen, Weining
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
  • [8] Rapid outlier detection, model selection and variable selection using penalized likelihood estimation for general spatial models
    Song, Yunquan
    Fang, Minglu
    Wang, Yuanfeng
    Hou, Yiming
    SPATIAL STATISTICS, 2024, 61
  • [9] Variable selection in spatial regression via penalized least squares
    Wang, Haonan
    Zhu, Jun
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2009, 37 (04): : 607 - 624
  • [10] Outlier Detection Using Nonconvex Penalized Regression
    She, Yiyuan
    Owen, Art B.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (494) : 626 - 639