Robust linear regression for high-dimensional data: An overview

Cited by: 29
Authors
Filzmoser, Peter [1 ]
Nordhausen, Klaus [1 ]
Affiliation
[1] Vienna Univ Technol, Inst Stat & Math Methods Econ, Wiedner Hauptstr 8-10, A-1040 Vienna, Austria
Keywords
dimension reduction; high-dimensional data; outlier; regression; sparsity; LEAST-SQUARES REGRESSION; VARIABLE SELECTION; RIDGE-REGRESSION; SPARSE; ESTIMATORS; PROJECTION; SHRINKAGE; OUTLIERS
DOI
10.1002/wics.1524
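Chinese Library Classification (CLC)
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714
Abstract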
Digitization as the process of converting information into numbers leads to bigger and more complex data sets, bigger also with respect to the number of measured variables. This makes it harder or impossible for the practitioner to identify outliers or observations that are inconsistent with an underlying model. Classical least-squares based procedures can be affected by those outliers. In the regression context, this means that the parameter estimates are biased, with consequences on the validity of the statistical inference, on regression diagnostics, and on the prediction accuracy. Robust regression methods aim at assigning appropriate weights to observations that deviate from the model. While robust regression techniques are widely known in the low-dimensional case, researchers and practitioners might still not be very familiar with developments in this direction for high-dimensional data. Recently, different strategies have been proposed for robust regression in the high-dimensional case, typically based on dimension reduction, on shrinkage, including sparsity, and on combinations of such techniques. A very recent concept is downweighting single cells of the data matrix rather than complete observations, with the goal to make better use of the model-consistent information, and thus to achieve higher efficiency of the parameter estimates.
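The idea of assigning weights to observations that deviate from the model can be illustrated with a minimal sketch. It assumes a classical Huber M-estimator fitted by iteratively reweighted least squares for a single predictor, which is a low-dimensional textbook technique and not one of the high-dimensional methods surveyed in the paper. The function names `weighted_line_fit` and `huber_irls`, the tuning constant `c = 1.345`, and the toy data are illustrative assumptions.

```python
from statistics import median


def weighted_line_fit(x, y, w):
    """Weighted least-squares fit of y = a + b*x; returns (a, b)."""
    sw = sum(w)
    mx = sum(wi * xi for wi, xi in zip(w, x)) / sw
    my = sum(wi * yi for wi, yi in zip(w, y)) / sw
    sxx = sum(wi * (xi - mx) ** 2 for wi, xi in zip(w, x))
    sxy = sum(wi * (xi - mx) * (yi - my) for wi, xi, yi in zip(w, x, y))
    slope = sxy / sxx
    return my - slope * mx, slope


def huber_irls(x, y, c=1.345, n_iter=50, tol=1e-10):
    """Huber M-estimate via iteratively reweighted least squares (sketch).

    Assumption: the residual scale is re-estimated in every iteration by the
    normalized median absolute residual (factor 1.4826).
    """
    w = [1.0] * len(x)
    a, b = weighted_line_fit(x, y, w)  # start from ordinary least squares
    for _ in range(n_iter):
        res = [yi - (a + b * xi) for xi, yi in zip(x, y)]
        scale = 1.4826 * median(abs(r) for r in res)
        if scale == 0.0:  # at least half of the points are fitted exactly
            break
        # Huber weights: 1 for small residuals, c*scale/|r| for large ones.
        w = [min(1.0, c * scale / abs(r)) if r != 0.0 else 1.0 for r in res]
        a_new, b_new = weighted_line_fit(x, y, w)
        converged = abs(a_new - a) + abs(b_new - b) < tol
        a, b = a_new, b_new
        if converged:
            break
    return a, b, w


# Toy data: y = 2 + 3x plus small deterministic noise, with one gross outlier.
x = [float(i) for i in range(10)]
noise = [0.1, -0.2, 0.15, -0.1, 0.05, -0.15, 0.2, -0.05, 0.1, 0.0]
y = [2.0 + 3.0 * xi + e for xi, e in zip(x, noise)]
y[9] = 100.0  # contaminated response

a_ols, b_ols = weighted_line_fit(x, y, [1.0] * len(x))
a_rob, b_rob, weights = huber_irls(x, y)
print("least-squares slope:", round(b_ols, 3))
print("Huber slope:", round(b_rob, 3))
print("weight of the outlying observation:", round(weights[9], 4))
```

In this toy setting the least-squares slope is pulled well away from 3 by the single contaminated response, while the reweighted fit stays close to it because the outlying observation receives a weight near zero. Huber weights do not protect against outliers in the predictors, which is one reason the methods discussed in the paper go beyond this sketch.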
Pages: 18