Fast robust variable selection using VIF regression in large datasets

被引:2
|
作者
Seo, Han Son [1 ]
机构
[1] Konkuk Univ, Dept Appl Stat, 120 Neungdong Ro, Seoul 05029, South Korea
关键词
large dataset; linear regression; stagewise regression; variable selection;
D O I
10.5351/KJAS.2018.31.4.463
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Variable selection algorithms for linear regression models of large data are considered. Many algorithms are proposed focusing on the speed and the robustness of algorithms. Among them variance inflation factor (VIF) regression is fast and accurate due to the use of a streamwise regression approach. But a VIF regression is susceptible to outliers because it estimates a model by a least-square method. A robust criterion using a weighted estimator has been proposed for the robustness of algorithm; in addition, a robust VIF regression has also been proposed for the same purpose. In this article a fast and robust variable selection method is suggested via a VIF regression with detecting and removing potential outliers. A simulation study and an analysis of a dataset are conducted to compare the suggested method with other methods.
引用
收藏
页码:463 / 473
页数:11
相关论文
共 50 条
  • [31] A FAST PROCEDURE OF VARIABLE SELECTION IN LINEAR REGRESSION MODEL
    安柏庆
    [J]. Journal of Systems Science & Complexity, 1989, (03) : 266 - 274
  • [32] Robust estimation and variable selection in heteroscedastic regression model using least favorable distribution
    Guney, Yesim
    Tuac, Yetkin
    Ozdemir, Senay
    Arslan, Olcay
    [J]. COMPUTATIONAL STATISTICS, 2021, 36 (02) : 805 - 827
  • [33] Robust estimation and variable selection in heteroscedastic regression model using least favorable distribution
    Yeşim Güney
    Yetkin Tuaç
    Şenay Özdemir
    Olcay Arslan
    [J]. Computational Statistics, 2021, 36 : 805 - 827
  • [34] Using Proximity Graph Cut for Fast and Robust Instance-Based Classification in Large Datasets
    Protasov, Stanislav
    Khan, Adil Mehmood
    [J]. COMPLEXITY, 2021, 2021
  • [35] Outlier Detection and Robust Variable Selection for Least Angle Regression
    Shahriari, Shirin
    Faria, Susana
    Manuela Goncalves, A.
    Van Aelst, Stefan
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2014, PT III, 2014, 8581 : 512 - +
  • [36] Robust Variable Selection and Estimation Based on Kernel Modal Regression
    Guo, Changying
    Song, Biqin
    Wang, Yingjie
    Chen, Hong
    Xiong, Huijuan
    [J]. ENTROPY, 2019, 21 (04)
  • [37] Robust Signed-Rank Variable Selection in Linear Regression
    Abebe, Asheber
    Bindele, Huybrechts F.
    [J]. ROBUST RANK-BASED AND NONPARAMETRIC METHODS, 2016, 168 : 25 - 45
  • [38] A simulation study on classic and robust variable selection in linear regression
    Çetin, Meral
    Erar, Aydin
    [J]. APPLIED MATHEMATICS AND COMPUTATION, 2006, 175 (02) : 1629 - 1643
  • [39] New Robust Variable Selection Methods for Linear Regression Models
    Chen, Ziqi
    Tang, Man-Lai
    Gao, Wei
    Shi, Ning-Zhong
    [J]. SCANDINAVIAN JOURNAL OF STATISTICS, 2014, 41 (03) : 725 - 741
  • [40] Robust estimation and variable selection for function-on-scalar regression
    Cai, Xiong
    Xue, Liugen
    Ca, Jiguo
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2022, 50 (01): : 162 - 179