VIF Regression: A Fast Regression Algorithm For Large Data

Cited by: 1
Authors
Lin, Dongyu [1]
Foster, Dean P. [1]
Affiliations
[1] Univ Penn, Wharton Sch, Dept Stat, Philadelphia, PA 19104 USA
Keywords
variable selection; stepwise regression; variance inflation factor; false discovery rate
DOI
10.1109/ICDM.2009.146
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
We propose a fast regression algorithm that can substantially reduce the computational complexity of searching, yet retain good accuracy. It is also guaranteed to discover correlated features that are collectively predictive, and it avoids model over-fitting. Its ability to control the mFDR (marginal False Discovery Rate) statistically enables the one-pass search of the fast algorithm and guarantees the accuracy of the sparse model chosen by the algorithm without cross validation. Numerical results show that our algorithm is much faster than any other algorithm and is competitive in accuracy with the best but slower algorithms.
Pages: 848 - 853
Page count: 6
Related papers
50 in total
  • [1] VIF Regression: A Fast Regression Algorithm for Large Data
    Lin, Dongyu
    Foster, Dean P.
    Ungar, Lyle H.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (493) : 232 - 247
  • [2] Fast robust variable selection using VIF regression in large datasets
    Seo, Han Son
    [J]. KOREAN JOURNAL OF APPLIED STATISTICS, 2018, 31 (04) : 463 - 473
  • [3] ROBUST VIF REGRESSION WITH APPLICATION TO VARIABLE SELECTION IN LARGE DATA SETS
    Dupuis, Debbie J.
    Victoria-Feser, Maria-Pia
    [J]. ANNALS OF APPLIED STATISTICS, 2013, 7 (01) : 319 - 341
  • [4] Fast Algorithm for Multiway Regression
    Camarrone, Flavio
    Van Hulle, Marc M.
    [J]. 2017 22ND INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2017,
  • [5] The VIF and MSE in Raise Regression
    Salmeron Gomez, Roman
    Rodriguez Sanchez, Ainara
    Garcia Garcia, Catalina
    Garcia Perez, Jose
    [J]. MATHEMATICS, 2020, 8 (04)
  • [6] A fast imputation algorithm in quantile regression
    Hao Cheng
    Ying Wei
    [J]. Computational Statistics, 2018, 33 : 1589 - 1603
  • [7] A fast imputation algorithm in quantile regression
    Cheng, Hao
    Wei, Ying
    [J]. COMPUTATIONAL STATISTICS, 2018, 33 (04) : 1589 - 1603
  • [8] A Fast Structured Regression for Large Networks
    Zhou, Fang
    Ghalwash, Mohamed
    Obradovic, Zoran
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 106 - 115
  • [9] Algorithms for fast large scale data mining using logistic regression
    Rouhani-Kalleh, Omid
    [J]. 2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, VOLS 1 AND 2, 2007, : 155 - 162
  • [10] Kernel Logistic Regression Algorithm for Large-Scale Data Classification
    Elbashir, Murtada
    Wang, Jianxin
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2015, 12 (05) : 465 - 472