Interpolating Predictors in High-Dimensional Factor Regression

被引:0
|
作者
Bunea, Florentina [1 ]
Strimas-Mackey, Seth [1 ]
Wegkamp, Marten [1 ,2 ]
机构
[1] Cornell Univ, Dept Stat & Data Sci, Ithaca, NY 14850 USA
[2] Cornell Univ, Dept Math, Ithaca, NY 14850 USA
基金
加拿大自然科学与工程研究理事会;
关键词
Interpolation; minimum-norm predictor; finite sample risk bounds; prediction; factor models; high-dimensional regression; PRINCIPAL COMPONENTS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This work studies finite-sample properties of the risk of the minimum-norm interpolating predictor in high-dimensional regression models. If the effective rank of the covariance matrix sigma of the p regression features is much larger than the sample size n, we show that the min-norm interpolating predictor is not desirable, as its risk approaches the risk of trivially predicting the response by 0. However, our detailed finite-sample analysis reveals, surprisingly, that this behavior is not present when the regression response and the features are jointly low-dimensional, following a widely used factor regression model. Within this popular model class, and when the effective rank of sigma is smaller than n, while still allowing for p >> n, both the bias and the variance terms of the excess risk can be controlled, and the risk of the minimum-norm interpolating predictor approaches optimal benchmarks. Moreover, through a detailed analysis of the bias term, we exhibit model classes under which our upper bound on the excess risk approaches zero, while the corresponding upper bound in the recent work Bartlett et al. (2020) diverges. Furthermore, we show that the minimum-norm interpolating predictor analyzed under the factor regression model, despite being model-agnostic and devoid of tuning parameters, can have similar risk to predictors based on principal components regression and ridge regression, and can improve over LASSO based predictors, in the high-dimensional regime.
引用
收藏
页数:60
相关论文
共 50 条
  • [1] Interpolating Predictors in High-Dimensional Factor Regression
    Bunea, Florentina
    Strimas-Mackey, Seth
    Wegkamp, Marten
    [J]. Journal of Machine Learning Research, 2022, 23
  • [2] On robust regression with high-dimensional predictors
    El Karoui, Noureddine
    Bean, Derek
    Bickel, Peter J.
    Lim, Chinghway
    Yu, Bin
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2013, 110 (36) : 14557 - 14562
  • [3] High-dimensional regression with ordered multiple categorical predictors
    Huang, Lei
    Hang, Weiqiang
    Chao, Yue
    [J]. STATISTICS IN MEDICINE, 2020, 39 (03) : 294 - 309
  • [4] HIGH-DIMENSIONAL FACTOR REGRESSION FOR HETEROGENEOUS SUBPOPULATIONS
    Wang, Peiyao
    Li, Quefeng
    Shen, Dinggan
    Liu, Yufeng
    [J]. STATISTICA SINICA, 2023, 33 (01) : 27 - 53
  • [5] Adverse subpopulation regression for multivariate outcomes with high-dimensional predictors
    Zhu, Bin
    Dunson, David B.
    Ashley-Koch, Allison E.
    [J]. STATISTICS IN MEDICINE, 2012, 31 (29) : 4102 - 4113
  • [6] A comparative study on high-dimensional bayesian regression with binary predictors
    Slanzi, Debora
    Mameli, Valentina
    Brown, Philip J.
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2023, 52 (05) : 1979 - 1999
  • [7] Screen then select: a strategy for correlated predictors in high-dimensional quantile regression
    Jiang, Xuejun
    Liang, Yakun
    Wang, Haofeng
    [J]. STATISTICS AND COMPUTING, 2024, 34 (03)
  • [8] High-dimensional expectile regression incorporating graphical structure among predictors
    Pan, Yingli
    Zhao, Xiaoluo
    Wei, Sha
    Liu, Zhan
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2023, 93 (02) : 231 - 248
  • [9] Corrupted and missing predictors: Minimax bounds for high-dimensional linear regression
    Loh, Po-Ling
    Wainwright, Martin J.
    [J]. 2012 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY PROCEEDINGS (ISIT), 2012,
  • [10] Factor Analysis Regression for Predictive Modeling with High-Dimensional Data
    Carter, Randy
    Michael, Netsanet
    [J]. JOURNAL OF QUANTITATIVE ECONOMICS, 2022, 20 (SUPPL 1) : 115 - 132