Confidence Intervals and Hypothesis Testing for High-Dimensional Regression

被引:0
|
作者
Javanmard, Adel [1 ]
Montanari, Andrea [1 ,2 ]
机构
[1] Stanford Univ, Dept Elect Engn, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
基金
美国国家科学基金会;
关键词
hypothesis testing; confidence intervals; LASSO; high-dimensional models; bias of an estimator; VARIABLE SELECTION; MODEL SELECTION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fitting high-dimensional statistical models often requires the use of non-linear parameter estimation procedures. As a consequence, it is generally impossible to obtain an exact characterization of the probability distribution of the parameter estimates. This in turn implies that it is extremely challenging to quantify the uncertainty associated with a certain parameter estimate. Concretely, no commonly accepted procedure exists for computing classical measures of uncertainty and statistical significance as confidence intervals or p-values for these models. We consider here high-dimensional linear regression problem, and propose an efficient algorithm for constructing confidence intervals and p-values. The resulting confidence intervals have nearly optimal size. When testing for the null hypothesis that a certain parameter is vanishing, our method has nearly optimal power. Our approach is based on constructing a 'de-biased' version of regularized M-estimators. The new construction improves over recent work in the field in that it does not assume a special structure on the design matrix. We test our method on synthetic data and a highthroughput genomic data set about riboflavin production rate, made publicly available by Biihlmann et al. (2014).
引用
收藏
页码:2869 / 2909
页数:41
相关论文
共 50 条
  • [1] Confidence intervals and hypothesis testing for high-dimensional regression
    Javanmard, Adel
    Montanari, Andrea
    [J]. Journal of Machine Learning Research, 2014, 15 : 2869 - 2909
  • [2] Confidence Intervals and Hypothesis Testing for High-dimensional Quantile Regression: Convolution Smoothing and Debiasing
    Yan, Yibo
    Wang, Xiaozhou
    Zhang, Riquan
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [3] HYPOTHESIS TESTING FOR HIGH-DIMENSIONAL SPARSE BINARY REGRESSION
    Mukherjee, Rajarshi
    Pillai, Natesh S.
    Lin, Xihong
    [J]. ANNALS OF STATISTICS, 2015, 43 (01): : 352 - 381
  • [4] CONFIDENCE INTERVALS FOR HIGH-DIMENSIONAL LINEAR REGRESSION: MINIMAX RATES AND ADAPTIVITY
    Cai, T. Tony
    Guo, Zijian
    [J]. ANNALS OF STATISTICS, 2017, 45 (02): : 615 - 646
  • [5] Rate optimal estimation and confidence intervals for high-dimensional regression with missing covariates
    Wang, Yining
    Wang, Jialei
    Balakrishnan, Sivaraman
    Singh, Aarti
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2019, 174
  • [6] CONFIDENCE INTERVALS FOR HIGH-DIMENSIONAL COX MODELS
    Yu, Yi
    Bradic, Jelena
    Samworth, Richard J.
    [J]. STATISTICA SINICA, 2021, 31 (01) : 243 - 267
  • [7] Nearly Optimal Sample Size in Hypothesis Testing for High-Dimensional Regression
    Javanmard, Adel
    Montanari, Andrea
    [J]. 2013 51ST ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2013, : 1427 - 1434
  • [8] Global and Simultaneous Hypothesis Testing for High-Dimensional Logistic Regression Models
    Ma, Rong
    Cai, T. Tony
    Li, Hongzhe
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2021, 116 (534) : 984 - 998
  • [9] Confidence intervals for high-dimensional inverse covariance estimation
    Jankova, Jana
    van de Geer, Sara
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2015, 9 (01): : 1205 - 1229
  • [10] HYPOTHESIS TESTING IN HIGH-DIMENSIONAL INSTRUMENTAL VARIABLES REGRESSION WITH AN APPLICATION TO GENOMICS DATA
    Lu, Jiarui
    Li, Hongzhe
    [J]. STATISTICA SINICA, 2022, 32 : 613 - 633