Confidence Intervals and Hypothesis Testing for High-Dimensional Regression


Authors:

Javanmard, Adel [1]

Montanari, Andrea [1,2]

Affiliations:

[1] Stanford Univ, Dept Elect Engn, Stanford, CA 94305 USA

[2] Stanford Univ, Dept Stat, Stanford, CA 94305 USA

Source:

JOURNAL OF MACHINE LEARNING RESEARCH | 2014, Vol. 15

Funding:

US National Science Foundation

Keywords:

hypothesis testing; confidence intervals; LASSO; high-dimensional models; bias of an estimator; VARIABLE SELECTION; MODEL SELECTION;

DOI:

Not available

Chinese Library Classification:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Fitting high-dimensional statistical models often requires the use of non-linear parameter estimation procedures. As a consequence, it is generally impossible to obtain an exact characterization of the probability distribution of the parameter estimates. This in turn implies that it is extremely challenging to quantify the uncertainty associated with a certain parameter estimate. Concretely, no commonly accepted procedure exists for computing classical measures of uncertainty and statistical significance, such as confidence intervals or p-values, for these models. We consider here the high-dimensional linear regression problem, and propose an efficient algorithm for constructing confidence intervals and p-values. The resulting confidence intervals have nearly optimal size. When testing for the null hypothesis that a certain parameter is vanishing, our method has nearly optimal power. Our approach is based on constructing a 'de-biased' version of regularized M-estimators. The new construction improves over recent work in the field in that it does not assume a special structure on the design matrix. We test our method on synthetic data and a high-throughput genomic data set about riboflavin production rate, made publicly available by Bühlmann et al. (2014).
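
To make the 'de-biased' construction mentioned in the abstract concrete, the following is a sketch of how such an estimator and its confidence interval are usually written for the LASSO. The notation is an assumption and does not appear in this record: X is the n-by-p design matrix, y the response vector, \hat{\theta} a LASSO estimate, \hat{\Sigma} = X^T X / n the sample covariance, M a p-by-p matrix chosen as an approximate inverse of \hat{\Sigma}, \hat{\sigma} an estimate of the noise level, and \Phi the standard normal distribution function.

```latex
% Sketch only: all symbols are assumed notation, not taken from this record.
% De-biased estimator: the regularized estimate plus a correction term
% built from the residuals y - X \hat{\theta}.
\hat{\theta}^{u} \;=\; \hat{\theta} \;+\; \frac{1}{n}\, M X^{\mathsf{T}} \bigl( y - X \hat{\theta} \bigr)

% Approximate (1 - \alpha) confidence interval for the i-th coordinate.
\hat{\theta}^{u}_{i} \;\pm\; \Phi^{-1}\!\bigl(1 - \tfrac{\alpha}{2}\bigr)\,
  \frac{\hat{\sigma}}{\sqrt{n}}\,
  \bigl[ M \hat{\Sigma} M^{\mathsf{T}} \bigr]_{ii}^{1/2}

% Two-sided p-value for the null hypothesis \theta_i = 0.
P_{i} \;=\; 2 \Bigl( 1 - \Phi\Bigl( \frac{\sqrt{n}\, \lvert \hat{\theta}^{u}_{i} \rvert}
  {\hat{\sigma}\, \bigl[ M \hat{\Sigma} M^{\mathsf{T}} \bigr]_{ii}^{1/2}} \Bigr) \Bigr)
```

When n > p and \hat{\Sigma} is invertible, taking M = \hat{\Sigma}^{-1} reduces \hat{\theta}^{u} to the ordinary least-squares estimate. In the high-dimensional case M has to be obtained some other way, and how it is chosen is what distinguishes the different de-biasing proposals.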


Pages: 2869-2909

Page count: 41
