GAUSSIAN APPROXIMATIONS AND MULTIPLIER BOOTSTRAP FOR MAXIMA OF SUMS OF HIGH-DIMENSIONAL RANDOM VECTORS

被引:248
|
作者
Chernozhukov, Victor [1 ,2 ]
Chetverikov, Denis [3 ]
Kato, Kengo [4 ]
机构
[1] MIT, Dept Econ, Cambridge, MA 02142 USA
[2] MIT, Ctr Operat Res, Cambridge, MA 02142 USA
[3] Univ Calif Los Angeles, Dept Econ, Los Angeles, CA 90095 USA
[4] Univ Tokyo, Grad Sch Econ, Bunkyo Ku, Tokyo 1130033, Japan
来源
ANNALS OF STATISTICS | 2013年 / 41卷 / 06期
基金
日本学术振兴会; 美国国家科学基金会;
关键词
Dantzig selector; Slepian; Stein method; maximum of vector sums; high dimensionality; anti-concentration; DANTZIG SELECTOR; DENSITY-ESTIMATION; MODELS; LASSO; TESTS;
D O I
10.1214/13-AOS1161
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We derive a Gaussian approximation result for the maximum of a sum of high-dimensional random vectors. Specifically, we establish conditions under which the distribution of the maximum is approximated by that of the maximum of a sum of the Gaussian random vectors with the same covariance matrices as the original vectors. This result applies when the dimension of random vectors (p) is large compared to the sample size (n); in fact, p can be much larger than n, without restricting correlations of the coordinates of these vectors. We also show that the distribution of the maximum of a sum of the random vectors with unknown covariance matrices can be consistently estimated by the distribution of the maximum of a sum of the conditional Gaussian random vectors obtained by multiplying the original vectors with i.i.d. Gaussian multipliers. This is the Gaussian multiplier (or wild) bootstrap procedure. Here too, p can be large or even much larger than n. These distributional approximations, either Gaussian or conditional Gaussian, yield a high-quality approximation to the distribution of the original maximum, often with approximation error decreasing polynomially in the sample size, and hence are of interest in many applications. We demonstrate how our Gaussian approximations and the multiplier bootstrap can be used for modern high-dimensional estimation, multiple hypothesis testing, and adaptive specification testing. All these results contain nonasymptotic bounds on approximation errors.
引用
收藏
页码:2786 / 2819
页数:34
相关论文
共 50 条