ROBUST HIGH-DIMENSIONAL TUNING FREE MULTIPLE TESTING

被引:0
|
作者
Fan, Jianqing [1 ]
Lou, Zhipeng [2 ]
Yu, Mengxin [3 ]
机构
[1] Princeton Univ, Dept Operat Res & Financial Engn, Princeton, NJ 08544 USA
[2] Univ Pittsburgh, Dept Stat, Pittsburgh, PA USA
[3] Univ Penn, Dept Stat & Data Sci, Philadelphia, PA USA
来源
ANNALS OF STATISTICS | 2023年 / 51卷 / 05期
关键词
Key words and phrases. Robust statistical inference; heavy -tailed data; tuning free; weighted bootstrap; large; scale multiple testing; FALSE DISCOVERY RATE; RNA-SEQ; BOOTSTRAP APPROXIMATIONS; QUANTILE REGRESSION; EFFICIENT APPROACH; 2-SAMPLE TEST; U-STATISTICS; M-ESTIMATORS; REPRESENTATION; ASYMPTOTICS;
D O I
10.1214/23-AOS2322
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
A stylized feature of high-dimensional data is that many variables have heavy tails, and robust statistical inference is critical for valid large-scale statistical inference. Yet, the existing developments such as Winsorization, Huberization and median of means require the bounded second moments and involve variable-dependent tuning parameters, which hamper their fidelity in applications to large-scale problems. To liberate these constraints, this paper revisits the celebrated Hodges-Lehmann (HL) estimator for estimating location parameters in both the one- and two-sample problems, from a nonasymptotic perspective. Our study develops Berry-Esseen inequality and Cramer-type moderate deviation for the HL estimator based on newly developed nonasymptotic Bahadur representation and builds data-driven confidence intervals via a weighted bootstrap approach. These results allow us to extend the HL estimator to large-scale studies and propose tuning-free and moment-free high-dimensional inference procedures for testing global null and for large-scale multiple testing with false discovery proportion control. It is convincingly shown that the resulting tuning-free and moment-free methods control false discovery proportion at a prescribed level. The simulation studies lend further support to our developed theory.
引用
收藏
页码:2093 / 2115
页数:23
相关论文
共 50 条
  • [1] Rejoinder to "A Tuning-Free Robust and Efficient Approach to High-Dimensional Regression"
    Wang, Lan
    Peng, Bo
    Bradic, Jelena
    Li, Runze
    Wu, Yunan
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2020, 115 (532) : 1726 - 1729
  • [2] Discussion of "A Tuning-Free Robust and Efficient Approach to High-Dimensional Regression"
    Li, Xiudi
    Shojaie, Ali
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2020, 115 (532) : 1717 - 1719
  • [3] Comment on "A Tuning-Free Robust and Efficient Approach to High-Dimensional Regression"
    Loh, Po-Ling
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2020, 115 (532) : 1715 - 1716
  • [4] Multiple testing for high-dimensional data
    Diao, Guoqing
    Hanlon, Bret
    Vidyashankar, Anand N.
    [J]. PERSPECTIVES ON BIG DATA ANALYSIS: METHODOLOGIES AND APPLICATIONS, 2014, 622 : 95 - 108
  • [5] Robust Testing in High-Dimensional Sparse Models
    George, Anand Jerry
    Canonne, Clement L.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [6] Association Testing for High-Dimensional Multiple Response Regression
    Wang, Jinjuan
    Jiang, Zhenzhen
    Liu, Hongzhi
    Meng, Zhen
    [J]. JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2023, 36 (04) : 1680 - 1696
  • [7] Association Testing for High-Dimensional Multiple Response Regression
    Jinjuan Wang
    Zhenzhen Jiang
    Hongzhi Liu
    Zhen Meng
    [J]. Journal of Systems Science and Complexity, 2023, 36 : 1680 - 1696
  • [8] Association Testing for High-Dimensional Multiple Response Regression
    WANG Jinjuan
    JIANG Zhenzhen
    LIU Hongzhi
    MENG Zhen
    [J]. Journal of Systems Science & Complexity, 2023, 36 (04) : 1680 - 1696
  • [9] Effects of dependence in high-dimensional multiple testing problems
    Kim, Kyung In
    de Wiel, Mark A. van
    [J]. BMC BIOINFORMATICS, 2008, 9 (1)
  • [10] Testing the equality of multiple high-dimensional covariance matrices
    Shen J.
    [J]. Results in Applied Mathematics, 2022, 15