ROBUST HIGH-DIMENSIONAL TUNING FREE MULTIPLE TESTING

被引:0
|
作者
Fan, Jianqing [1 ]
Lou, Zhipeng [2 ]
Yu, Mengxin [3 ]
机构
[1] Princeton Univ, Dept Operat Res & Financial Engn, Princeton, NJ 08544 USA
[2] Univ Pittsburgh, Dept Stat, Pittsburgh, PA USA
[3] Univ Penn, Dept Stat & Data Sci, Philadelphia, PA USA
来源
ANNALS OF STATISTICS | 2023年 / 51卷 / 05期
关键词
Key words and phrases. Robust statistical inference; heavy -tailed data; tuning free; weighted bootstrap; large; scale multiple testing; FALSE DISCOVERY RATE; RNA-SEQ; BOOTSTRAP APPROXIMATIONS; QUANTILE REGRESSION; EFFICIENT APPROACH; 2-SAMPLE TEST; U-STATISTICS; M-ESTIMATORS; REPRESENTATION; ASYMPTOTICS;
D O I
10.1214/23-AOS2322
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
A stylized feature of high-dimensional data is that many variables have heavy tails, and robust statistical inference is critical for valid large-scale statistical inference. Yet, the existing developments such as Winsorization, Huberization and median of means require the bounded second moments and involve variable-dependent tuning parameters, which hamper their fidelity in applications to large-scale problems. To liberate these constraints, this paper revisits the celebrated Hodges-Lehmann (HL) estimator for estimating location parameters in both the one- and two-sample problems, from a nonasymptotic perspective. Our study develops Berry-Esseen inequality and Cramer-type moderate deviation for the HL estimator based on newly developed nonasymptotic Bahadur representation and builds data-driven confidence intervals via a weighted bootstrap approach. These results allow us to extend the HL estimator to large-scale studies and propose tuning-free and moment-free high-dimensional inference procedures for testing global null and for large-scale multiple testing with false discovery proportion control. It is convincingly shown that the resulting tuning-free and moment-free methods control false discovery proportion at a prescribed level. The simulation studies lend further support to our developed theory.
引用
收藏
页码:2093 / 2115
页数:23
相关论文
共 50 条
  • [41] Self-tuning clustering for high-dimensional data
    Wen, Guoqiu
    Zhu, Yonghua
    Cai, Zhiguo
    Zheng, Wei
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2018, 21 (06): : 1563 - 1573
  • [42] Self-tuning clustering for high-dimensional data
    Guoqiu Wen
    Yonghua Zhu
    Zhiguo Cai
    Wei Zheng
    World Wide Web, 2018, 21 : 1563 - 1573
  • [43] A tuning-free efficient test for marginal linear effects in high-dimensional quantile regression
    Xu, Kai
    An, Nan
    ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2024, 76 (01) : 93 - 110
  • [44] A tuning-free efficient test for marginal linear effects in high-dimensional quantile regression
    Kai Xu
    Nan An
    Annals of the Institute of Statistical Mathematics, 2024, 76 : 93 - 110
  • [45] An asymptotic viewpoint on high-dimensional Bayesian testing
    Spitzner, Dan J.
    BAYESIAN ANALYSIS, 2008, 3 (01): : 121 - 160
  • [46] TESTING FOR GROUP STRUCTURE IN HIGH-DIMENSIONAL DATA
    McLachlan, G. J.
    Rathnayake, Suren I.
    JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2011, 21 (06) : 1113 - 1125
  • [47] Testing linear hypotheses in high-dimensional regressions
    Bai, Zhidong
    Jiang, Dandan
    Yao, Jian-feng
    Zheng, Shurong
    STATISTICS, 2013, 47 (06) : 1207 - 1223
  • [48] PROPERTY TESTING IN HIGH-DIMENSIONAL ISING MODELS
    Neykov, Matey
    Liu, Han
    ANNALS OF STATISTICS, 2019, 47 (05): : 2472 - 2503
  • [49] Testing identity of high-dimensional covariance matrix
    Wang, Hao
    Liu, Baisen
    Shi, Ning-Zhong
    Zheng, Shurong
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2018, 88 (13) : 2600 - 2611
  • [50] Testing for High-Dimensional Geometry in Random Graphs
    Bubeck, Sebastien
    Ding, Jian
    Eldan, Ronen
    Racz, Miklos Z.
    RANDOM STRUCTURES & ALGORITHMS, 2016, 49 (03) : 503 - 532