A TWO-SAMPLE TEST FOR HIGH-DIMENSIONAL DATA WITH APPLICATIONS TO GENE-SET TESTING

被引:400
|
作者
Chen, Song Xi [1 ,2 ]
Qin, Ying-Li [1 ]
机构
[1] Iowa State Univ, Dept Stat, Ames, IA 50011 USA
[2] Peking Univ, Guanghua Sch Management, Beijing 100871, Peoples R China
来源
ANNALS OF STATISTICS | 2010年 / 38卷 / 02期
关键词
High dimension; gene-set testing; large p small n; martingale central limit theorem; multiple comparison; FALSE DISCOVERY RATE; MICROARRAY DATA; COVARIANCE-MATRIX; HYPOTHESIS TESTS; NORMALIZATION; CONSISTENCY; CATEGORIES; EXPRESSION; LIMIT; MODEL;
D O I
10.1214/09-AOS716
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose a two-sample test for the means of high-dimensional data when the data dimension is much larger than the sample size. Hotelling's classical T(2) test does not work for this "large p, small n" situation. The proposed test does not require explicit conditions in the relationship between the data dimension and sample size. This offers much flexibility in analyzing high-dimensional data. An application of the proposed test is in testing significance for sets of genes which we demonstrate in an empirical study on a leukemia data set.
引用
收藏
页码:808 / 835
页数:28
相关论文
共 50 条
  • [1] A note on high-dimensional two-sample test
    Feng, Long
    Sun, Fasheng
    STATISTICS & PROBABILITY LETTERS, 2015, 105 : 29 - 36
  • [2] Two-sample mean vector projection test in high-dimensional data
    Huang, Caizhu
    Cui, Xia
    Pagui, Euloge Clovis Kenne
    COMPUTATIONAL STATISTICS, 2024, 39 (03) : 1061 - 1091
  • [3] Two-sample mean vector projection test in high-dimensional data
    Caizhu Huang
    Xia Cui
    Euloge Clovis Kenne Pagui
    Computational Statistics, 2024, 39 : 1061 - 1091
  • [4] Order test for high-dimensional two-sample means
    Lee, Sang H.
    Lim, Johan
    Li, Erning
    Vannucci, Marina
    Petkova, Eva
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2012, 142 (09) : 2719 - 2725
  • [5] An adaptive two-sample test for high-dimensional means
    Xu, Gongjun
    Lin, Lifeng
    Wei, Peng
    Pan, Wei
    BIOMETRIKA, 2016, 103 (03) : 609 - 624
  • [6] A Simple Scale-Invariant Two-Sample Test for High-dimensional Data
    Zhang, Liang
    Zhu, Tianming
    Zhang, Jin-Ting
    ECONOMETRICS AND STATISTICS, 2020, 14 : 131 - 144
  • [7] A two-sample test for the equality of univariate marginal distributions for high-dimensional data
    Cousido-Rocha, Marta
    de Una-Alvarez, Jacobo
    Hart, Jeffrey D.
    JOURNAL OF MULTIVARIATE ANALYSIS, 2019, 174
  • [8] A PERMUTATION TEST FOR TWO-SAMPLE MEANS AND SIGNAL IDENTIFICATION OF HIGH-DIMENSIONAL DATA
    Kong, Efang
    Wang, Lengyang
    Xia, Yingcun
    Liu, Jin
    STATISTICA SINICA, 2022, 32 (01) : 89 - 108
  • [9] Two-sample test for sparse high-dimensional multinomial distributions
    Amanda Plunkett
    Junyong Park
    TEST, 2019, 28 : 804 - 826
  • [10] Empirical likelihood test for high-dimensional two-sample model
    Ciuperca, Gabriela
    Salloum, Zahraa
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2016, 178 : 37 - 60