Parametric and nonparametric two-sample tests for feature screening in class comparison: a simulation study

被引:3
|
作者
Landoni, Elena [1 ]
Ambrogi, Federico [2 ]
Mariani, Luigi [1 ]
Miceli, Rosalba [1 ]
机构
[1] Fdn IRCCS Ist Nazl Tumori, Milan, Italy
[2] Univ Milan, Milan, Italy
关键词
high-dimensional data; class comparison; location-scale problem; general two-sample problem; mixtures;
D O I
10.2427/11808
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background: The identification of a location-, scale-and shape-sensitive test to detect differentially expressed features between two comparison groups represents a key point in high dimensional studies. The most commonly used tests refer to differences in location, but general distributional discrepancies might be important to reveal differential biological processes. Methods: A simulation study was conducted to compare the performance of a set of two-sample tests, i.e. Student's t, Welch's t, Wilcoxon-Mann-Whitney (WMW), Podgor-Gastwirth PG2, Cucconi, Kolmogorov-Smirnov (KS), Cramer-von Mises (CvM), Anderson-Darling (AD) and Zhang tests (Z(K), Z(C) and Z(A)) which were investigated under different distributional patterns. We applied the same tests to a real data example. Results: AD, CvM, Z(A) and Z(C) tests proved to be the most sensitive tests in mixture distribution patterns, while still maintaining a high power in normal distribution patterns. At best, the AD test showed a power loss of similar to 2% in the comparison of two normal distributions, but a gain of similar to 32% with mixture distributions with respect to the parametric tests. Accordingly, the AD test detected the greatest number of differentially expressed features in the real data application. Conclusion: The tests for the general two-sample problem introduce a more general concept of 'differential expression', thus overcoming the limitations of the other tests restricted to specific moments of the feature distributions. In particular, the AD test should be considered as a powerful alternative to the parametric tests for feature screening in order to keep as many discriminative features as possible for the class prediction analysis.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] A class of two-sample nonparametric statistics for binary and time-to-event outcomes
    Bofill Roig, Marta
    Gomez Melis, Guadalupe
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2022, 31 (02) : 225 - 239
  • [32] A Nonparametric Bayesian Approach for the Two-Sample Problem
    Ceregatti, Rafael de C.
    Izbicki, Rafael
    Salasar, Luis Ernesto B.
    BAYESIAN INFERENCE AND MAXIMUM ENTROPY METHODS IN SCIENCE AND ENGINEERING, MAXENT 37, 2018, 239 : 231 - 241
  • [33] Comparison of two-sample tests for edge detection in noisy images
    Lim, DH
    Jang, SJ
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES D-THE STATISTICIAN, 2002, 51 : 21 - 30
  • [34] Two-sample Bayesian Nonparametric Hypothesis Testing
    Holmes, Chris C.
    Caron, Francois
    Griffin, Jim E.
    Stephens, David A.
    BAYESIAN ANALYSIS, 2015, 10 (02): : 297 - 320
  • [35] A two-sample nonparametric likelihood ratio test
    Marsh, Patrick
    JOURNAL OF NONPARAMETRIC STATISTICS, 2010, 22 (08) : 1053 - 1065
  • [36] A two-sample nonparametric test with missing observations
    Lee, YJ
    AMERICAN JOURNAL OF MATHEMATICAL AND MANAGEMENT SCIENCES, VOL 17, NOS 1 AND 2, 1997: MULTIVARIATE STATISTICAL INFERENCE - MSI-2000L MULTIVARIATE STATISTICAL ANALYSIS IN HONOR OF PROFESSOR MINORU SIOTANI ON HIS 70TH BIRTHDAY, 1997, 17 (1&2): : 187 - 200
  • [37] A comparison of some two-sample tests with interval censored data
    Pan, W
    JOURNAL OF NONPARAMETRIC STATISTICS, 1999, 12 (01) : 133 - 146
  • [38] A nonparametric test for the general two-sample problem
    Baumgartner, W
    Weiss, P
    Schindler, H
    BIOMETRICS, 1998, 54 (03) : 1129 - 1135
  • [39] A NEW SUB-SAMPLES BASED NONPARAMETRIC TESTS FOR TWO-SAMPLE SCALE PROBLEM
    Kamat, Deepa Yogesh
    Pandit, Parameshwar, V
    INTERNATIONAL JOURNAL OF AGRICULTURAL AND STATISTICAL SCIENCES, 2022, 18 (01): : 1 - 7
  • [40] A New Class of Robust Two-Sample Wald-Type Tests
    Gaosh, Abhik
    Martin, Nirian
    Basu, Ayanendranath
    Pardo, Leandro
    INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2018, 14 (02):