Combining p-values from various statistical methods for microbiome data

被引:1
|
作者
Ham, Hyeonjung [1 ]
Park, Taesung [1 ,2 ]
机构
[1] Seoul Natl Univ, Interdisciplinary Program Bioinformat, Seoul, South Korea
[2] Seoul Natl Univ, Dept Stat, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
microbiome analysis; integration method; p-value combination; power simulation; rank simulation; DIFFERENTIAL ABUNDANCE ANALYSIS; GENE-EXPRESSION; COLORECTAL-CANCER; REGRESSION; ASSOCIATION; COMBINATION; METABOLITES;
D O I
10.3389/fmicb.2022.990870
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Motivation: In the field of microbiome analysis, there exist various statistical methods that have been developed for identifying differentially expressed features, that account for the overdispersion and the high sparsity of microbiome data. However, due to the differences in statistical models or test formulations, it is quite often to have inconsistent significance results across statistical methods, that makes it difficult to determine the importance of microbiome taxa. Thus, it is practically important to have the integration of the result from all statistical methods to determine the importance of microbiome taxa. A standard meta-analysis is a powerful tool for integrative analysis and it provides a summary measure by combining p-values from various statistical methods. While there are many meta-analyses available, it is not easy to choose the best meta-analysis that is the most suitable for microbiome data. Results: In this study, we investigated which meta-analysis method most adequately represents the importance of microbiome taxa. We considered Fisher's method, minimum value of p method, Simes method, Stouffer's method, Kost method, and Cauchy combination test. Through simulation studies, we showed that Cauchy combination test provides the best combined value of p in the sense that it performed the best among the examined methods while controlling the type 1 error rates. Furthermore, it produced high rank similarity with the true ranks. Through the real data application of colorectal cancer microbiome data, we demonstrated that the most highly ranked microbiome taxa by Cauchy combination test have been reported to be associated with colorectal cancer.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Combining p-values in large-scale genomics experiments
    Zaykin, Dmitri V.
    Zhivotovsky, Lev A.
    Czika, Wendy
    Shao, Susan
    Wolfinger, Russell D.
    [J]. PHARMACEUTICAL STATISTICS, 2007, 6 (03) : 217 - 226
  • [32] P-values and decision-making: discussion of 'Limitations of empirical calibration of p-values using observational data'
    Franklin, Jessica M.
    [J]. STATISTICS IN MEDICINE, 2016, 35 (22) : 3889 - 3891
  • [33] Numerical inversion methods for computing approximate p-Values
    Kawakatsu H.
    [J]. Computational Economics, 2005, 26 (3-4) : 103 - 116
  • [34] Numerical Inversion Methods for Computing Approximate p-Values
    Hiroyuki Kawakatsu
    [J]. Computational Economics, 2007, 29 (3-4) : 429 - 429
  • [35] Improving the reporting of P-values generated by randomization methods
    Ruxton, Graeme D.
    Neuhaeuser, Markus
    [J]. METHODS IN ECOLOGY AND EVOLUTION, 2013, 4 (11): : 1033 - 1036
  • [36] The JMIG Issues New Guidelines on Statistical Reporting and p-values
    Wilson, Jeffrey R.
    Falcone, Tommaso
    [J]. JOURNAL OF MINIMALLY INVASIVE GYNECOLOGY, 2020, 27 (01) : 1 - 3
  • [37] ESTIMATING SMOOTHNESS IN STATISTICAL PARAMETRIC MAPS - VARIABILITY OF P-VALUES
    POLINE, JB
    WORSLEY, KJ
    HOLMES, AP
    FRACKOWIAK, RSJ
    FRISTON, KJ
    [J]. JOURNAL OF COMPUTER ASSISTED TOMOGRAPHY, 1995, 19 (05) : 788 - 796
  • [38] Acknowledged statistical help and a better use of p-values: a proposal
    Dincses, E.
    Guzelant, G.
    Hatemi, G.
    Sut, N.
    Yazici, H.
    [J]. CLINICAL AND EXPERIMENTAL RHEUMATOLOGY, 2019, 37 (05) : 855 - 857
  • [39] Statistical Inference of Lead-Lag Between Asynchronous Time Series from P-Values of Transfer Entropy at Various Timescales
    Bongiorno, Christian
    Challet, Damien
    [J]. COMPLEX NETWORKS AND THEIR APPLICATIONS XI, COMPLEX NETWORKS 2022, VOL 1, 2023, 1077 : 514 - 525
  • [40] Combining p-values to test for multiple structural breaks in cointegrated regressions
    Bergamelli, Michele
    Bianchi, Annamaria
    Khalaf, Lynda
    Urga, Giovanni
    [J]. JOURNAL OF ECONOMETRICS, 2019, 211 (02) : 461 - 482