The generalized Fisher's combination and accurate p-value calculation under dependence

Cited by: 7
Authors
Zhang, Hong [1]
Wu, Zheyang [2]
Affiliations
[1] Merck Res Labs, Biostat & Res Decis Sci, Rahway, NJ USA
[2] Worcester Polytech Inst, Dept Math Sci, 100 Inst Rd, Worcester, MA 01609 USA
Funding
National Science Foundation (USA)
Keywords
dependence; global hypothesis testing; meta-analysis; signal detection; the p-value combination; BONE-MINERAL DENSITY; WEIGHTED COMBINATION; PROBABILITIES; STATISTICS; TESTS; WOMEN;
DOI
10.1111/biom.13634
Chinese Library Classification
Q [Biological Sciences]
Discipline classification codes
07 ; 0710 ; 09 ;
Abstract
Combining dependent tests of significance has broad applications, but the related p-value calculation is challenging. For Fisher's combination test, current p-value calculation methods (e.g., Brown's approximation) tend to inflate the type I error rate when the desired significance level is substantially less than 0.05. The problem could lead to significant false discoveries in big data analyses. This paper provides two main contributions. First, it presents a general family of Fisher-type statistics, referred to as the GFisher, which covers many classic statistics, such as Fisher's combination, Good's statistic, Lancaster's statistic, weighted Z-score combination, and so forth. The GFisher allows a flexible weighting scheme, as well as an omnibus procedure that automatically adapts the weights and the statistic-defining parameters to the given data. Second, the paper presents several new p-value calculation methods based on two novel ideas: moment-ratio matching and joint-distribution surrogating. Systematic simulations show that the new calculation methods are more accurate under the multivariate Gaussian distribution, and more robust under the generalized linear model and the multivariate t-distribution. The applications of the GFisher and the new p-value calculation methods are demonstrated by a gene-based single nucleotide polymorphism (SNP)-set association study. Relevant computation has been implemented in the R package GFisher, available on the Comprehensive R Archive Network.
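For orientation, the classical Fisher's combination test that the abstract takes as its starting point can be sketched as follows. This is a minimal illustration of the textbook independent case only, not the GFisher method or any of the paper's dependence-adjusted calculations: with n independent p-values, T = -2 Σ log p_i follows a chi-square distribution with 2n degrees of freedom under the global null. The function name `fisher_combination_pvalue` is hypothetical and is not part of the GFisher R package.

```python
import math


def fisher_combination_pvalue(pvalues):
    """Combined p-value for Fisher's statistic T = -2 * sum(log p_i).

    Assumption: the input p-values are independent, so T ~ chi-square(2n)
    under the global null. Under dependence this reference distribution
    is no longer valid, which is the problem the abstract describes.
    """
    if not pvalues:
        raise ValueError("at least one p-value is required")
    if any(not (0.0 < p <= 1.0) for p in pvalues):
        raise ValueError("p-values must lie in (0, 1]")

    n = len(pvalues)
    half_t = -sum(math.log(p) for p in pvalues)  # T / 2

    # For even degrees of freedom 2n the chi-square survival function
    # has a closed form: exp(-T/2) * sum_{k=0}^{n-1} (T/2)^k / k!
    term = 1.0
    total = 1.0
    for k in range(1, n):
        term *= half_t / k
        total += term
    return math.exp(-half_t) * total


if __name__ == "__main__":
    # Illustrative inputs only, not data from the paper.
    print(fisher_combination_pvalue([0.01, 0.20, 0.03]))
```

With a single input the function returns that p-value unchanged, which is a quick sanity check on the closed-form survival function.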
Pages: 1159-1172
Page count: 14