Statistical Power of Model Selection Strategies for Genome-Wide Association Studies

被引:25
|
作者
Wu, Zheyang [1 ]
Zhao, Hongyu [1 ,2 ]
机构
[1] Yale Univ, Sch Med, Dept Epidemiol & Publ Hlth, New Haven, CT 06510 USA
[2] Yale Univ, Sch Med, Dept Genet, New Haven, CT 06510 USA
来源
PLOS GENETICS | 2009年 / 5卷 / 07期
关键词
POLYMORPHISMS; LOCI;
D O I
10.1371/journal.pgen.1000582
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Genome-wide association studies (GWAS) aim to identify genetic variants related to diseases by examining the associations between phenotypes and hundreds of thousands of genotyped markers. Because many genes are potentially involved in common diseases and a large number of markers are analyzed, it is crucial to devise an effective strategy to identify truly associated variants that have individual and/or interactive effects, while controlling false positives at the desired level. Although a number of model selection methods have been proposed in the literature, including marginal search, exhaustive search, and forward search, their relative performance has only been evaluated through limited simulations due to the lack of an analytical approach to calculating the power of these methods. This article develops a novel statistical approach for power calculation, derives accurate formulas for the power of different model selection strategies, and then uses the formulas to evaluate and compare these strategies in genetic model spaces. In contrast to previous studies, our theoretical framework allows for random genotypes, correlations among test statistics, and a false-positive control based on GWAS practice. After the accuracy of our analytical results is validated through simulations, they are utilized to systematically evaluate and compare the performance of these strategies in a wide class of genetic models. For a specific genetic model, our results clearly reveal how different factors, such as effect size, allele frequency, and interaction, jointly affect the statistical power of each strategy. An example is provided for the application of our approach to empirical research. The statistical approach used in our derivations is general and can be employed to address the model selection problems in other random predictor settings. We have developed an R package markerSearchPower to implement our formulas, which can be downloaded from the Comprehensive R Archive Network (CRAN) or http://bioinformatics.med.yale.edu/group/.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Model Selection Strategies in Genome-Wide Association Studies
    Keildson, Sarah L.
    Farrall, Martin
    Morris, Andrew P.
    [J]. GENETIC EPIDEMIOLOGY, 2009, 33 (08) : 792 - 792
  • [2] Enrichment of statistical power for genome-wide association studies
    Li, Meng
    Liu, Xiaolei
    Bradbury, Peter
    Yu, Jianming
    Zhang, Yuan-Ming
    Todhunter, Rory J.
    Buckler, Edward S.
    Zhang, Zhiwu
    [J]. BMC BIOLOGY, 2014, 12
  • [3] Enrichment of statistical power for genome-wide association studies
    Meng Li
    Xiaolei Liu
    Peter Bradbury
    Jianming Yu
    Yuan-Ming Zhang
    Rory J Todhunter
    Edward S Buckler
    Zhiwu Zhang
    [J]. BMC Biology, 12
  • [4] SNP Selection Strategies from Genome-Wide Association Studies
    Sinnwell, J. P.
    Schaid, D. J.
    [J]. GENETIC EPIDEMIOLOGY, 2008, 32 (07) : 714 - 714
  • [5] Impact of the Tagging on the Statistical Power of Association Tests in Genome-Wide Association Studies
    Emily, M.
    [J]. HUMAN HEREDITY, 2015, 79 (01) : 34 - 34
  • [6] Testing and genetic model selection in genome-wide association studies
    Loley, Christina
    Koenig, Inke R.
    Hothorn, Ludwig
    Ziegler, Andreas
    [J]. ANNALS OF HUMAN GENETICS, 2012, 76 : 420 - 420
  • [7] Testing and Genetic Model Selection in Genome-Wide Association Studies
    Loley, Christina
    Konig, Inke R.
    Hothorn, Ludwig
    Ziegler, Andreas
    [J]. GENETIC EPIDEMIOLOGY, 2012, 36 (02) : 149 - 149
  • [8] The Effect of Phenotype Transformations on Statistical Power in Genome-Wide Association Studies
    Charoen, Pimphen
    [J]. HUMAN HEREDITY, 2013, 76 (02) : 110 - 111
  • [9] Statistical methods for genome-wide association studies
    Wang, Maggie Haitian
    Cordell, Heather J.
    Van Steen, Kristel
    [J]. SEMINARS IN CANCER BIOLOGY, 2019, 55 : 53 - 60
  • [10] Statistical Methods in Genome-Wide Association Studies
    Sun, Ning
    Zhao, Hongyu
    [J]. ANNUAL REVIEW OF BIOMEDICAL DATA SCIENCE, VOL 3, 2020, 2020, 3 : 265 - 288