Reconsidering Association Testing Methods Using Single-Variant Test Statistics as Alternatives to Pooling Tests for Sequence Data with Rare Variants

被引:26
|
作者
Kinnamon, Daniel D. [1 ]
Hershberger, Ray E. [2 ]
Martin, Eden R. [1 ]
机构
[1] Univ Miami, Miller Sch Med, Dept Human Genet, Dr John T Macdonald Fdn, Miami, FL 33136 USA
[2] Univ Miami, Miller Sch Med, Cardiovasc Div, Miami, FL 33136 USA
来源
PLOS ONE | 2012年 / 7卷 / 02期
基金
美国国家卫生研究院;
关键词
SAMPLE-SIZE; LINKAGE DISEQUILIBRIUM; COMMON DISEASES; GENETIC-MARKERS; ALLELE MODEL; SUSCEPTIBILITY; POWER; SNPS;
D O I
10.1371/journal.pone.0030238
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Association tests that pool minor alleles into a measure of burden at a locus have been proposed for case-control studies using sequence data containing rare variants. However, such pooling tests are not robust to the inclusion of neutral and protective variants, which can mask the association signal from risk variants. Early studies proposing pooling tests dismissed methods for locus-wide inference using nonnegative single-variant test statistics based on unrealistic comparisons. However, such methods are robust to the inclusion of neutral and protective variants and therefore may be more useful than previously appreciated. In fact, some recently proposed methods derived within different frameworks are equivalent to performing inference on weighted sums of squared single-variant score statistics. In this study, we compared two existing methods for locus-wide inference using nonnegative single-variant test statistics to two widely cited pooling tests under more realistic conditions. We established analytic results for a simple model with one rare risk and one rare neutral variant, which demonstrated that pooling tests were less powerful than even Bonferroni-corrected single-variant tests in most realistic situations. We also performed simulations using variants with realistic minor allele frequency and linkage disequilibrium spectra, disease models with multiple rare risk variants and extensive neutral variation, and varying rates of missing genotypes. In all scenarios considered, existing methods using nonnegative single-variant test statistics had power comparable to or greater than two widely cited pooling tests. Moreover, in disease models with only rare risk variants, an existing method based on the maximum single-variant Cochran-Armitage trend chi-square statistic in the locus had power comparable to or greater than another existing method closely related to some recently proposed methods. We conclude that efficient locus-wide inference using single-variant test statistics should be reconsidered as a useful framework for devising powerful association tests in sequence data with rare variants.
引用
收藏
页数:15
相关论文
共 30 条
  • [21] Evaluation of gene-based association tests for analyzing rare variants using Genetic Analysis Workshop 18 data
    Andriy Derkach
    Jerry F Lawless
    Daniele Merico
    Andrew D Paterson
    Lei Sun
    BMC Proceedings, 8 (Suppl 1)
  • [22] Rare variant association on unrelated individuals in case-control studies using aggregation tests: existing methods and current limitations
    Boutry, Simon
    Helaers, Raphael
    Lenaerts, Tom
    Vikkula, Miikka
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (06)
  • [23] Family-Based Association Test Using Both Common and Rare Variants and Accounting for Directions of Effects for Sequencing Data
    Chung, Ren-Hua
    Tsai, Wei-Yun
    Martin, Eden R.
    PLOS ONE, 2014, 9 (09):
  • [24] The Rare Variant Generalized Disequilibrium Test for Association Analysis of Nuclear and Extended Pedigrees with Application to Alzheimer's Disease Whole Genome Sequence Data
    Leal, Suzanne M.
    He, Zongxiao
    Zhang, Di
    Renton, Alan E.
    Li, Biao
    Zhao, Linhai
    Wang, Gao T.
    Goate, Alison M.
    Mayeux, Richard
    HUMAN HEREDITY, 2016, 81 (04) : 216 - 216
  • [25] Powerful association test combining rare variant and gene expression using family data from Genetic Analysis Workshop 19
    Yen-Yi Ho
    Weihua Guan
    Michael O’Connell
    Saonli Basu
    BMC Proceedings, 10 (Suppl 7)
  • [26] SEQSpark: A Complete Analysis Tool for Large-Scale Rare Variant Association Studies using Whole Genome and Exome Sequence Data
    Zhang, Di
    Zhao, Linhai
    Li, Biao
    He, Zongxiao
    Wang, Gao T.
    Liu, Dajiang J.
    Leal, Suzanne M.
    GENETIC EPIDEMIOLOGY, 2017, 41 (07) : 646 - 646
  • [27] Using an integrated gene-based sequence kernel association test (intSKAT) to identify subtype specific single nucleotide variants in glioma
    Chen, Yian Ann
    Teer, Jamie K.
    Thompson, Zachary J.
    Baskin, Rebekah L.
    Zhang, Yonghong O.
    Fisher, Kate J.
    Chen, Zhihua
    Monteiro, Alvaro N.
    Egan, Kathleen M.
    CANCER RESEARCH, 2016, 76
  • [28] Using an Integrated Gene-Based Sequence Kernel Association Test (iSKAT) to Identify Subtype Specific Single Nucleotide Variants in Glioma
    Chen, Yian Ann
    Teer, Jamie K.
    Thompson, Zachary J.
    Baskin, Rebekah L.
    Zhang, Yonghong O.
    Fisher, Kate J.
    Chen, Zhihua
    Monteiro, Alvaro N.
    Egan, Kathleen M.
    HUMAN HEREDITY, 2016, 81 (02) : 52 - 52
  • [29] Joint analysis of multiple blood pressure phenotypes in GAW19 data by using a multivariate rare-variant association test
    Jianping Sun
    Sahir R. Bhatnagar
    Karim Oualkacha
    Antonio Ciampi
    Celia M. T. Greenwood
    BMC Proceedings, 10 (Suppl 7)
  • [30] SEQSpark: A Complete Analysis Tool for Large-Scale Rare Variant Association Studies Using Whole-Genome and Exome Sequence Data
    Zhang, Di
    Zhao, Linhai
    Li, Biao
    He, Zongxiao
    Wang, Gao T.
    Liu, Dajiang J.
    Leal, Suzanne M.
    AMERICAN JOURNAL OF HUMAN GENETICS, 2017, 101 (01) : 115 - 122