A Permutation Procedure to Correct for Confounders in Case-Control Studies, Including Tests of Rare Variation

被引:56
|
作者
Epstein, Michael P. [1 ]
Duncan, Richard [1 ]
Jiang, Yunxuan [2 ]
Conneely, Karen N. [1 ]
Allen, Andrew S. [3 ]
Satten, Glen A. [4 ]
机构
[1] Emory Univ, Dept Human Genet, Atlanta, GA 30322 USA
[2] Emory Univ, Dept Biostat & Bioinformat, Atlanta, GA 30322 USA
[3] Duke Univ, Dept Biostat & Bioinformat, Durham, NC 27710 USA
[4] Ctr Dis Control & Prevent, Atlanta, GA 30333 USA
基金
美国国家卫生研究院;
关键词
WHOLE-GENOME ASSOCIATION; POPULATION STRATIFICATION; GENETIC-ASSOCIATION; DISEASE VARIANTS; COMMON DISEASES; SEQUENCE DATA; STRATEGIES; DISORDERS; MAP;
D O I
10.1016/j.ajhg.2012.06.004
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Many case-control tests of rare variation are implemented in statistical frameworks that make correction for confounders like population stratification difficult. Simple permutation of disease status is unacceptable for resolving this issue because the replicate data sets do not have the same confounding as the original data set. These limitations make it difficult to apply rare-variant tests to samples in which confounding most likely exists, e.g., samples collected from admixed populations. To enable the use of such rare-variant methods in structured samples, as well as to facilitate permutation tests for any situation in which case-control tests require adjustment for confounding covariates, we propose to establish the significance of a rare-variant test via a modified permutation procedure. Our procedure uses Fisher's noncentral hypergeometric distribution to generate permuted data sets with the same structure present in the actual data set such that inference is valid in the presence of confounding factors. We use simulated sequence data based on coalescent models to show that our permutation strategy corrects for confounding due to population stratification that, if ignored, would otherwise inflate the size of a rare-variant test. We further illustrate the approach by using sequence data from the Dallas Heart Study of energy metabolism traits. Researchers can implement our permutation approach by using the R package BiasedUrn.
引用
收藏
页码:215 / 223
页数:9
相关论文
共 50 条
  • [41] Case-control studies
    Shapiro, ED
    PEDIATRIC INFECTIOUS DISEASE JOURNAL, 2003, 22 (01) : 85 - 87
  • [42] Fast and Robust Tests of Association for Untyped SNPs in Case-Control Studies
    Epstein, M. P.
    Allen, A. S.
    Griffiths, S.
    Dudbridge, E.
    Satten, G. A.
    GENETIC EPIDEMIOLOGY, 2008, 32 (07) : 674 - 674
  • [43] Fast and Robust Association Tests for Untyped SNPs in Case-Control Studies
    Allen, Andrew S.
    Satten, Glen A.
    Bray, Sarah L.
    Dudbridge, Frank
    Epstein, Michael P.
    HUMAN HEREDITY, 2010, 70 (03) : 167 - 176
  • [44] Score tests in the presence of errors in covariates in matched case-control studies
    Sinha, Samiran
    Yoo, Seungyoon
    JOURNAL OF MULTIVARIATE ANALYSIS, 2013, 115 : 157 - 171
  • [45] Case-Control Studies
    Blakely, Tony
    Pearce, Neil
    Lynch, John
    JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2019, 321 (08): : 805 - 807
  • [46] Rare variant association on unrelated individuals in case-control studies using aggregation tests: existing methods and current limitations
    Boutry, Simon
    Helaers, Raphael
    Lenaerts, Tom
    Vikkula, Miikka
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (06)
  • [47] Association Testing of Clustered Rare Causal Variants in Case-Control Studies
    Lin, Wan-Yu
    PLOS ONE, 2014, 9 (04):
  • [48] Robust genomic control and robust delta centralization tests for case-control association studies
    Zang, Yong
    Zhang, Hong
    Yang, Yaning
    Zheng, Gang
    HUMAN HEREDITY, 2007, 63 (3-4) : 187 - 195
  • [49] METHODOLOGIC VARIATION IN CASE-CONTROL STUDIES OF RESERPINE AND BREAST-CANCER
    LABARTHE, DR
    JOURNAL OF CHRONIC DISEASES, 1979, 32 (1-2): : 95 - 104
  • [50] A simulation-based comparison of techniques to correct for measurement error in matched case-control studies
    Guolo, A.
    Brazzale, A. R.
    STATISTICS IN MEDICINE, 2008, 27 (19) : 3755 - 3775