A Permutation Procedure to Correct for Confounders in Case-Control Studies, Including Tests of Rare Variation

被引:56
|
作者
Epstein, Michael P. [1 ]
Duncan, Richard [1 ]
Jiang, Yunxuan [2 ]
Conneely, Karen N. [1 ]
Allen, Andrew S. [3 ]
Satten, Glen A. [4 ]
机构
[1] Emory Univ, Dept Human Genet, Atlanta, GA 30322 USA
[2] Emory Univ, Dept Biostat & Bioinformat, Atlanta, GA 30322 USA
[3] Duke Univ, Dept Biostat & Bioinformat, Durham, NC 27710 USA
[4] Ctr Dis Control & Prevent, Atlanta, GA 30333 USA
基金
美国国家卫生研究院;
关键词
WHOLE-GENOME ASSOCIATION; POPULATION STRATIFICATION; GENETIC-ASSOCIATION; DISEASE VARIANTS; COMMON DISEASES; SEQUENCE DATA; STRATEGIES; DISORDERS; MAP;
D O I
10.1016/j.ajhg.2012.06.004
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Many case-control tests of rare variation are implemented in statistical frameworks that make correction for confounders like population stratification difficult. Simple permutation of disease status is unacceptable for resolving this issue because the replicate data sets do not have the same confounding as the original data set. These limitations make it difficult to apply rare-variant tests to samples in which confounding most likely exists, e.g., samples collected from admixed populations. To enable the use of such rare-variant methods in structured samples, as well as to facilitate permutation tests for any situation in which case-control tests require adjustment for confounding covariates, we propose to establish the significance of a rare-variant test via a modified permutation procedure. Our procedure uses Fisher's noncentral hypergeometric distribution to generate permuted data sets with the same structure present in the actual data set such that inference is valid in the presence of confounding factors. We use simulated sequence data based on coalescent models to show that our permutation strategy corrects for confounding due to population stratification that, if ignored, would otherwise inflate the size of a rare-variant test. We further illustrate the approach by using sequence data from the Dallas Heart Study of energy metabolism traits. Researchers can implement our permutation approach by using the R package BiasedUrn.
引用
收藏
页码:215 / 223
页数:9
相关论文
共 50 条