A New Expectation-Maximization Statistical Test for Case-Control Association Studies Considering Rare Variants Obtained by High-Throughput Sequencing

Cited by: 5

Authors:

Gordon, Derek [1]

Finch, Stephen J. [2]

De La Vega, Francisco

Affiliations:

[1] Rutgers State Univ, Dept Genet, Piscataway, NJ USA

[2] SUNY Stony Brook, Dept Appl Math & Stat, Stony Brook, NY 11794 USA

Source:

HUMAN HEREDITY | 2011 / Vol. 71 / Issue 02

Keywords:

Statistic; Genetics; Noncentrality parameter; Power; Misclassification; Sequence; Expectation-maximization; Multi-locus; CONTROL GENETIC ASSOCIATION; FALSE DISCOVERY RATE; GENOTYPE MISCLASSIFICATION; MISSING HERITABILITY; SAMPLE-SIZE; ERROR RATE; POWER; PHENOTYPE; HAPLOTYPE; DISEASES;

DOI:

10.1159/000325590

Chinese Library Classification (CLC) number:

Q3 [Genetics];

Discipline classification code:

071007 ; 090102 ;

Abstract:

Genome-wide association studies (GWAS) have been successful in identifying common genetic variation reproducibly associated with disease. However, most associated variants confer very small risk and, after meta-analysis of large cohorts, a large fraction of expected heritability still remains unexplained. A possible explanation is that rare variants currently undetected by GWAS with SNP arrays could contribute a large fraction of risk when present in cases. This concept has spurred great interest in exploring the role of rare variants in disease. As the cost of sequencing continues to plummet, it is becoming feasible to directly sequence case-control samples for testing disease association including rare variants. We have developed a test statistic that allows for association testing among cases and controls using data directly from sequencing reads. In addition, our method allows for random errors in reads. We determine the probability of a true genotype call based on the observed base pair reads using the expectation-maximization algorithm. We apply the SumStat procedure to obtain a single statistic for a group of multiple rare variant loci. We document the validity of our method through simulations. Our results suggest that our statistic maintains the correct type I error rate, even in the presence of differential misclassification for sequence reads, and that it has good power under a number of scenarios. Finally, our SumStat results show power at least as good as the maximum single locus results. Copyright (C) 2011 S. Karger AG, Basel
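
The abstract's idea of inferring true genotype probabilities from error-prone reads with expectation-maximization can be illustrated with a minimal sketch. This is not the authors' statistic or implementation: it assumes a single biallelic locus, diploid individuals, a known symmetric per-read error rate, and binomially distributed variant-read counts. The function name `em_genotype_posteriors` and all parameters are hypothetical.

```python
from math import comb


def em_genotype_posteriors(reads, error_rate=0.01, n_iter=100, tol=1e-9):
    """Illustrative EM for genotype frequencies at one biallelic locus.

    reads: non-empty list of (variant_read_count, total_read_count) pairs,
           one per individual.
    error_rate: ASSUMED known per-read error probability, 0 < error_rate < 1.
    Returns (genotype_frequencies, per_individual_posteriors), where
    genotypes are indexed by the number of variant alleles (0, 1, 2).
    """
    # Probability that a single read shows the variant allele, per genotype.
    p_variant = [error_rate, 0.5, 1.0 - error_rate]

    # Binomial likelihood of each individual's reads under each genotype.
    lik = [
        [comb(n, k) * p ** k * (1.0 - p) ** (n - k) for p in p_variant]
        for k, n in reads
    ]

    freqs = [1.0 / 3.0] * 3  # uniform starting values
    post = []
    for _ in range(n_iter):
        # E-step: posterior genotype probabilities given current frequencies.
        post = []
        for row in lik:
            weights = [f * l for f, l in zip(freqs, row)]
            total = sum(weights)
            post.append([w / total for w in weights])

        # M-step: update frequencies as the mean posterior over individuals.
        new_freqs = [sum(p[g] for p in post) / len(post) for g in range(3)]

        converged = max(abs(a - b) for a, b in zip(new_freqs, freqs)) < tol
        freqs = new_freqs
        if converged:
            break
    return freqs, post
```

For example, `em_genotype_posteriors([(0, 20), (10, 20), (20, 20), (0, 20)])` returns estimated genotype frequencies together with one posterior vector per individual. A case-control test would then compare such quantities between the two groups, which this sketch does not attempt.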


Pages: 113-125

Number of pages: 13

20 items in total

[1] A New Expectation-Maximization Statistical Test for Case-Control Association Studies Considering Rare Variants Obtained by High-Throughput Sequencing (vol 71, pg 113, 2011)
Gordon, Derek
[J]. HUMAN HEREDITY, 2011, 72 (01) : 53 - 53
[2] Association Testing of Clustered Rare Causal Variants in Case-Control Studies
Lin, Wan-Yu
[J]. PLOS ONE, 2014, 9 (04):
[3] A combined association test for rare variants using family and case-control data
Peng-Lin Lin
Wei-Yun Tsai
Ren-Hua Chung
[J]. BMC Proceedings, 10 (Suppl 7)
[4] Beyond Rare-Variant Association Testing: Pinpointing Rare Causal Variants in Case-Control Sequencing Study
Lin, Wan-Yu
[J]. SCIENTIFIC REPORTS, 2016, 6
[6] A Comparison of Association Methods for Fine-mapping Rare Variants in Case-Control Studies
Nickchi, Payman
Karunarathna, Charith
Graham, Jinko
[J]. GENETIC EPIDEMIOLOGY, 2021, 45 (07) : 778 - 778
[7] A powerful allele based test for rare markers in case-control association studies
Jonker, Marianne A.
Bezzina, Connie R.
Tanck, Michael W. T.
[J]. GENETIC EPIDEMIOLOGY, 2015, 39 (07) : 559 - 559
[8] Blindly Using Wald's Test Can Miss Rare Disease-Causal Variants in Case-Control Association Studies
Xing, Guan
Lin, Chang-Yun
Wooding, Stephen P.
Xing, Chao
[J]. ANNALS OF HUMAN GENETICS, 2012, 76 : 168 - 177
[9] Association of ultra-rare coding variants with genetic generalized epilepsy: A case-control whole exome sequencing study
Koko, Mahmoud
Motelow, Joshua E.
Stanley, Kate E.
Bobbili, Dheeraj R.
Dhindsa, Ryan S.
May, Patrick
[J]. EPILEPSIA, 2022, 63 (03) : 723 - 735
[10] Efficient unified rare variant association test by modeling the population genetic distribution in case-control studies
Li, Huilin
Chen, Jinbo
[J]. GENETIC EPIDEMIOLOGY, 2016, 40 (07) : 579 - 590
