A Simple and Fast Two-Locus Quality Control Test to Detect False Positives Due to Batch Effects in Genome-Wide Association Studies

被引:10
|
作者
Lee, Sang Hong [1 ]
Nyholt, Dale R. [1 ]
Macgregor, Stuart [1 ]
Henders, Anjali K. [1 ]
Zondervan, Krina T. [2 ]
Montgomery, Grant W. [1 ]
Visscher, Peter M. [1 ]
机构
[1] Queensland Inst Med Res, Herston, Qld 4006, Australia
[2] Univ Oxford, John Radcliffe Hosp, Nuffield Dept Obstet & Gynaecol, Oxford OX3 9DU, England
基金
英国惠康基金;
关键词
genome-wide association study; batch effects; genotyping errors; linear model-based quality control; GENOTYPING ERRORS; LINKAGE; ENDOMETRIOSIS; HERITABILITY; FAMILIES; RISK;
D O I
10.1002/gepi.20541
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The impact of erroneous genotypes having passed standard quality control (QC) can be severe in genome-wide association studies, genotype imputation, and estimation of heritability and prediction of genetic risk based on single nucleotide polymorphisms (SNP). To detect such genotyping errors, a simple two-locus QC method, based on the difference in test statistic of association between single SNPs and pairs of SNPs, was developed and applied. The proposed approach could detect many problematic SNPs with statistical significance even when standard single SNP QC analyses fail to detect them in real data. Depending on the data set used, the number of erroneous SNPs that were not filtered out by standard single SNP QC but detected by the proposed approach varied from a few hundred to thousands. Using simulated data, it was shown that the proposed method was powerful and performed better than other tested existing methods. The power of the proposed approach to detect erroneous genotypes was similar to 80% for a 3% error rate per SNP. This novel QC approach is easy to implement and computationally efficient, and can lead to a better quality of genotypes for subsequent genotype-phenotype investigations. Genet. Epidemiol. 34:854-862, 2010. (C) 2010 Wiley-Liss, Inc.
引用
收藏
页码:854 / 862
页数:9
相关论文
共 50 条
  • [31] GWAtoolbox: an R package for fast quality control and handling of genome-wide association studies meta-analysis data
    Fuchsberger, Christian
    Taliun, Daniel
    Pramstaller, Peter P.
    Pattaro, Cristian
    BIOINFORMATICS, 2012, 28 (03) : 444 - 445
  • [32] SQC: secure quality control for meta-analysis of genome-wide association studies
    Huang, Zhicong
    Lin, Huang
    Fellay, Jacques
    Kutalik, Zoltan
    Hubaux, Jean-Pierre
    BIOINFORMATICS, 2017, 33 (15) : 2273 - 2280
  • [33] A Novel Single Nucleotide Polymorphisms Quality Control Method in Genome-Wide Association Studies
    Sun, Yuliang
    Li, Renfa
    Liao, Bo
    Li, Xiong
    Cao, Zhi
    JOURNAL OF COMPUTATIONAL AND THEORETICAL NANOSCIENCE, 2014, 11 (07) : 1649 - 1652
  • [34] GWASTools: an R/Bioconductor package for quality control and analysis of genome-wide association studies
    Gogarten, Stephanie M.
    Bhangale, Tushar
    Conomos, Matthew P.
    Laurie, Cecelia A.
    McHugh, Caitlin P.
    Painter, Ian
    Zheng, Xiuwen
    Crosslin, David R.
    Levine, David
    Lumley, Thomas
    Nelson, Sarah C.
    Rice, Kenneth
    Shen, Jess
    Swarnkar, Rohit
    Weir, Bruce S.
    Laurie, Cathy C.
    BIOINFORMATICS, 2012, 28 (24) : 3329 - 3331
  • [35] On using linkage signals to improve genome-wide association studies - Weighted or stratified false discovery control?
    Sun, L.
    GENETIC EPIDEMIOLOGY, 2007, 31 (05) : 498 - 499
  • [36] Quality control in genome-wide association studies revisited: A critical evaluation of the standard methods
    Brudermann, Hanna
    Rausch, Tanja K.
    Koenig, Inke R.
    GENETIC EPIDEMIOLOGY, 2020, 44 (05) : 471 - 471
  • [37] A new association test based on disease allele selection for case–control genome-wide association studies
    Zhongxue Chen
    BMC Genomics, 15
  • [38] A Fast Multi-Locus Ridge Regression Algorithm for High-Dimensional Genome-Wide Association Studies
    Zhang, Jin
    Chen, Min
    Wen, Yangjun
    Zhang, Yin
    Lu, Yunan
    Wang, Shengmeng
    Chen, Juncong
    FRONTIERS IN GENETICS, 2021, 12
  • [39] On Quality Control Measures in Genome-Wide Association Studies: A Test to Assess the Genotyping Quality of Individual Probands in Family-Based Association Studies and an Application to the HapMap Data
    Fardo, David W.
    Ionita-Laza, Iuliana
    Lange, Christoph
    PLOS GENETICS, 2009, 5 (07)
  • [40] PEPIS: A Pipeline for Estimating Epistatic Effects in Quantitative Trait Locus Mapping and Genome-Wide Association Studies
    Zhang, Wenchao
    Dai, Xinbin
    Wang, Qishan
    Xu, Shizhong
    Zhao, Patrick X.
    PLOS COMPUTATIONAL BIOLOGY, 2016, 12 (05)