Estimating haplotype frequencies and standard errors for multiple single nucleotide polymorphisms

被引:64
|
作者
Li, SSY
Khalid, N
Carlson, C
Zhao, LP
机构
[1] Fred Hutchinson Canc Res Ctr, Div Publ Hlth Sci, Seattle, WA 98109 USA
[2] Univ Washington, Dept Genome Sci, Seattle, WA 98195 USA
关键词
estimating equation; haplotype; Hardy-Weinberg equilibrium; single nucleotide polymorphism (SNP);
D O I
10.1093/biostatistics/4.4.513
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Estimating haplotype frequencies becomes increasingly important in the mapping of complex disease genes, as millions of single nucleotide polymorphisms (SNPs) are being identified and genotyped. When genotypes at multiple SNP loci are gathered from unrelated individuals, haplotype frequencies can be accurately estimated using expectation-maximization (EM) algorithms (Excoffier and Slatkin, 1995; Hawley and Kidd, 1995; Long et al., 1995), with standard errors estimated using bootstraps. However, because the number of possible haplotypes increases exponentially with the number of SNPs, handling data with a large number of SNPs poses a computational challenge for the EM methods and for other haplotype inference methods. To solve this problem, Niu and colleagues, in their Bayesian haplotype inference paper (Niu et al., 2002), introduced a computational algorithm called progressive ligation (PL). But their Bayesian method has a limitation on the number of subjects (no more than 100 subjects in the current implementation of the method). In this paper, we propose a new method in which we use the same likelihood formulation as in Excoffier and Slatkin's EM algorithm and apply the estimating equation idea and the PL computational algorithm with some modifications. Our proposed method can handle data sets with large number of SNPs as well as large numbers of subjects. Simultaneously, our method estimates standard errors efficiently, using the sandwich-estimate from the estimating equation, rather than the bootstrap method. Additionally, our method admits missing data and produces valid estimates of parameters and their standard errors under the assumption that the missing genotypes are missing at random in the sense defined by Rubin (1976).
引用
收藏
页码:513 / 522
页数:10
相关论文
共 50 条
  • [1] Bayesian haplotype inference for multiple linked single-nucleotide polymorphisms
    Niu, TH
    Qin, ZHS
    Xu, XP
    Liu, JS
    AMERICAN JOURNAL OF HUMAN GENETICS, 2002, 70 (01) : 157 - 169
  • [2] Haplotype tagging single nucleotide polymorphisms and association studies
    Thompson, D
    Stram, D
    Goldgar, D
    Witte, JS
    HUMAN HEREDITY, 2003, 56 (1-3) : 48 - 55
  • [3] Single Nucleotide Polymorphisms Caused by Assembly Errors
    Kleffe, Juergen
    Weissmann, Robert
    Schmitzberger, Florian F.
    GENOMICS INSIGHTS, 2010, 3 : 1 - 8
  • [4] Leveraging reads that span multiple single nucleotide polymorphisms for haplotype inference from sequencing data
    Yang, Wen-Yun
    Hormozdiari, Farhad
    Wang, Zhanyong
    He, Dan
    Pasaniuc, Bogdan
    Eskin, Eleazar
    BIOINFORMATICS, 2013, 29 (18) : 2245 - 2252
  • [5] Accurate haplotype inference for multiple linked single-nucleotide polymorphisms using sibship data
    Liu, Peng-Yuan
    Lu, Yan
    Deng, Hong-Wen
    GENETICS, 2006, 174 (01) : 499 - 509
  • [6] Haplotype information and linkage disequilibrium mapping for single nucleotide polymorphisms
    Lu, X
    Niu, TH
    Liu, JS
    GENOME RESEARCH, 2003, 13 (09) : 2112 - 2117
  • [7] Haplotype tagging single nucleotide polymorphisms and association studies.
    Thompson, D
    Stram, D
    Goldgar, D
    Witte, JS
    AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 73 (05) : 474 - 474
  • [8] Modeling evolution of relative frequencies of single nucleotide polymorphisms
    Polanski, A
    Kimmel, M
    PROCEEDINGS OF THE SECOND IASTED INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING, 2004, : 43 - 48
  • [9] Accuracy of haplotype reconstruction from haplotype-tagging single-nucleotide polymorphisms
    Forton, J
    Kwiatkowski, D
    Rockett, K
    Luoni, G
    Kimber, M
    Hull, J
    AMERICAN JOURNAL OF HUMAN GENETICS, 2005, 76 (03) : 438 - 448
  • [10] Recovering frequencies of known haplotype blocks from single-nucleotide polymorphism allele frequencies
    Pe'er, I
    Beckmann, JS
    GENETICS, 2004, 166 (04) : 2001 - 2006