Estimating haplotype frequencies and standard errors for multiple single nucleotide polymorphisms

被引:63
|
作者
Li, SSY
Khalid, N
Carlson, C
Zhao, LP
机构
[1] Fred Hutchinson Canc Res Ctr, Div Publ Hlth Sci, Seattle, WA 98109 USA
[2] Univ Washington, Dept Genome Sci, Seattle, WA 98195 USA
关键词
estimating equation; haplotype; Hardy-Weinberg equilibrium; single nucleotide polymorphism (SNP);
D O I
10.1093/biostatistics/4.4.513
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Estimating haplotype frequencies becomes increasingly important in the mapping of complex disease genes, as millions of single nucleotide polymorphisms (SNPs) are being identified and genotyped. When genotypes at multiple SNP loci are gathered from unrelated individuals, haplotype frequencies can be accurately estimated using expectation-maximization (EM) algorithms (Excoffier and Slatkin, 1995; Hawley and Kidd, 1995; Long et al., 1995), with standard errors estimated using bootstraps. However, because the number of possible haplotypes increases exponentially with the number of SNPs, handling data with a large number of SNPs poses a computational challenge for the EM methods and for other haplotype inference methods. To solve this problem, Niu and colleagues, in their Bayesian haplotype inference paper (Niu et al., 2002), introduced a computational algorithm called progressive ligation (PL). But their Bayesian method has a limitation on the number of subjects (no more than 100 subjects in the current implementation of the method). In this paper, we propose a new method in which we use the same likelihood formulation as in Excoffier and Slatkin's EM algorithm and apply the estimating equation idea and the PL computational algorithm with some modifications. Our proposed method can handle data sets with large number of SNPs as well as large numbers of subjects. Simultaneously, our method estimates standard errors efficiently, using the sandwich-estimate from the estimating equation, rather than the bootstrap method. Additionally, our method admits missing data and produces valid estimates of parameters and their standard errors under the assumption that the missing genotypes are missing at random in the sense defined by Rubin (1976).
引用
收藏
页码:513 / 522
页数:10
相关论文
共 50 条
  • [21] True pedigree errors more frequent than apparent errors for single nucleotide polymorphisms
    Gordon, D
    Heath, SC
    Ott, J
    HUMAN HEREDITY, 1999, 49 (02) : 65 - 70
  • [22] HAPLOTYPE FREQUENCIES OF 3 POLYMORPHISMS AT THE TIMP LOCUS
    ALDRED, MA
    WRIGHT, AF
    MOLECULAR AND CELLULAR PROBES, 1994, 8 (04) : 333 - 334
  • [23] An artificial neural network for estimating haplotype frequencies
    Cartier, KC
    Baechle, D
    BMC GENETICS, 2005, 6 (Suppl 1)
  • [24] An artificial neural network for estimating haplotype frequencies
    Kevin C Cartier
    Daniel Baechle
    BMC Genetics, 6
  • [25] Characteristic Imprint of Single Nucleotide Polymorphisms in Multiple Sclerosis
    Zoltan Szolnoki
    Andras Kondacs
    Yvette Mandi
    Ferenc Somogyvari
    Journal of Molecular Neuroscience, 2009, 38
  • [26] Characteristic Imprint of Single Nucleotide Polymorphisms in Multiple Sclerosis
    Szolnoki, Zoltan
    Kondacs, Andras
    Mandi, Yvette
    Somogyvari, Ferenc
    JOURNAL OF MOLECULAR NEUROSCIENCE, 2009, 38 (02) : 166 - 172
  • [27] SELPLG and SELP single nucleotide polymorphisms in multiple sclerosis
    Fenoglio, C
    Galimberti, D
    Ban, M
    Maranian, M
    Scalabrini, D
    Venturelli, E
    Piccio, L
    De Riz, M
    Yeo, TW
    Goris, A
    Grey, J
    Bresolin, N
    Scarpini, E
    Compston, A
    Sawcer, S
    MULTIPLE SCLEROSIS, 2005, 11 : S118 - S118
  • [28] Flexible methods for estimating genetic distances from single nucleotide polymorphisms
    Joly, Simon
    Bryant, David
    Lockhart, Peter J.
    METHODS IN ECOLOGY AND EVOLUTION, 2015, 6 (08): : 938 - 948
  • [29] Identification and haplotype analysis of single nucleotide polymorphisms in AMPD1 gene.
    Toyama, K
    Morisaki, H
    Kitamura, Y
    Kamatani, N
    Morisaki, T
    AMERICAN JOURNAL OF HUMAN GENETICS, 2002, 71 (04) : 539 - 539
  • [30] Haplotype analysis of two single nucleotide polymorphisms in ANP of essential hypertension in Xinjiang Kazakhs
    nanfang, Li
    yanmin, Zhang
    Tao, Li
    JOURNAL OF HYPERTENSION, 2006, 24 : 240 - 240