The accuracy of the vast amount of genotypic information generated by high-throughput genotyping technologies is crucial in haplotype analyses and linkage-disequilibrium mapping for complex diseases. To date, most automated programs lack quality measures for the allele calls; therefore, human interventions, which are both labor intensive and error prone, have to be performed. Here, we propose a novel genotype clustering algorithm, GeneScore, based on a bivariate t-mixture model, which assigns a set of probabilities for each data point belonging to the candidate genotype clusters. Furthermore, we describe an expectation-maximization (EM) algorithm for haplotype phasing, GenoSpectrum (GS)-EM, which can use probabilistic multilocus genotype matrices (called "GenoSpectrum") as inputs. Combining these two model-based algorithms, we can perform haplotype inference directly on raw readouts from a genotyping machine, such as the TaqMan assay. By using both simulated and real data sets, we demonstrate the advantages of our probabilistic approach over the current genotype scoring methods, in terms of both the accuracy of haplotype inference and the statistical power of haplotype-based association analyses.
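To make the idea of phasing from probabilistic genotypes concrete, the following is a minimal sketch of an EM estimator of haplotype frequencies that accepts per-locus genotype probabilities instead of hard calls. It is not the authors' GS-EM implementation: the function name `em_haplotype_freqs`, the input layout `geno_probs[i][l][g]`, the assumption of biallelic loci, and the choice to treat the per-locus probabilities as independent weights on each haplotype pair are all illustrative assumptions.

```python
from itertools import product


def em_haplotype_freqs(geno_probs, n_iter=200, tol=1e-10):
    """Estimate haplotype frequencies from probabilistic genotypes (sketch).

    geno_probs[i][l][g] is the assumed probability that individual i carries
    g copies (0, 1, or 2) of allele 1 at biallelic locus l.
    """
    n_loci = len(geno_probs[0])
    haps = list(product((0, 1), repeat=n_loci))   # all 2**n_loci haplotypes
    freqs = {h: 1.0 / len(haps) for h in haps}    # uniform starting point

    for _ in range(n_iter):
        counts = {h: 0.0 for h in haps}
        # E-step: posterior weight of every ordered haplotype pair per person.
        for person in geno_probs:
            weights = {}
            for h1, h2 in product(haps, repeat=2):
                w = freqs[h1] * freqs[h2]         # random-mating prior
                for l in range(n_loci):
                    w *= person[l][h1[l] + h2[l]]  # genotype probability
                weights[(h1, h2)] = w
            total = sum(weights.values())
            if total == 0.0:
                continue  # no pair is compatible; this person adds nothing
            for (h1, h2), w in weights.items():
                counts[h1] += w / total
                counts[h2] += w / total

        # M-step: normalize expected haplotype counts into frequencies.
        n_chrom = sum(counts.values())
        if n_chrom == 0.0:
            raise ValueError("no individual is compatible with any haplotype pair")
        new_freqs = {h: c / n_chrom for h, c in counts.items()}
        delta = max(abs(new_freqs[h] - freqs[h]) for h in haps)
        freqs = new_freqs
        if delta < tol:
            break
    return freqs
```

A hard genotype call corresponds to a row such as `[1.0, 0.0, 0.0]`, while an ambiguous data point from a clustering step might be passed as `[0.9, 0.1, 0.0]`, so uncertain calls contribute partial weight rather than being discarded or forced into one cluster. The enumeration over all haplotype pairs grows exponentially with the number of loci, so this sketch is only practical for a handful of markers.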