Fast and accurate haplotype frequency estimation for large haplotype vectors from pooled DNA data

被引:7
|
作者
Iliadis, Alexandros
Anastassiou, Dimitris
Wang, Xiaodong [1 ]
机构
[1] Columbia Univ, Ctr Computat Biol & Bioinformat, New York, NY 10027 USA
来源
BMC GENETICS | 2012年 / 13卷
关键词
LARGE-SCALE ASSOCIATION; LINKAGE-DISEQUILIBRIUM; POPULATION; IDENTIFICATION; INFORMATION; EFFICIENCY; INFERENCE; SCREEN; TOOL;
D O I
10.1186/1471-2156-13-94
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Background: Typically, the first phase of a genome wide association study (GWAS) includes genotyping across hundreds of individuals and validation of the most significant SNPs. Allelotyping of pooled genomic DNA is a common approach to reduce the overall cost of the study. Knowledge of haplotype structure can provide additional information to single locus analyses. Several methods have been proposed for estimating haplotype frequencies in a population from pooled DNA data. Results: We introduce a technique for haplotype frequency estimation in a population from pooled DNA samples focusing on datasets containing a small number of individuals per pool (2 or 3 individuals) and a large number of markers. We compare our method with the publicly available state-of-the-art algorithms HIPPO and HAPLOPOOL on datasets of varying number of pools and marker sizes. We demonstrate that our algorithm provides improvements in terms of accuracy and computational time over competing methods for large number of markers while demonstrating comparable performance for smaller marker sizes. Our method is implemented in the "Tree-Based Deterministic Sampling Pool" (TDSPool) package which is available for download at www.ee.columbia.edu/similar to anastas/tdspool. Conclusions: Using a tree-based determinstic sampling technique we present an algorithm for haplotype frequency estimation from pooled data. Our method demonstrates superior performance in datasets with large number of markers and could be the method of choice for haplotype frequency estimation in such datasets.
引用
下载
收藏
页数:10
相关论文
共 50 条
  • [1] Fast and accurate haplotype frequency estimation for large haplotype vectors from pooled DNA data
    Alexandros Iliadis
    Dimitris Anastassiou
    Xiaodong Wang
    BMC Genetics, 13
  • [2] Accurate estimation of haplotype frequency from pooled sequencing data and cost-effective identification of rare haplotype carriers by overlapping pool sequencing
    Cao, Chang-Chang
    Sun, Xiao
    BIOINFORMATICS, 2015, 31 (04) : 515 - 522
  • [3] Comparisons of two methods for haplotype reconstruction and haplotype frequency estimation from population data
    Zhang, SL
    Pakstis, AJ
    Kidd, KK
    Zhao, HY
    AMERICAN JOURNAL OF HUMAN GENETICS, 2001, 69 (04) : 906 - 912
  • [4] Comparisons of two methods for haplotype reconstruction and haplotype frequency estimation from population data - Reply
    Stephens, M
    Smith, NJ
    Donnelly, P
    AMERICAN JOURNAL OF HUMAN GENETICS, 2001, 69 (04) : 912 - 914
  • [5] Efficiency of single-nucleotide polymorphism haplotype estimation from pooled DNA
    Yang, YN
    Zhang, JS
    Hoh, J
    Matsuda, F
    Xu, P
    Lathrop, M
    Ott, J
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (12) : 7225 - 7230
  • [6] Estimation of haplotype frequencies, linkage-disequilibrium measures, and combination of haplotype copies in each pool by use of pooled DNA data
    Ito, T
    Chiku, S
    Inoue, E
    Tomita, M
    Morisaki, T
    Morisaki, H
    Kamatani, N
    AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 72 (02) : 384 - 398
  • [7] Validation of programs for haplotype frequency estimation from registry data
    Eberhard, H. P.
    Feldmann, U.
    Appert, M-L
    Gourraud, P. A.
    Torres, H. Maldonado
    van der Zanden, H.
    Maiers, M.
    Marsh, S.
    Mueller, C. R.
    TISSUE ANTIGENS, 2008, 72 (03): : 264 - 264
  • [8] Identification and Frequency Estimation of Inversion Polymorphisms from Haplotype Data
    Sindi, Suzanne S.
    Raphael, Benjamin J.
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2010, 17 (03) : 517 - 531
  • [9] Identification and Frequency Estimation of Inversion Polymorphisms from Haplotype Data
    Sindi, Suzanne S.
    Raphael, Benjamin J.
    RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY, PROCEEDINGS, 2009, 5541 : 418 - +
  • [10] Estimation of haplotype frequencies and diplotype configuration for each subject using pooled DNA data.
    Ito, T
    Chiku, S
    Inoue, E
    Tomita, M
    Kamatani, N
    AMERICAN JOURNAL OF HUMAN GENETICS, 2002, 71 (04) : 449 - 449