distAngsd: Fast and Accurate Inference of Genetic Distances for Next-Generation Sequencing Data

被引:1
|
作者
Zhao, Lei [1 ]
Nielsen, Rasmus [1 ,2 ,3 ]
Korneliussen, Thorfinn Sand [1 ]
机构
[1] Univ Copenhagen, Globe Inst, Sect Geogenet, Oster Voldgade 5-7, DK-1350 Copenhagen K, Denmark
[2] Univ Calif Berkeley, Dept Integrat Biol, 3040 Valley Life Sci Bldg 3140, Berkeley, CA 94720 USA
[3] Univ Calif Berkeley, Dept Stat, 3040 Valley Life Sci Bldg 3140, Berkeley, CA 94720 USA
关键词
phylogeny reconstruction; genotype likelihood; genetic distance; high-throughput sequencing; next-generation sequencing; molecular evolution; maximum likelihood; expectation maximization; HAPLOTYPE RECONSTRUCTION; MAXIMUM-LIKELIHOOD; DNA; MITOCHONDRIAL; SITES; SUBSTITUTIONS; ASSOCIATION; FRAMEWORK; GENOTYPE; GENOMES;
D O I
10.1093/molbev/msac119
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Commonly used methods for inferring phylogenies were designed before the emergence of high-throughput sequencing and can generally not accommodate the challenges associated with noisy, diploid sequencing data. In many applications, diploid genomes are still treated as haploid through the use of ambiguity characters; while the uncertainty in genotype calling-arising as a consequence of the sequencing technology-is ignored. In order to address this problem, we describe two new probabilistic approaches for estimating genetic distances: distAngsd-geno and distAngsd-nuc, both implemented in a software suite named distAngsd. These methods are specifically designed for next-generation sequencing data, utilize the full information from the data, and take uncertainty in genotype calling into account. Through extensive simulations, we show that these new methods are markedly more accurate and have more stable statistical behaviors than other currently available methods for estimating genetic distances-even for very low depth data with high error rates.
引用
收藏
页码:1084 / 1097
页数:14
相关论文
共 50 条
  • [31] Visualizing next-generation sequencing data with JBrowse
    Westesson, Oscar
    Skinner, Mitchell
    Holmes, Ian
    BRIEFINGS IN BIOINFORMATICS, 2013, 14 (02) : 172 - 177
  • [32] Next-generation sequencing: adjusting to data overload
    Baker, Monya
    NATURE METHODS, 2010, 7 (07) : 495 - 499
  • [33] Identification of indels in next-generation sequencing data
    Ratan, Aakrosh
    Olson, Thomas L.
    Loughran, Thomas P., Jr.
    Miller, Webb
    BMC BIOINFORMATICS, 2015, 16
  • [34] Karect: accurate correction of substitution, insertion and deletion errors for next-generation sequencing data
    Allam, Amin
    Kalnis, Panos
    Solovyev, Victor
    BIOINFORMATICS, 2015, 31 (21) : 3421 - 3428
  • [35] Pisces: an accurate and versatile variant caller for somatic and germline next-generation sequencing data
    Dunn, Tamsen
    Berry, Gwenn
    Emig-Agius, Dorothea
    Jiang, Yu
    Lei, Serena
    Iyer, Anita
    Udar, Nitin
    Chuang, Han-Yu
    Hegarty, Jeff
    Dickover, Michael
    Klotzle, Brandy
    Robbins, Justin
    Bibikova, Marina
    Peeters, Marc
    Stromberg, Michael
    BIOINFORMATICS, 2019, 35 (09) : 1579 - 1581
  • [36] Identification of indels in next-generation sequencing data
    Aakrosh Ratan
    Thomas L Olson
    Thomas P Loughran
    Webb Miller
    BMC Bioinformatics, 16
  • [37] Applications and data analysis of next-generation sequencing
    Vogl, Ina
    Benet-Pages, Anna
    Eck, Sebastian H.
    Kuhn, Marius
    Vosberg, Sebastian
    Greif, Philipp A.
    Metzeler, Klaus H.
    Biskup, Saskia
    Mueller-Reible, Clemens
    Klein, Hanns-Georg
    LABORATORIUMSMEDIZIN-JOURNAL OF LABORATORY MEDICINE, 2013, 37 (06): : 305 - 315
  • [38] Next-generation sequencing and the evolution of data sharing
    de Macena Sobreira, Nara Lygia
    Hamosh, Ada
    AMERICAN JOURNAL OF MEDICAL GENETICS PART A, 2021, 185 (09) : 2633 - 2635
  • [39] Assembly algorithms for next-generation sequencing data
    Miller, Jason R.
    Koren, Sergey
    Sutton, Granger
    GENOMICS, 2010, 95 (06) : 315 - 327
  • [40] HLA-HD: An accurate HLA typing algorithm for next-generation sequencing data
    Kawaguchi, Shuji
    Higasa, Koichiro
    Shimizu, Masakazu
    Yamada, Ryo
    Matsuda, Fumihiko
    HUMAN MUTATION, 2017, 38 (07) : 788 - 797