NGSphy: phylogenomic simulation of next-generation sequencing data

被引:5
|
作者
Escalona, Merly [1 ]
Rocha, Sara [1 ]
Posada, David [1 ,2 ,3 ]
机构
[1] Univ Vigo, Dept Biochem Genet & Immunol, Vigo 36310, Spain
[2] Univ Vigo, Biomed Res Ctr CINBIO, Vigo 36310, Spain
[3] Galicia Hlth Res Inst, Vigo 36310, Spain
基金
欧洲研究理事会;
关键词
D O I
10.1093/bioinformatics/bty146
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Advances in sequencing technologies have made it feasible to obtain massive datasets for phylogenomic inference, often consisting of large numbers of loci from multiple species and individuals. The phylogenomic analysis of next-generation sequencing (NGS) data requires a complex computational pipeline where multiple technical and methodological decisions are necessary that can influence the final tree obtained, like those related to coverage, assembly, mapping, variant calling and/or phasing. Results: To assess the influence of these variables we introduce NGSphy, an open-source tool for the simulation of Illumina reads/read counts obtained from haploid/diploid individual genomes with thousands of independent gene families evolving under a common species tree. In order to resemble real NGS experiments, NGSphy includes multiple options to model sequencing coverage (depth) heterogeneity across species, individuals and loci, including off-target or uncaptured loci. For comprehensive simulations covering multiple evolutionary scenarios, parameter values for the different replicates can be sampled from user-defined statistical distributions.
引用
收藏
页码:2506 / 2507
页数:2
相关论文
共 50 条
  • [1] NGSNGS: next-generation simulator for next-generation sequencing data
    Henriksen, Rasmus Amund
    Zhao, Lei
    Korneliussen, Thorfinn Sand
    [J]. BIOINFORMATICS, 2023, 39 (01)
  • [2] A comparison of tools for the simulation of genomic next-generation sequencing data
    Merly Escalona
    Sara Rocha
    David Posada
    [J]. Nature Reviews Genetics, 2016, 17 : 459 - 469
  • [3] A comparison of tools for the simulation of genomic next-generation sequencing data
    Escalona, Merly
    Rocha, Sara
    Posada, David
    [J]. NATURE REVIEWS GENETICS, 2016, 17 (08) : 459 - 469
  • [4] Indexing Next-Generation Sequencing data
    Jalili, Vahid
    Matteucci, Matteo
    Masseroli, Marco
    Ceri, Stefano
    [J]. INFORMATION SCIENCES, 2017, 384 : 90 - 109
  • [5] Next-generation sequencing of the next generation
    Darren J. Burgess
    [J]. Nature Reviews Genetics, 2011, 12 : 78 - 79
  • [6] Phylogenomic analysis in Latilactobacillus sakei by using polymorphisms detected by next-generation sequencing
    Nishiyama, Chisato
    Sekiguchi, Suguru
    Sugihara, Yu
    Nishikawa, Minami
    Makita, Natsu
    Segawa, Tenta
    Terasaki, Momoka
    Takagi, Hiroki
    Koyanagi, Takashi
    [J]. BIOSCIENCE OF MICROBIOTA FOOD AND HEALTH, 2023, 42 (02) : 138 - 142
  • [7] Visualizing next-generation sequencing data with JBrowse
    Westesson, Oscar
    Skinner, Mitchell
    Holmes, Ian
    [J]. BRIEFINGS IN BIOINFORMATICS, 2013, 14 (02) : 172 - 177
  • [8] Next-generation sequencing: adjusting to data overload
    Monya Baker
    [J]. Nature Methods, 2010, 7 : 495 - 499
  • [9] Focus on next-generation sequencing data analysis
    Rusk N.
    [J]. Nature Methods, 2009, 6 (Suppl 11) : S1 - S1
  • [10] Next-generation sequencing: adjusting to data overload
    Baker, Monya
    [J]. NATURE METHODS, 2010, 7 (07) : 495 - 499