xAtlas: scalable small variant calling across heterogeneous next-generation sequencing experiments

Cited by: 3
Authors
Farek, Jesse [1]
Hughes, Daniel [1,2]
Salerno, William [1,3]
Zhu, Yiming [1]
Pisupati, Aishwarya [1]
Mansfield, Adam [1,3]
Krasheninina, Olga [1,3]
English, Adam C. [1]
Metcalf, Ginger [1]
Boerwinkle, Eric [1,4]
Muzny, Donna M. [1]
Gibbs, Richard [1]
Khan, Ziad [1]
Sedlazeck, Fritz J. [1]
Affiliations
[1] Baylor Coll Med, Human Genome Sequencing Ctr, One Baylor Plaza, Houston, TX 77030 USA
[2] Columbia Univ, Inst Genom Med, New York, NY USA
[3] Regeneron Pharmaceut Inc, Tarrytown, NY USA
[4] Univ Texas Hlth Sci Ctr Houston, Human Genet Ctr, El Paso, TX USA
Source
GIGASCIENCE | 2023 / Volume 12
Keywords
GENOTYPE; GENOMES; SNP
DOI
10.1093/gigascience/giac125
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification code
07; 0710; 09
Abstract
Background: The growing volume and heterogeneity of next-generation sequencing (NGS) data complicate the further optimization of identifying DNA variation, especially considering that curated high-confidence variant call sets frequently used to validate these methods are generally developed from the analysis of comparatively small and homogeneous sample sets. Findings: We have developed xAtlas, a single-sample variant caller for single-nucleotide variants (SNVs) and small insertions and deletions (indels) in NGS data. xAtlas features rapid runtimes, support for CRAM and gVCF file formats, and retraining capabilities. xAtlas reports SNVs with 99.11% recall and 98.43% precision across a reference HG002 sample at 60x whole-genome coverage in less than 2 CPU hours. Applying xAtlas to 3,202 samples at 30x whole-genome coverage from the 1000 Genomes Project achieves an average runtime of 1.7 hours per sample and a clear separation of the individual populations in principal component analysis across called SNVs. Conclusions: xAtlas is a fast, lightweight, and accurate SNV and small indel calling method. Source code for xAtlas is available under a BSD 3-clause license at https://github.com/jfarek/xatlas.
Pages: 7