A study on fast calling variants from next-generation sequencing data using decision tree

被引:9
|
作者
Li, Zhentang [1 ,2 ]
Wang, Yi [3 ,4 ,5 ]
Wang, Fei [1 ,2 ]
机构
[1] Shanghai Key Lab Intelligent Informat Proc, Shanghai, Peoples R China
[2] Fudan Univ, Sch Comp Sci & Technol, Shanghai, Peoples R China
[3] Fudan Univ, MOE Key Lab Contemporary Anthropol, Shanghai 200438, Peoples R China
[4] Fudan Univ, Collaborat Innovat Ctr Genet & Dev Biol, State Key Lab Genet Engn, Shanghai 200438, Peoples R China
[5] Fudan Univ, Sch Life Sci, Shanghai 200438, Peoples R China
来源
BMC BIOINFORMATICS | 2018年 / 19卷
基金
中国国家自然科学基金;
关键词
Next-generation sequencing; Variant calling; Decision tree; FRAMEWORK; FORMAT;
D O I
10.1186/s12859-018-2147-9
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The rapid development of next-generation sequencing (NGS) technology has continuously been refreshing the throughput of sequencing data. However, due to the lack of a smart tool that is both fast and accurate, the analysis task for NGS data, especially those with low coverage, remains challenging. Results: We proposed a decision-tree based variant calling algorithm. Experiments on a set of real data indicate that our algorithm achieves high accuracy and sensitivity for SNVs and indels and shows good adaptability on low-coverage data. In particular, our algorithm is obviously faster than 3 widely used tools in our experiments. Conclusions: We implemented our algorithm in a software named Fuwa and applied it together with 4 well-known variant callers, i.e., Platypus, GATK-UnifiedGenotyper, GATK-HaplotypeCaller and SAMtools, to three sequencing data sets of a well-studied sample NA12878, which were produced by whole-genome, whole-exome and low-coverage whole-genome sequencing technology respectively. We also conducted additional experiments on the WGS data of 4 newly released samples that have not been used to populate dbSNP.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Genotype calling and phasing using next-generation sequencing reads and a haplotype scaffold
    Menelaou, Androniki
    Marchini, Jonathan
    BIOINFORMATICS, 2013, 29 (01) : 84 - 91
  • [22] Next-Generation Sequencing and Eating of the Tree of Knowledge
    Pollyea, Daniel A.
    CLINICAL CANCER RESEARCH, 2018, 24 (23) : 5790 - 5791
  • [23] KRAS Variants in Different Tumor Types: A Study Using Next-Generation Sequencing (NGS)
    Pierce, Kirsten
    Putra, Juan
    de Abreu, Francine
    Peterson, Jason
    Pipas, J. Marc
    Smith, Kerrington
    Tsongalis, Gregory
    Liu, Xiaoying
    LABORATORY INVESTIGATION, 2015, 95 : 522A - 522A
  • [24] KRAS Variants in Different Tumor Types: A Study Using Next-Generation Sequencing (NGS)
    Pierce, Kirsten
    Putra, Juan
    de Abreu, Francine
    Peterson, Jason
    Pipas, J. Marc
    Smith, Kerrington
    Tsongalis, Gregory
    Liu, Xiaoying
    MODERN PATHOLOGY, 2015, 28 : 522A - 522A
  • [25] OnlineCall: fast online parameter estimation and base calling for illumina's next-generation sequencing
    Das, Shreepriya
    Vikalo, Haris
    BIOINFORMATICS, 2012, 28 (13) : 1677 - 1683
  • [26] Empirical Bayes single nucleotide variant-calling for next-generation sequencing data
    Karimnezhad, Ali
    Perkins, Theodore J.
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [27] A review of somatic single nucleotide variant calling algorithms for next-generation sequencing data
    Xu, Chang
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2018, 16 : 15 - 24
  • [28] Evaluating Variant Calling Tools for Non-Matched Next-Generation Sequencing Data
    Sarah Sandmann
    Aniek O. de Graaf
    Mohsen Karimi
    Bert A. van der Reijden
    Eva Hellström-Lindberg
    Joop H. Jansen
    Martin Dugas
    Scientific Reports, 7
  • [29] ASEQ: fast allele-specific studies from next-generation sequencing data
    Alessandro Romanel
    Sara Lago
    Davide Prandi
    Andrea Sboner
    Francesca Demichelis
    BMC Medical Genomics, 8
  • [30] Evaluating Variant Calling Tools for Non-Matched Next-Generation Sequencing Data
    Sandmann, Sarah
    de Graaf, Aniek O.
    Karimi, Mohsen
    van der Reijden, Bert A.
    Hellstrom-Lindberg, Eva
    Jansen, Joop H.
    Dugas, Martin
    SCIENTIFIC REPORTS, 2017, 7