Benchmarking variant callers in next-generation and third-generation sequencing analysis

被引:49
|
作者
Pei, Surui [1 ,2 ]
Liu, Tao [2 ]
Ren, Xue [2 ]
Li, Weizhong [3 ]
Chen, Chongjian [2 ]
Xie, Zhi [4 ]
机构
[1] Sun Yat Sen Univ, Zhongshan Ophthalm Ctr, Guangzhou, Peoples R China
[2] Annoroad Gene Technol Beijing Co Ltd, Beijing 100176, Peoples R China
[3] Sun Yat Sen Univ, Zhongshan Sch Med, Guangzhou, Peoples R China
[4] Sun Yat Sen Univ, Zhongshan Ophthalm Ctr, Bioinformat, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
variant callers; germline variant; somatic variant;
D O I
10.1093/bib/bbaa148
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
DNA variants represent an important source of genetic variations among individuals. Next- generation sequencing (NGS) is the most popular technology for genome-wide variant calling. Third-generation sequencing (TGS) has also recently been used in genetic studies. Although many variant callers are available, no single caller can call both types of variants on NGS or TGS data with high sensitivity and specificity. In this study, we systematically evaluated 11 variant callers on 12 NGS and TGS datasets. For germline variant calling, we tested DNAseq and DNAscope modes from Sentieon, HaplotypeCaller mode from GATK and WGS mode from DeepVariant. All the four callers had comparable performance on NGS data and 30x coverage of WGS data was recommended. For germline variant calling on TGS data, we tested DNAseq mode from Sentieon, HaplotypeCaller mode from GATK and PACBIO mode from DeepVariant. All the three callers had similar performance in SNP calling, while DeepVariant outperformed the others in InDel calling. TGS detected more variants than NGS, particularly in complex and repetitive regions. For somatic variant calling on NGS, we tested TNscope and TNseq modes from Sentieon, MuTect2 mode from GATK, NeuSomatic, VarScan2, and Strelka2. TNscope and Mutect2 outperformed the other callers. A higher proportion of tumor sample purity (from 10 to 20%) significantly increased the recall value of calling. Finally, computational costs of the callers were compared and Sentieon required the least computational cost. These results suggest that careful selection of a tool and parameters is needed for accurate SNP or InDel calling under different scenarios.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Characterization of Third-Generation Cephalosporin-Resistant Escherichia coli Isolated from Pigs in Cuba Using Next-Generation Sequencing
    Hernandez-Fillor, Rosa Elena
    Brilhante, Michael
    Marrero-Moreno, Carelia Martha
    Baez, Michel
    Espinosa, Ivette
    Perreten, Vincent
    MICROBIAL DRUG RESISTANCE, 2021, 27 (07) : 1003 - 1010
  • [32] HUMAN DISEASE Next-generation sequencing of the next generation
    Burgess, Darren J.
    NATURE REVIEWS GENETICS, 2011, 12 (02) : 78 - 79
  • [33] An integrative variant analysis suite for whole exome next-generation sequencing data
    Challis, Danny
    Yu, Jin
    Evani, Uday S.
    Jackson, Andrew R.
    Paithankar, Sameer
    Coarfa, Cristian
    Milosavljevic, Aleksandar
    Gibbs, Richard A.
    Yu, Fuli
    BMC BIOINFORMATICS, 2012, 13
  • [34] An integrative variant analysis suite for whole exome next-generation sequencing data
    Danny Challis
    Jin Yu
    Uday S Evani
    Andrew R Jackson
    Sameer Paithankar
    Cristian Coarfa
    Aleksandar Milosavljevic
    Richard A Gibbs
    Fuli Yu
    BMC Bioinformatics, 13
  • [35] Comparative Benchmarking Analysis of Next-Generation Space Processors
    Gretok, Evan W.
    Kain, Evan T.
    George, Alan D.
    2019 IEEE AEROSPACE CONFERENCE, 2019,
  • [36] Validation and assessment of variant calling pipelines for next-generation sequencing
    Pirooznia, Mehdi
    Kramer, Melissa
    Parla, Jennifer
    Goes, Fernando S.
    Potash, James B.
    McCombie, W. Richard
    Zandi, Peter P.
    HUMAN GENOMICS, 2014, 8 : 14
  • [37] Validation and assessment of variant calling pipelines for next-generation sequencing
    Mehdi Pirooznia
    Melissa Kramer
    Jennifer Parla
    Fernando S Goes
    James B Potash
    W Richard McCombie
    Peter P Zandi
    Human Genomics, 8
  • [38] Acquired resistance to third-generation EGFR-TKIs and emerging next-generation EGFR inhibitors
    Du, Xiaojing
    Yang, Biwei
    An, Quanlin
    Assaraf, Yehuda G.
    Cao, Xin
    Xia, Jinglin
    INNOVATION, 2021, 2 (02):
  • [39] Pathway analysis with next-generation sequencing data
    Jinying Zhao
    Yun Zhu
    Eric Boerwinkle
    Momiao Xiong
    European Journal of Human Genetics, 2015, 23 : 507 - 515
  • [40] Chimerism analysis using next-generation sequencing
    Iozzi, Sara
    Ciappi, Dario
    Palchetti, Simona
    Ricci, Ugo
    Rombola, Giovanni
    Pelo, Elisabetta
    HLA, 2023, 101 (04) : 391 - 391