SNVer: a statistical tool for variant calling in analysis of pooled or individual next-generation sequencing data

被引:177
|
作者
Wei, Zhi [1 ]
Wang, Wei [1 ]
Hu, Pingzhao [2 ]
Lyon, Gholson J. [3 ]
Hakonarson, Hakon [3 ]
机构
[1] New Jersey Inst Technol, Dept Comp Sci, Princeton, NJ 08540 USA
[2] Hosp Sick Children, TCAG, Toronto, ON M5G 1L7, Canada
[3] Univ Penn, Childrens Hosp Philadelphia, Dept Pediat, Ctr Appl Genom, Philadelphia, PA 19104 USA
关键词
GENOME-WIDE ASSOCIATION; LARGE-SCALE ASSOCIATION; HIGH-THROUGHPUT; RARE VARIANTS; COMPLEX TRAITS; DNA; DISCOVERY; DISEASES; IDENTIFICATION; FRAMEWORK;
D O I
10.1093/nar/gkr599
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We develop a statistical tool SNVer for calling common and rare variants in analysis of pooled or individual next-generation sequencing (NGS) data. We formulate variant calling as a hypothesis testing problem and employ a binomial-binomial model to test the significance of observed allele frequency against sequencing error. SNVer reports one single overall P-value for evaluating the significance of a candidate locus being a variant based on which multiplicity control can be obtained. This is particularly desirable because tens of thousands loci are simultaneously examined in typical NGS experiments. Each user can choose the false-positive error rate threshold he or she considers appropriate, instead of just the dichotomous decisions of whether to 'accept or reject the candidates' provided by most existing methods. We use both simulated data and real data to demonstrate the superior performance of our program in comparison with existing methods. SNVer runs very fast and can complete testing 300 K loci within an hour. This excellent scalability makes it feasible for analysis of whole-exome sequencing data, or even whole-genome sequencing data using high performance computing cluster. SNVer is freely available at http://snver.sourceforge.net/.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] SNVerGUI: a desktop tool for variant analysis of next-generation sequencing data
    Wang, Wei
    Hu, Weicheng
    Hou, Fang
    Hu, Pingzhao
    Wei, Zhi
    [J]. JOURNAL OF MEDICAL GENETICS, 2012, 49 (12) : 753 - 755
  • [2] Validation and assessment of variant calling pipelines for next-generation sequencing
    Pirooznia, Mehdi
    Kramer, Melissa
    Parla, Jennifer
    Goes, Fernando S.
    Potash, James B.
    McCombie, W. Richard
    Zandi, Peter P.
    [J]. HUMAN GENOMICS, 2014, 8 : 14
  • [3] Validation and assessment of variant calling pipelines for next-generation sequencing
    Mehdi Pirooznia
    Melissa Kramer
    Jennifer Parla
    Fernando S Goes
    James B Potash
    W Richard McCombie
    Peter P Zandi
    [J]. Human Genomics, 8
  • [4] Evaluating Variant Calling Tools for Non-Matched Next-Generation Sequencing Data
    Sarah Sandmann
    Aniek O. de Graaf
    Mohsen Karimi
    Bert A. van der Reijden
    Eva Hellström-Lindberg
    Joop H. Jansen
    Martin Dugas
    [J]. Scientific Reports, 7
  • [5] Empirical Bayes single nucleotide variant-calling for next-generation sequencing data
    Karimnezhad, Ali
    Perkins, Theodore J.
    [J]. SCIENTIFIC REPORTS, 2024, 14 (01)
  • [6] A review of somatic single nucleotide variant calling algorithms for next-generation sequencing data
    Xu, Chang
    [J]. COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2018, 16 : 15 - 24
  • [7] Coval: Improving Alignment Quality and Variant Calling Accuracy for Next-Generation Sequencing Data
    Kosugi, Shunichi
    Natsume, Satoshi
    Yoshida, Kentaro
    MacLean, Daniel
    Cano, Liliana
    Kamoun, Sophien
    Terauchi, Ryohei
    [J]. PLOS ONE, 2013, 8 (10):
  • [8] Empirical Bayes single nucleotide variant-calling for next-generation sequencing data
    Ali Karimnezhad
    Theodore J. Perkins
    [J]. Scientific Reports, 14
  • [9] Evaluating Variant Calling Tools for Non-Matched Next-Generation Sequencing Data
    Sandmann, Sarah
    de Graaf, Aniek O.
    Karimi, Mohsen
    van der Reijden, Bert A.
    Hellstrom-Lindberg, Eva
    Jansen, Joop H.
    Dugas, Martin
    [J]. SCIENTIFIC REPORTS, 2017, 7
  • [10] NGS Variant Manager - The Ultimate Tool for Analysis of High Throughput Next-Generation Sequencing Data
    Oultache, A.
    Chenni, A. E.
    Hedvat, C. V.
    Nafa, K.
    [J]. JOURNAL OF MOLECULAR DIAGNOSTICS, 2013, 15 (06): : 933 - 933