Optimized detection of insertions/deletions (INDELs) in whole-exome sequencing data

被引:22
|
作者
Kim, Bo-Young [1 ]
Park, Jung Hoon [2 ]
Jo, Hye-Yeong [1 ]
Koo, Soo Kyung [1 ]
Park, Mi-Hyun [1 ]
机构
[1] Korea Natl Inst Hlth, Ctr Biomed Sci, Div Intractable Dis, Chungcheongbuk Do, South Korea
[2] Macrogen Inc, Seoul, South Korea
来源
PLOS ONE | 2017年 / 12卷 / 08期
关键词
GENERATION; GENOME; TOOL;
D O I
10.1371/journal.pone.0182272
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Insertion and deletion (INDEL) mutations, the most common type of structural variance, are associated with several human diseases. The detection of INDELs through next-generation sequencing (NGS) is becoming more common due to the decrease in costs, the increase in efficiency, and sensitivity improvements demonstrated by the various sequencing platforms and analytical tools. However, there are still many errors associated with INDEL variant calling, and distinguishing INDELs from errors in NGS remains challenging. To evaluate INDEL calling from whole-exome sequencing (WES) data, we performed Sanger sequencing for all INDELs called from the several calling algorithm. We compared the performance of the four algorithms (i.e. GATK, SAMtools, Dindel, and Freebayes) for INDEL detection from the same sample. We examined the sensitivity and PPV of GATK (90.2 and 89.5%, respectively), SAMtools (75.3 and 94.4%, respectively), Dindel (90.1 and 88.6%, respectively), and Freebayes (80.1 and 94.4%, respectively). GATK had the highest sensitivity. Furthermore, we identified INDELs with high PPV (4 algorithms intersection: 98.7%, 3 algorithms intersection: 97.6%, and GATK and SAMtools intersection INDELs: 97.6%). We presented two key sources of difficulties in accurate INDEL detection: 1) the presence of repeat, and 2) heterozygous INDELs. Herein we could suggest the accessible algorithms that selectively reduce error rates and thereby facilitate INDEL detection. Our study may also serve as a basis for understanding the accuracy and completeness of INDEL detection.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Detection of homozygous and hemizygous complete or partial exon deletions by whole-exome sequencing
    Bigio, Benedetta
    Seeleuthner, Yoann
    Kerner, Gaspard
    Migaud, Melanie
    Rosain, Jeremie
    Boisson, Bertrand
    Nasca, Carla
    Puel, Anne
    Bustamante, Jacinta
    Casanova, Jean-Laurent
    Abel, Laurent
    Cobat, Aurelie
    [J]. NAR GENOMICS AND BIOINFORMATICS, 2021, 3 (02) : 1 - 10
  • [2] Fusion Gene Detection Using Whole-Exome Sequencing Data in Cancer Patients
    Deng, Wenjiang
    Murugan, Sarath
    Lindberg, Johan
    Chellappa, Venkatesh
    Shen, Xia
    Pawitan, Yudi
    Vu, Trung Nghia
    [J]. FRONTIERS IN GENETICS, 2022, 13
  • [3] Optimized pipeline of MuTect and GATK tools to improve the detection of somatic single nucleotide polymorphisms in whole-exome sequencing data
    Ítalo Faria do Valle
    Enrico Giampieri
    Giorgia Simonetti
    Antonella Padella
    Marco Manfrini
    Anna Ferrari
    Cristina Papayannidis
    Isabella Zironi
    Marianna Garonzi
    Simona Bernardi
    Massimo Delledonne
    Giovanni Martinelli
    Daniel Remondini
    Gastone Castellani
    [J]. BMC Bioinformatics, 17
  • [4] Optimized pipeline of MuTect and GATK tools to improve the detection of somatic single nucleotide polymorphisms in whole-exome sequencing data
    do Valle, Italo Faria
    Giampieri, Enrico
    Simonetti, Giorgia
    Padella, Antonella
    Manfrini, Marco
    Ferrari, Anna
    Papayannidis, Cristina
    Zironi, Isabella
    Garonzi, Marianna
    Bernardi, Simona
    Delledonne, Massimo
    Martinelli, Giovanni
    Remondini, Daniel
    Castellani, Gastone
    [J]. BMC BIOINFORMATICS, 2016, 17
  • [5] ExomeAI: detection of recurrent allelic imbalance in tumors using whole-exome sequencing data
    Nadaf, Javad
    Majewski, Jacek
    Fahiminiya, Somayyeh
    [J]. BIOINFORMATICS, 2015, 31 (03) : 429 - 431
  • [6] An Evaluation of Copy Number Variation Detection Tools from Whole-Exome Sequencing Data
    Tan, Renjie
    Wang, Yadong
    Kleinstein, Sarah E.
    Liu, Yongzhuang
    Zhu, Xiaolin
    Guo, Hongzhe
    Jiang, Qinghua
    Allen, Andrew S.
    Zhu, Mingfu
    [J]. HUMAN MUTATION, 2014, 35 (07) : 899 - 907
  • [7] HPexome: An automated tool for processing whole-exome sequencing data
    Cendes, Lucas L.
    de Souza, Welliton
    Lopes-Cendes, Iscia
    Carvalho, Benilton S.
    [J]. SOFTWAREX, 2020, 11
  • [8] Can whole-exome sequencing data be used for linkage analysis?
    Steven Gazal
    Simon Gosset
    Edgard Verdura
    Françoise Bergametti
    Stéphanie Guey
    Marie-Claude Babron
    Elisabeth Tournier-Lasserve
    [J]. European Journal of Human Genetics, 2016, 24 : 581 - 586
  • [9] Can whole-exome sequencing data be used for linkage analysis?
    Gazal, Steven
    Gosset, Simon
    Verdura, Edgard
    Bergametti, Francoise
    Guey, Stephanie
    Babron, Marie-Claude
    Tournier-Lasserve, Elisabeth
    [J]. EUROPEAN JOURNAL OF HUMAN GENETICS, 2016, 24 (04) : 581 - 586
  • [10] Whole-Exome Sequencing Study of Trichotillomania
    Olfson, Emily
    Bloch, Michael
    Fernandez, Thomas
    [J]. BIOLOGICAL PSYCHIATRY, 2019, 85 (10) : S223 - S223