From Wet-Lab to Variations: Concordance and Speed of Bioinformatics Pipelines for Whole Genome and Whole Exome Sequencing

被引:39
|
作者
Laurie, Steve [1 ,2 ]
Fernandez-Callejo, Marcos [1 ,2 ]
Marco-Sola, Santiago [1 ,2 ]
Trotta, Jean-Remi [1 ,2 ]
Camps, Jordi [1 ,2 ]
Chacon, Alejandro [3 ]
Espinosa, Antonio [3 ]
Gut, Marta [1 ,2 ]
Gut, Ivo [1 ,2 ]
Heath, Simon [1 ,2 ]
Beltran, Sergi [1 ,2 ]
机构
[1] BIST, Ctr Genom Regulat CRG, CNAG CRG, Baldiri & Reixac 4, Barcelona 08028, Spain
[2] UPF, Barcelona, Spain
[3] Univ Autonoma Barcelona, Bellaterra, Spain
关键词
whole genome sequencing; whole exome sequencing; NGS; NA12878; alignment; variant calling; bioinformatics; computing speed; benchmark; DISCOVERY; GENERATION; FRAMEWORK; SNP;
D O I
10.1002/humu.23114
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
As whole genome sequencing becomes cheaper and faster, it will progressively substitute targeted next-generation sequencing as standard practice in research and diagnostics. However, computing cost-performance ratio is not advancing at an equivalent rate. Therefore, it is essential to evaluate the robustness of the variant detection process taking into account the computing resources required. We have benchmarked six combinations of state-of-the-art read aligners (BWA-MEM and GEM3) and variant callers (FreeBayes, GATK Haplotype-Caller, SAMtools) on whole genome and whole exome sequencing data from the NA12878 human sample. Results have been compared between them and against the NIST Genome in a Bottle (GIAB) variants reference dataset. We report differences in speed of up to 20 times in some steps of the process and have observed that SNV, and to a lesser extent InDel, detection is highly consistent in 70% of the genome. SNV, and especially InDel, detection is less reliable in 20% of the genome, and almost unfeasible in the remaining 10%. These findings will aid in choosing the appropriate tools bearing in mind objectives, workload, and computing infrastructure available. Published 2016 Wiley Periodicals, Inc.
引用
收藏
页码:1263 / 1271
页数:9
相关论文
共 50 条
  • [31] Whole-exome sequencing and whole genome re-sequencing for prenatal diagnosis of achondroplasia
    Zhao, Rong
    Ruan, Yan
    Wang, Xin
    INTERNATIONAL JOURNAL OF CLINICAL AND EXPERIMENTAL MEDICINE, 2015, 8 (10): : 19241 - 19249
  • [32] Assessing the impact of sequencing platforms and analytical pipelines on whole-exome sequencing
    Sun, Yanping
    Zhao, Xiaochao
    Fan, Xue
    Wang, Miao
    Li, Chaoyang
    Liu, Yongfeng
    Wu, Ping
    Yan, Qin
    Sun, Lei
    FRONTIERS IN GENETICS, 2024, 15
  • [33] Whole Exome Sequencing and Whole Genome Sequencing for Investigation of the Genetic Basis of Obesity: A Rapid Review
    Dehghan, Roghayeh
    Salehi, Mansoor
    BAHRAIN MEDICAL BULLETIN, 2023, 45 (02) : 1492 - 1497
  • [34] Diagnostic value of exome and whole genome sequencing in craniosynostosis
    Miller, Kerry A.
    Twigg, Stephen R. F.
    McGowan, Simon J.
    Phipps, Julie M.
    Fenwick, Aimee L.
    Johnson, David
    Wall, Steven A.
    Noons, Peter
    Rees, Katie E. M.
    Tidey, Elizabeth A.
    Craft, Judith
    Taylor, John
    Taylor, Jenny C.
    Goos, Jacqueline A. C.
    Swagemakers, Sigrid M. A.
    Mathijssen, Irene M. J.
    van der Spek, Peter J.
    Lord, Helen
    Lester, Tracy
    Abid, Noina
    Cilliers, Deirdre
    Hurst, Jane A.
    Morton, Jenny E. V.
    Sweeney, Elizabeth
    Weber, Astrid
    Wilson, Louise C.
    Wilkie, Andrew O. M.
    JOURNAL OF MEDICAL GENETICS, 2017, 54 (04) : 260 - 268
  • [35] Use of Metaphors About Exome and Whole Genome Sequencing
    Nelson, Sarah C.
    Crouch, Julia M.
    Bamshad, Michael J.
    Tabor, Holly K.
    Yu, Joon-Ho
    AMERICAN JOURNAL OF MEDICAL GENETICS PART A, 2016, 170 (05) : 1127 - 1133
  • [36] Whole genome/exome sequencing in mood and psychotic disorders
    Kato, Tadafumi
    PSYCHIATRY AND CLINICAL NEUROSCIENCES, 2015, 69 (02) : 65 - 76
  • [37] Opportunities and challenges of whole-genome and -exome sequencing
    Britt-Sabina Petersen
    Broder Fredrich
    Marc P. Hoeppner
    David Ellinghaus
    Andre Franke
    BMC Genetics, 18
  • [38] Opportunities and challenges of whole-genome and -exome sequencing
    Petersen, Britt-Sabina
    Fredrich, Broder
    Hoeppner, Marc P.
    Ellinghaus, David
    Franke, Andre
    BMC GENETICS, 2017, 18
  • [39] Evaluation of Secondary Findings from Whole Genome and Whole Exome Sequencing in Inherited Retinal Disease Patients
    Mehta, Setu
    Aguirre, Bani
    Esposito, Edward
    Pan, Annabelle
    Singh, Mandeep
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2024, 65 (07)
  • [40] FROM WHOLE GENE SEQUENCING TO WHOLE GENOME SEQUENCING IN HUMANS
    Cereb, Nezih
    Kim, HwaRan
    Ryu, Jaejun
    Kim, Eunsil
    Yang, Soo Young
    HLA, 2017, 89 (06) : 381 - 381