From Wet-Lab to Variations: Concordance and Speed of Bioinformatics Pipelines for Whole Genome and Whole Exome Sequencing

被引:39
|
作者
Laurie, Steve [1 ,2 ]
Fernandez-Callejo, Marcos [1 ,2 ]
Marco-Sola, Santiago [1 ,2 ]
Trotta, Jean-Remi [1 ,2 ]
Camps, Jordi [1 ,2 ]
Chacon, Alejandro [3 ]
Espinosa, Antonio [3 ]
Gut, Marta [1 ,2 ]
Gut, Ivo [1 ,2 ]
Heath, Simon [1 ,2 ]
Beltran, Sergi [1 ,2 ]
机构
[1] BIST, Ctr Genom Regulat CRG, CNAG CRG, Baldiri & Reixac 4, Barcelona 08028, Spain
[2] UPF, Barcelona, Spain
[3] Univ Autonoma Barcelona, Bellaterra, Spain
关键词
whole genome sequencing; whole exome sequencing; NGS; NA12878; alignment; variant calling; bioinformatics; computing speed; benchmark; DISCOVERY; GENERATION; FRAMEWORK; SNP;
D O I
10.1002/humu.23114
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
As whole genome sequencing becomes cheaper and faster, it will progressively substitute targeted next-generation sequencing as standard practice in research and diagnostics. However, computing cost-performance ratio is not advancing at an equivalent rate. Therefore, it is essential to evaluate the robustness of the variant detection process taking into account the computing resources required. We have benchmarked six combinations of state-of-the-art read aligners (BWA-MEM and GEM3) and variant callers (FreeBayes, GATK Haplotype-Caller, SAMtools) on whole genome and whole exome sequencing data from the NA12878 human sample. Results have been compared between them and against the NIST Genome in a Bottle (GIAB) variants reference dataset. We report differences in speed of up to 20 times in some steps of the process and have observed that SNV, and to a lesser extent InDel, detection is highly consistent in 70% of the genome. SNV, and especially InDel, detection is less reliable in 20% of the genome, and almost unfeasible in the remaining 10%. These findings will aid in choosing the appropriate tools bearing in mind objectives, workload, and computing infrastructure available. Published 2016 Wiley Periodicals, Inc.
引用
收藏
页码:1263 / 1271
页数:9
相关论文
共 50 条
  • [1] Whole exome and whole genome sequencing
    Bick, David
    Dimmock, David
    CURRENT OPINION IN PEDIATRICS, 2011, 23 (06) : 594 - 600
  • [2] Bioinformatics for wet-lab scientists: practical application in sequencing analysis
    Laub, Vera
    Devraj, Kavi
    Elias, Lena
    Schulte, Dorothea
    BMC GENOMICS, 2023, 24 (01)
  • [3] Bioinformatics for wet-lab scientists: practical application in sequencing analysis
    Vera Laub
    Kavi Devraj
    Lena Elias
    Dorothea Schulte
    BMC Genomics, 24
  • [4] When Whole Exome Sequencing fails: Lessons learned from Whole Genome Sequencing
    Eliyahu, A.
    Marek-Yagel, D.
    Pode-Shakked, B.
    Veber, A.
    Philosoph, A.
    Shalva, N.
    Ortal, B.
    Bar-Joseph, I.
    Nayshool, O.
    Ben-Zeev, B.
    Heimer, G.
    Staretz-Chacham, O.
    Oz-Levi, D.
    Pode-Shakked, N.
    Anikster, Y.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2020, 28 (SUPPL 1) : 605 - 605
  • [5] Comparison of genomic biomarkers identified by the whole exome, RNASeq and whole genome sequencing pipelines developed for the PDMR
    Chen, Li
    Patidar, Rajesh
    Das, Biswajit
    Karlovich, Chris
    Vilimas, Tomas
    Camalier, Corinne
    Datta, Vivekananda
    Jiwani, Shahanawaz
    Walsh, William
    Fliss, Palmer
    McDermott, Sean
    McCutcheon, Justine N.
    Peach, Amanda
    Ahalt-Gottholm, Michelle
    Bonomi, Carrie
    Dougherty, Kelly
    Carter, John
    Alcoser, Sergio Y.
    Chase, Tiffanie
    Divelbiss, Raymond
    Gibson, Marion
    Hedger, Kelly
    Mallow, Candace
    McGlynn, Chelsea
    Morris, Malorie
    Radzyminski, Marianne
    Stotler, Howard
    Stottlemyer, Jesse
    Trail, Debbie
    Evrard, Yvonne
    Hollingshead, Melinda G.
    Williams, Mickey
    Doroshow, James H.
    CANCER RESEARCH, 2019, 79 (13)
  • [6] Computational and Bioinformatics Frameworks for Next-Generation Whole Exome and Genome Sequencing
    Dolled-Filhart, Marisa P.
    Lee, Michael, Jr.
    Ou-yang, Chih-wen
    Haraksingh, Rajini Rani
    Lin, Jimmy Cheng-Ho
    SCIENTIFIC WORLD JOURNAL, 2013,
  • [7] Concomitant mitochondrial genome coverage in whole exome and whole genome sequencing
    Guo, Fen
    Voultsis, Evangeline
    Lesperance, Daniel
    Kaur, Supan
    Croft, Daniel
    Ma, Zeqiang
    Mathur, Abhinav
    Collins, Christin
    Hegde, Madhuri
    MOLECULAR GENETICS AND METABOLISM, 2021, 132 : S240 - S240
  • [8] Whole genome and whole exome sequencing of head and neck cancer
    Stransky, Nicolas
    Egloff, Ann Marie
    Tward, Aaron
    Auclair, Daniel
    Cibulskis, Kristian
    Lawrence, Michael
    Sivachenko, Andrey
    Berger, Michael
    Seethala, Raja
    Sougnez, Carrie
    Cortes, Maria
    Winckler, Wendy
    Gabriel, Stacey
    Getz, Gad
    Golub, Todd
    Grandis, Jennifer
    Garraway, Levi
    CANCER RESEARCH, 2011, 71
  • [9] Whole exome and whole genome sequencing for the genetic diagnosis of dystrophinopathies
    Selvatici, R.
    Fang, M.
    Falzarano, M.
    Gualandi, F.
    Delin, S.
    Bensemmane, S.
    Shatillo, A.
    Bello, L.
    Pegoraro, E.
    Ferlini, A.
    NEUROMUSCULAR DISORDERS, 2020, 30 : S142 - S142
  • [10] The Role of Whole Genome and Whole Exome Sequencing in Preventive Genomic Sequencing Programs
    Bertier, Gabrielle
    Zawati, Ma'n H.
    Joly, Yann
    AMERICAN JOURNAL OF BIOETHICS, 2015, 15 (07): : 22 - 24