Large-scale contamination of microbial isolate genomes by Illumina PhiX control

被引:151
|
作者
Mukherjee, Supratim [1 ]
Huntemann, Marcel [1 ]
Ivanova, Natalia [1 ,2 ]
Kyrpides, Nikos C. [1 ]
Pati, Amrita [1 ]
机构
[1] DOE Joint Genome Inst, Walnut Creek, CA USA
[2] King Abdulaziz Univ, Jeddah 21413, Saudi Arabia
来源
关键词
Next-generation sequencing; PhiX; Contamination; Comparative genomics; ANNOTATION; PHYLOGENY; BACTERIA; SEQUENCE; PROPOSAL; SYSTEM;
D O I
10.1186/1944-3277-10-18
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
With the rapid growth and development of sequencing technologies, genomes have become the new go-to for exploring solutions to some of the world's biggest challenges such as searching for alternative energy sources and exploration of genomic dark matter. However, progress in sequencing has been accompanied by its share of errors that can occur during template or library preparation, sequencing, imaging or data analysis. In this study we screened over 18,000 publicly available microbial isolate genome sequences in the Integrated Microbial Genomes database and identified more than 1000 genomes that are contaminated with PhiX, a control frequently used during Illumina sequencing runs. Approximately 10% of these genomes have been published in literature and 129 contaminated genomes were sequenced under the Human Microbiome Project. Raw sequence reads are prone to contamination from various sources and are usually eliminated during downstream quality control steps. Detection of PhiX contaminated genomes indicates a lapse in either the application or effectiveness of proper quality control measures. The presence of PhiX contamination in several publicly available isolate genomes can result in additional errors when such data are used in comparative genomics analyses. Such contamination of public databases have far-reaching consequences in the form of erroneous data interpretation and analyses, and necessitates better measures to proofread raw sequences before releasing them to the broader scientific community.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Large-scale contamination of microbial isolate genomes by Illumina PhiX control
    Supratim Mukherjee
    Marcel Huntemann
    Natalia Ivanova
    Nikos C Kyrpides
    Amrita Pati
    [J]. Standards in Genomic Sciences, 10
  • [2] Control of microbial contamination for large-scale photoautotrophic micropropagation
    Kubota, C
    Tadokoro, N
    [J]. IN VITRO CELLULAR & DEVELOPMENTAL BIOLOGY-PLANT, 1999, 35 (04) : 296 - 298
  • [3] Control of microbial contamination for large-scale photoautotrophic micropropagation
    Chieri Kubota
    Niki Tadokoro
    [J]. In Vitro Cellular & Developmental Biology - Plant, 1999, 35 (4) : 296 - 298
  • [4] MICROBIAL AIR CONTAMINATION IN LARGE-SCALE FARMS
    FISER, A
    [J]. ACTA VETERINARIA BRNO, 1976, 45 (04) : 235 - 244
  • [5] Large-scale comparative analysis of microbial pan-genomes using PanOCT
    Inman, Jason M.
    Sutton, Granger G.
    Beck, Erin
    Brinkac, Lauren M.
    Clarke, Thomas H.
    Fouts, Derrick E.
    [J]. BIOINFORMATICS, 2019, 35 (06) : 1049 - 1050
  • [6] Developing Bioprospecting Strategies for Bioplastics Through the Large-Scale Mining of Microbial Genomes
    Vuong, Paton
    Lim, Daniel J.
    Murphy, Daniel V.
    Wise, Michael J.
    Whiteley, Andrew S.
    Kaur, Parwinder
    [J]. FRONTIERS IN MICROBIOLOGY, 2021, 12
  • [7] Large-scale sequencing of plant genomes
    Rounsley, S
    Lin, XY
    Ketchum, KA
    [J]. CURRENT OPINION IN PLANT BIOLOGY, 1998, 1 (02) : 136 - 141
  • [8] Flood management: Prediction of microbial contamination in large-scale floods in urban environments
    Taylor, Jonathon
    Lai, Ka Man
    Davies, Mike
    Clifton, David
    Ridley, Ian
    Biddulph, Phillip
    [J]. ENVIRONMENT INTERNATIONAL, 2011, 37 (05) : 1019 - 1029
  • [9] Assembling genomes on large-scale parallel computers
    Kalyanaraman, A.
    Emrich, S. J.
    Schnable, P. S.
    Aluru, S.
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2007, 67 (12) : 1240 - 1255
  • [10] Universality in large-scale structure of complete genomes
    Li-Ching Hsieh
    Ta-Yuan Chen
    Chang-Heng Chang
    Wen-Lang Fan
    Hoong-Chien Lee
    [J]. Genome Biology, 5 (3):