Cont-ID: detection of sample cross-contamination in viral metagenomic data

被引:2
|
作者
Rollin, Johan [1 ,2 ]
Rong, Wei [1 ]
Massart, Sebastien [1 ]
机构
[1] Univ Liege, Plant Pathol Lab, Gembloux Agrobio Tech, B-5030 Gembloux, Belgium
[2] DNAVision, B-6041 Gosselies, Belgium
关键词
Bioinformatic; Virus; Detection; Sequencing; Contamination; Metagenomic; PIPELINE; IMPACT;
D O I
10.1186/s12915-023-01708-w
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
BackgroundHigh-throughput sequencing (HTS) technologies completed by the bioinformatic analysis of the generated data are becoming an important detection technique for virus diagnostics. They have the potential to replace or complement the current PCR-based methods thanks to their improved inclusivity and analytical sensitivity, as well as their overall good repeatability and reproducibility. Cross-contamination is a well-known phenomenon in molecular diagnostics and corresponds to the exchange of genetic material between samples. Cross-contamination management was a key drawback during the development of PCR-based detection and is now adequately monitored in routine diagnostics. HTS technologies are facing similar difficulties due to their very high analytical sensitivity. As a single viral read could be detected in millions of sequencing reads, it is mandatory to fix a detection threshold that will be informed by estimated cross-contamination. Cross-contamination monitoring should therefore be a priority when detecting viruses by HTS technologies.ResultsWe present Cont-ID, a bioinformatic tool designed to check for cross-contamination by analysing the relative abundance of virus sequencing reads identified in sequence metagenomic datasets and their duplication between samples. It can be applied when the samples in a sequencing batch have been processed in parallel in the laboratory and with at least one specific external control called Alien control. Using 273 real datasets, including 68 virus species from different hosts (fruit tree, plant, human) and several library preparation protocols (Ribodepleted total RNA, small RNA and double-stranded RNA), we demonstrated that Cont-ID classifies with high accuracy (91%) viral species detection into (true) infection or (cross) contamination. This classification raises confidence in the detection and facilitates the downstream interpretation and confirmation of the results by prioritising the virus detections that should be confirmed.ConclusionsCross-contamination between samples when detecting viruses using HTS (Illumina technology) can be monitored and highlighted by Cont-ID (provided an alien control is present). Cont-ID is based on a flexible methodology relying on the output of bioinformatics analyses of the sequencing reads and considering the contamination pattern specific to each batch of samples. The Cont-ID method is adaptable so that each laboratory can optimise it before its validation and routine use.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Retrospective detection of laboratory cross-contamination of Mycobacterium tuberculosis cultures with use of DNA fingerprint analysis
    Braden, CR
    Templeton, GL
    Stead, WW
    Bates, JH
    Cave, MD
    Valway, SE
    CLINICAL INFECTIOUS DISEASES, 1997, 24 (01) : 35 - 40
  • [32] An enzyme-linked immunosorbent assay (ELISA) for detection of egg cross-contamination in foods.
    Jeanniton, E
    Hefle, SL
    Taylor, SL
    JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY, 1997, 99 (01) : 593 - 593
  • [33] Cross-contamination and strong mitonuclear discordance in Empria sawflies (Hymenoptera, Tenthredinidae) in the light of phylogenomic data
    Prous, Marko
    Lee, Kyung Min
    Mutanen, Marko
    MOLECULAR PHYLOGENETICS AND EVOLUTION, 2020, 143
  • [34] Bacteriological safety assessment, hygienic habits and cross-contamination risks in a Nigerian urban sample of household kitchen environment
    Ejechi, Bernard O.
    Ochei, Ono P.
    ENVIRONMENTAL MONITORING AND ASSESSMENT, 2017, 189 (06)
  • [35] Quantification of Cross-Contamination among Indexed Samples in Multiplexed Next-Generation Sequencing Data
    Spencer, D.
    Duncavage, E. J.
    JOURNAL OF MOLECULAR DIAGNOSTICS, 2013, 15 (06): : 932 - 932
  • [36] Bacteriological safety assessment, hygienic habits and cross-contamination risks in a Nigerian urban sample of household kitchen environment
    Bernard O. Ejechi
    Ono P. Ochei
    Environmental Monitoring and Assessment, 2017, 189
  • [37] ViroMatch: A Computational Pipeline for the Detection of Viral Sequences from Complex Metagenomic Data
    Wylie, Todd N.
    Wylie, Kristine M.
    MICROBIOLOGY RESOURCE ANNOUNCEMENTS, 2021, 10 (09):
  • [38] Polyphonia: detecting inter-sample contamination in viral genomic sequencing data
    Krasilnikova, Lydia A.
    Tomkins-Tinch, Christopher H.
    Gayton, Alton C.
    Schaffner, Stephen F.
    Dobbins, Sabrina T.
    Gladden-Young, Adrianne
    Siddle, Katherine J.
    Park, Daniel J.
    Sabeti, Pardis C.
    BIOINFORMATICS, 2024, 40 (12)
  • [39] Cross-Contamination of Ignitable Liquid Residues on Wildfire Debris-Effects of Packaging and Storage on Detection and Characterization
    Boegelsack, Nadin
    Walker, James
    Sandau, Court D.
    McMartin, Dena W.
    Withey, Jonathan M.
    O'Sullivan, Gwen
    SEPARATIONS, 2024, 11 (02)
  • [40] Optimized Configuration of Fixed-Tip Robotic Liquid-Handling Stations for the Elimination of Biological Sample Cross-Contamination
    Frégeau, Chantal J.
    Yensen, Craig
    Elliott, Jim
    Fourney, Ron M.
    JALA - Journal of the Association for Laboratory Automation, 2007, 12 (06): : 339 - 354