A comprehensive performance evaluation, comparison, and integration of computational methods for detecting and estimating cross-contamination of human samples in cancer next-generation sequencing analysis

被引:0
|
作者
Chen, Huijuan [1 ,2 ,3 ]
Wang, Bing [1 ]
Cai, Lili [1 ]
Yang, Xiaotian [1 ]
Hu, Yali [1 ]
Zhang, Yiran [1 ]
Leng, Xue [1 ]
Liu, Wen [1 ]
Fan, Dongjie [4 ]
Niu, Beifang [1 ,2 ,5 ]
Zhou, Qiming [1 ,5 ]
机构
[1] Beijing ChosenMed Clin Lab Co Ltd, Jinghai Ind Pk,Econ & Technol Dev Area, Beijing 100176, Peoples R China
[2] Chinese Acad Sci, Comp Network Informat Ctr, Beijing 100190, Peoples R China
[3] WillingMed Technol Beijing Co Ltd, Beijing 100176, Peoples R China
[4] Chinese Ctr Dis Control & Prevent, Natl Inst Communicable Dis Control & Prevent, Natl Key Lab Intelligent Tracking & Forecasting In, Beijing 102206, Peoples R China
[5] ChosenMed Technol Zhejiang Co Ltd, Hangzhou 311103, Zhejiang, Peoples R China
关键词
Cross; -contamination; Next -generation sequencing; Bioinformatics; Computational methods;
D O I
10.1016/j.jbi.2024.104625
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Cross-sample contamination is one of the major issues in next-generation sequencing (NGS)-based molecular assays. This type of contamination, even at very low levels, can significantly impact the results of an analysis, especially in the detection of somatic alterations in tumor samples. Several contamination identification tools have been developed and implemented as a crucial quality-control step in the routine NGS bioinformatic pipeline. However, no study has been published to comprehensively and systematically investigate, evaluate, and compare these computational methods in the cancer NGS analysis. In this study, we comprehensively investigated nine state-of-the-art computational methods for detecting cross-sample contamination. To explore their application in cancer NGS analysis, we further compared the performance of five representative tools by qualitative and quantitative analyses using in silico and simulated experimental NGS data. The results showed that Conpair achieved the best performance for identifying contamination and predicting the level of contamination in solid tumors NGS analysis. Moreover, based on Conpair, we developed a Python script, Contamination Source Predictor (ConSPr), to identify the source of contamination. We anticipate that this comprehensive survey and the proposed tool for predicting the source of contamination will assist researchers in selecting appropriate crosscontamination detection tools in cancer NGS analysis and inspire the development of computational methods for detecting sample cross-contamination and identifying its source in the future.
引用
收藏
页数:9
相关论文
共 17 条
  • [1] ContEst: estimating cross-contamination of human samples in next-generation sequencing data
    Cibulskis, Kristian
    McKenna, Aaron
    Fennell, Tim
    Banks, Eric
    DePristo, Mark
    Getz, Gad
    BIOINFORMATICS, 2011, 27 (18) : 2601 - 2602
  • [2] Quantification of Cross-Contamination among Indexed Samples in Multiplexed Next-Generation Sequencing Data
    Spencer, D.
    Duncavage, E. J.
    JOURNAL OF MOLECULAR DIAGNOSTICS, 2013, 15 (06): : 932 - 932
  • [3] A comparison of next-generation sequencing analysis methods for cancer xenograft samples
    Dai, Wentao
    Liu, Jixiang
    Li, Quanxue
    Liu, Wei
    Li, Yi-Xue
    Li, Yuan-Yuan
    JOURNAL OF GENETICS AND GENOMICS, 2018, 45 (07) : 345 - 350
  • [4] A comparison of next-generation sequencing analysis methods for cancer xenograft samples
    Wentao Dai
    Jixiang Liu
    Quanxue Li
    Wei Liu
    Yi-Xue Li
    Yuan-Yuan Li
    JournalofGeneticsandGenomics, 2018, 45 (07) : 345 - 350
  • [5] An Effective Strategy to Eliminate Inherent Cross-Contamination in mtDNA Next-Generation Sequencing of Multiple Samples
    Yin, Chun
    Liu, Yang
    Guo, Xu
    Li, Deyang
    Fang, Wan
    Yang, Jin
    Zhou, Feng
    Niu, Wancheng
    Jia, Yongfeng
    Yang, Hushan
    Xing, Jinliang
    JOURNAL OF MOLECULAR DIAGNOSTICS, 2019, 21 (04): : 593 - 601
  • [6] ART-DeCo: easy tool for detection and characterization of cross-contamination of DNA samples in diagnostic next-generation sequencing analysis
    Alice Fiévet
    Virginie Bernard
    Henrique Tenreiro
    Catherine Dehainault
    Elodie Girard
    Vivien Deshaies
    Philippe Hupe
    Olivier Delattre
    Marc-Henri Stern
    Dominique Stoppa-Lyonnet
    Lisa Golmard
    Claude Houdayer
    European Journal of Human Genetics, 2019, 27 : 792 - 800
  • [7] ART-DeCo: easy tool for detection and characterization of cross-contamination of DNA samples in diagnostic next-generation sequencing analysis
    Fievet, Alice
    Bernard, Virginie
    Tenreiro, Henrique
    Dehainault, Catherine
    Girard, Elodie
    Deshaies, Vivien
    Hupe, Philippe
    Delattre, Olivier
    Stern, Marc-Henri
    Stoppa-Lyonnet, Dominique
    Golmard, Lisa
    Houdayer, Claude
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2019, 27 (05) : 792 - 800
  • [8] Monitoring Germline SNPs to Control for Sample Cross-Contamination in the Ion AmpliSeq Cancer Hotspot Panel Next-Generation Sequencing Assay
    Kenny, P. A.
    JOURNAL OF MOLECULAR DIAGNOSTICS, 2017, 19 (06): : 1000 - 1000
  • [9] Comparison of KRAS mutation analysis of colorectal cancer samples by standard testing and next-generation sequencing
    Kothari, Nishi
    Schell, Michael J.
    Teer, Jamie K.
    Yeatman, Timothy
    Shibata, David
    Kim, Richard
    JOURNAL OF CLINICAL PATHOLOGY, 2014, 67 (09) : 764 - 767
  • [10] Optimizing Workflows and Processing of Cytologic Samples for Comprehensive Analysis by Next-Generation Sequencing Memorial Sloan Kettering Cancer Center Experience
    Tian, Shaozhou Ken
    Killian, J. Keith
    Rekhtman, Natasha
    Benayed, Ryma
    Middha, Sumit
    Ladanyi, Marc
    Lin, Oscar
    Arcila, Maria E.
    ARCHIVES OF PATHOLOGY & LABORATORY MEDICINE, 2016, 140 (11) : 1200 - 1205