Evaluation of somatic copy number estimation tools for whole-exome sequencing data

被引:33
|
作者
Nam, Jae-Yong
Kim, Nayoung K. D. [1 ]
Kim, Sang Cheol [1 ]
Joung, Je-Gun [1 ]
Xi, Ruibin [3 ]
Lee, Semin [2 ]
Park, Peter J. [2 ]
Park, Woong-Yang [1 ,4 ]
机构
[1] Samsung Med Ctr, Samsung Genome Inst, Seoul 135710, South Korea
[2] Harvard Univ, Sch Med, Ctr Biomed Informat, Boston, MA 02115 USA
[3] Peking Univ, Ctr Stat Sci, Beijing, Peoples R China
[4] Sungkyunkwan Univ, Sch Med, Dept Mol Cell Biol, Seoul, South Korea
关键词
CNV prediction; somatic alterations; the cancer genome atlas; CNV algorithms; DISCOVERY; VARIANTS; CANCER; ALGORITHMS; LANDSCAPE; DELETIONS;
D O I
10.1093/bib/bbv055
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Whole-exome sequencing (WES) has become a standard method for detecting genetic variants in human diseases. Although the primary use of WES data has been the identification of single nucleotide variations and indels, these data also offer a possibility of detecting copy number variations (CNVs) at high resolution. However, WES data have uneven read coverage along the genome owing to the target capture step, and the development of a robust WES-based CNV tool is challenging. Here, we evaluate six WES somatic CNV detection tools: ADTEx, CONTRA, Control-FREEC, EXCAVATOR, ExomeCNV and Varscan2. Using WES data from 50 kidney chromophobe, 50 bladder urothelial carcinoma, and 50 stomach adenocarcinoma patients from The Cancer Genome Atlas, we compared the CNV calls from the six tools with a reference CNV set that was identified by both single nucleotide polymorphism array 6.0 and whole-genome sequencing data. We found that these algorithms gave highly variable results: visual inspection reveals significant differences between the WES-based segmentation profiles and the reference profile, as well as among the WES-based profiles. Using a 50% overlap criterion, 13-77% of WES CNV calls were covered by CNVs from the reference set, up to 21% of the copy gains were called as losses or vice versa, and dramatic differences in CNV sizes and CNV numbers were observed. Overall, ADTEx and EXCAVATOR had the best performance with relatively high precision and sensitivity. We suggest that the current algorithms for somatic CNV detection from WES data are limited in their performance and that more robust algorithms are needed.
引用
收藏
页码:185 / 192
页数:8
相关论文
共 50 条
  • [41] Experimental Study of Somatic Variants of Osteosarcoma by Whole-Exome Sequencing
    Hou, Jingyu
    Liu, Guoqing
    Zhang, Peng
    Wang, Bangmin
    Yan, Qiang
    Wu, Pin
    Wang, Chuchu
    Yao, Weitao
    MEDICAL SCIENCE MONITOR, 2020, 26
  • [42] Allele-specific copy-number discovery from whole-genome and whole-exome sequencing
    Wang, WeiBo
    Wang, Wei
    Sun, Wei
    Crowley, James J.
    Szatkiewicz, Jin P.
    NUCLEIC ACIDS RESEARCH, 2015, 43 (14)
  • [43] Optimized pipeline of MuTect and GATK tools to improve the detection of somatic single nucleotide polymorphisms in whole-exome sequencing data
    do Valle, Italo Faria
    Giampieri, Enrico
    Simonetti, Giorgia
    Padella, Antonella
    Manfrini, Marco
    Ferrari, Anna
    Papayannidis, Cristina
    Zironi, Isabella
    Garonzi, Marianna
    Bernardi, Simona
    Delledonne, Massimo
    Martinelli, Giovanni
    Remondini, Daniel
    Castellani, Gastone
    BMC BIOINFORMATICS, 2016, 17
  • [44] Somatic copy number alterations by whole-exome sequencing implicates YWHAZ and PTK2 in castration-resistant prostate cancer
    Menon, Roopika
    Deng, Mario
    Rueenauver, Kerstin
    Queisser, Angela
    Pfeifer, Martin
    Offermann, Anne
    Boehm, Diana
    Vogel, Wenzel
    Scheble, Veit
    Fend, Falko
    Kristiansen, Glen
    Wernert, Nicolas
    Oberbeckmann, Nicole
    Biskup, Saskia
    Rubin, Mark A.
    Shaikhibrahim, Zaki
    Perner, Sven
    JOURNAL OF PATHOLOGY, 2013, 231 (04): : 505 - 516
  • [45] ECOLE: Learning to call copy number variants on whole exome sequencing data
    Mandiracioglu, Berk
    Ozden, Furkan
    Kaynar, Gun
    Yilmaz, Mehmet Alper
    Alkan, Can
    Cicek, A. Ercument
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [46] ECOLE: Learning to call copy number variants on whole exome sequencing data
    Berk Mandiracioglu
    Furkan Ozden
    Gun Kaynar
    Mehmet Alper Yilmaz
    Can Alkan
    A. Ercument Cicek
    Nature Communications, 15
  • [47] Exome sequencing and whole genome sequencing for the detection of copy number variation
    Hehir-Kwa, Jayne Y.
    Pfundt, Rolph
    Veltman, Joris A.
    EXPERT REVIEW OF MOLECULAR DIAGNOSTICS, 2015, 15 (08) : 1023 - 1032
  • [48] Evaluation of three read-depth based CNV detection tools using whole-exome sequencing data
    Yao, Ruen
    Zhang, Cheng
    Yu, Tingting
    Li, Niu
    Hu, Xuyun
    Wang, Xiumin
    Wang, Jian
    Shen, Yiping
    MOLECULAR CYTOGENETICS, 2017, 10
  • [49] Identification of Copy Number Variants Through Whole-Exome Sequencing in Autosomal Recessive Nonsyndromic Hearing Loss
    Bademci, Guney
    Diaz-Horta, Oscar
    Guo, Shengru
    Duman, Duygu
    Van Booven, Derek
    Foster, Joseph, II
    Cengiz, Filiz Basak
    Blanton, Susan
    Tekin, Mustafa
    GENETIC TESTING AND MOLECULAR BIOMARKERS, 2014, 18 (09) : 658 - 661
  • [50] ERDS-pe: a paired hidden Markov model for copy number variant detection from whole-exome sequencing data
    Tan, Renjie
    Wang, Jixuan
    Wu, Xiaoliang
    Wan, Guoqiang
    Wang, Rongjie
    Ma, Rui
    Han, Zhijie
    Zhou, Wenyang
    Jin, Shuilin
    Jiang, Qinghua
    Wang, Yadong
    2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 141 - 144