Evaluation of somatic copy number estimation tools for whole-exome sequencing data

被引:33
|
作者
Nam, Jae-Yong
Kim, Nayoung K. D. [1 ]
Kim, Sang Cheol [1 ]
Joung, Je-Gun [1 ]
Xi, Ruibin [3 ]
Lee, Semin [2 ]
Park, Peter J. [2 ]
Park, Woong-Yang [1 ,4 ]
机构
[1] Samsung Med Ctr, Samsung Genome Inst, Seoul 135710, South Korea
[2] Harvard Univ, Sch Med, Ctr Biomed Informat, Boston, MA 02115 USA
[3] Peking Univ, Ctr Stat Sci, Beijing, Peoples R China
[4] Sungkyunkwan Univ, Sch Med, Dept Mol Cell Biol, Seoul, South Korea
关键词
CNV prediction; somatic alterations; the cancer genome atlas; CNV algorithms; DISCOVERY; VARIANTS; CANCER; ALGORITHMS; LANDSCAPE; DELETIONS;
D O I
10.1093/bib/bbv055
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Whole-exome sequencing (WES) has become a standard method for detecting genetic variants in human diseases. Although the primary use of WES data has been the identification of single nucleotide variations and indels, these data also offer a possibility of detecting copy number variations (CNVs) at high resolution. However, WES data have uneven read coverage along the genome owing to the target capture step, and the development of a robust WES-based CNV tool is challenging. Here, we evaluate six WES somatic CNV detection tools: ADTEx, CONTRA, Control-FREEC, EXCAVATOR, ExomeCNV and Varscan2. Using WES data from 50 kidney chromophobe, 50 bladder urothelial carcinoma, and 50 stomach adenocarcinoma patients from The Cancer Genome Atlas, we compared the CNV calls from the six tools with a reference CNV set that was identified by both single nucleotide polymorphism array 6.0 and whole-genome sequencing data. We found that these algorithms gave highly variable results: visual inspection reveals significant differences between the WES-based segmentation profiles and the reference profile, as well as among the WES-based profiles. Using a 50% overlap criterion, 13-77% of WES CNV calls were covered by CNVs from the reference set, up to 21% of the copy gains were called as losses or vice versa, and dramatic differences in CNV sizes and CNV numbers were observed. Overall, ADTEx and EXCAVATOR had the best performance with relatively high precision and sensitivity. We suggest that the current algorithms for somatic CNV detection from WES data are limited in their performance and that more robust algorithms are needed.
引用
收藏
页码:185 / 192
页数:8
相关论文
共 50 条
  • [21] CEQer: A Graphical Tool for Copy Number and Allelic Imbalance Detection from Whole-Exome Sequencing Data
    Piazza, Rocco
    Magistroni, Vera
    Pirola, Alessandra
    Redaelli, Sara
    Spinelli, Roberta
    Redaelli, Serena
    Galbiati, Marta
    Valletta, Simona
    Giudici, Giovanni
    Cazzaniga, Giovanni
    Gambacorti-Passerini, Carlo
    PLOS ONE, 2013, 8 (10):
  • [22] DeAnnCNV: a tool for online detection and annotation of copy number variations from whole-exome sequencing data
    Zhang, Yuanwei
    Yu, Zhenhua
    Ban, Rongjun
    Zhang, Huan
    Iqbal, Furhan
    Zhao, Aiwu
    Li, Ao
    Shi, Qinghua
    NUCLEIC ACIDS RESEARCH, 2015, 43 (W1) : W289 - W294
  • [23] Copy number alterations detected by whole-exome and whole-genome sequencing of esophageal adenocarcinoma
    Xiaoyu Wang
    Xiaohong Li
    Yichen Cheng
    Xin Sun
    Xibin Sun
    Steve Self
    Charles Kooperberg
    James Y. Dai
    Human Genomics, 9
  • [24] Copy number alterations detected by whole-exome and whole-genome sequencing of esophageal adenocarcinoma
    Wang, Xiaoyu
    Li, Xiaohong
    Cheng, Yichen
    Sun, Xin
    Sun, Xibin
    Self, Steve
    Kooperberg, Charles
    Dai, James Y.
    HUMAN GENOMICS, 2015, 9
  • [25] ReCapSeg: Validation of somatic copy number alterations for CLIA whole exome sequencing
    Lichtenstein, Lee
    Woolf, Betty
    MacBeth, Alyssa
    Birsoy, Ozge
    Lennon, Niall
    CANCER RESEARCH, 2016, 76
  • [26] Evaluation of Copy Number Variation (CNV) detection methods in whole exome sequencing data
    Zhang, Peng
    Ling, Hua
    Pugh, Elizabeth
    Hetrick, Kurt
    Witmer, Dane
    Sobreira, Nara
    Valle, David
    Doheny, Kimberly
    GENETIC EPIDEMIOLOGY, 2015, 39 (07) : 597 - 597
  • [27] Whole-exome sequencing demonstrates recurrent somatic copy number alterations and sporadic mutations in specialized stromal tumors of the prostate
    Pan, Chin-Chen
    Tsuzuki, Toyonori
    Morii, Eiichi
    Fushimi, Hiroaki
    Chen, Paul Chih-Hsueh
    Epstein, Jonathan, I
    HUMAN PATHOLOGY, 2018, 76 : 9 - 16
  • [28] A Comparison of Tools for Copy-Number Variation Detection in Germline Whole Exome and Whole Genome Sequencing Data
    Gabrielaite, Migle
    Torp, Mathias Husted
    Rasmussen, Malthe Sebro
    Andreu-Sanchez, Sergio
    Vieira, Filipe Garrett
    Pedersen, Christina Bligaard
    Kinalis, Savvas
    Madsen, Majbritt Busk
    Kodama, Miyako
    Demircan, Guel Sude
    Simonyan, Arman
    Yde, Christina Westmose
    Olsen, Lars Ronn
    Marvig, Rasmus L.
    ostrup, Olga
    Rossing, Maria
    Nielsen, Finn Cilius
    Winther, Ole
    Bagger, Frederik Otzen
    CANCERS, 2021, 13 (24)
  • [29] ALLELE-SPECIFIC COPY NUMBER ESTIMATION BY WHOLE EXOME SEQUENCING
    Chen, Hao
    Jiang, Yuchao
    Maxwell, Kara N.
    Nathanson, Katherine L.
    Zhang, Nancy
    ANNALS OF APPLIED STATISTICS, 2017, 11 (02): : 1169 - 1192
  • [30] Estimation of Copy Number Alterations from Exome Sequencing Data
    Valdes-Mas, Rafael
    Bea, Silvia
    Puente, Diana A.
    Lopez-Otin, Carlos
    Puente, Xose S.
    PLOS ONE, 2012, 7 (12):