Evaluation of somatic copy number estimation tools for whole-exome sequencing data

被引:33
|
作者
Nam, Jae-Yong
Kim, Nayoung K. D. [1 ]
Kim, Sang Cheol [1 ]
Joung, Je-Gun [1 ]
Xi, Ruibin [3 ]
Lee, Semin [2 ]
Park, Peter J. [2 ]
Park, Woong-Yang [1 ,4 ]
机构
[1] Samsung Med Ctr, Samsung Genome Inst, Seoul 135710, South Korea
[2] Harvard Univ, Sch Med, Ctr Biomed Informat, Boston, MA 02115 USA
[3] Peking Univ, Ctr Stat Sci, Beijing, Peoples R China
[4] Sungkyunkwan Univ, Sch Med, Dept Mol Cell Biol, Seoul, South Korea
关键词
CNV prediction; somatic alterations; the cancer genome atlas; CNV algorithms; DISCOVERY; VARIANTS; CANCER; ALGORITHMS; LANDSCAPE; DELETIONS;
D O I
10.1093/bib/bbv055
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Whole-exome sequencing (WES) has become a standard method for detecting genetic variants in human diseases. Although the primary use of WES data has been the identification of single nucleotide variations and indels, these data also offer a possibility of detecting copy number variations (CNVs) at high resolution. However, WES data have uneven read coverage along the genome owing to the target capture step, and the development of a robust WES-based CNV tool is challenging. Here, we evaluate six WES somatic CNV detection tools: ADTEx, CONTRA, Control-FREEC, EXCAVATOR, ExomeCNV and Varscan2. Using WES data from 50 kidney chromophobe, 50 bladder urothelial carcinoma, and 50 stomach adenocarcinoma patients from The Cancer Genome Atlas, we compared the CNV calls from the six tools with a reference CNV set that was identified by both single nucleotide polymorphism array 6.0 and whole-genome sequencing data. We found that these algorithms gave highly variable results: visual inspection reveals significant differences between the WES-based segmentation profiles and the reference profile, as well as among the WES-based profiles. Using a 50% overlap criterion, 13-77% of WES CNV calls were covered by CNVs from the reference set, up to 21% of the copy gains were called as losses or vice versa, and dramatic differences in CNV sizes and CNV numbers were observed. Overall, ADTEx and EXCAVATOR had the best performance with relatively high precision and sensitivity. We suggest that the current algorithms for somatic CNV detection from WES data are limited in their performance and that more robust algorithms are needed.
引用
收藏
页码:185 / 192
页数:8
相关论文
共 50 条
  • [31] Detecting copy-number variations in whole-exome sequencing data using the eXome Hidden Markov Model: an ‘exome-first’ approach
    Satoko Miyatake
    Eriko Koshimizu
    Atsushi Fujita
    Ryoko Fukai
    Eri Imagawa
    Chihiro Ohba
    Ichiro Kuki
    Megumi Nukui
    Atsushi Araki
    Yoshio Makita
    Tsutomu Ogata
    Mitsuko Nakashima
    Yoshinori Tsurusaki
    Noriko Miyake
    Hirotomo Saitsu
    Naomichi Matsumoto
    Journal of Human Genetics, 2015, 60 : 175 - 182
  • [32] Detecting copy-number variations in whole-exome sequencing data using the eXome Hidden Markov Model: an 'exome-first' approach
    Miyatake, Satoko
    Koshimizu, Eriko
    Fujita, Atsushi
    Fukai, Ryoko
    Imagawa, Eri
    Ohba, Chihiro
    Kuki, Ichiro
    Nukui, Megumi
    Araki, Atsushi
    Makita, Yoshio
    Ogata, Tsutomu
    Nakashima, Mitsuko
    Tsurusaki, Yoshinori
    Miyake, Noriko
    Saitsu, Hirotomo
    Matsumoto, Naomichi
    JOURNAL OF HUMAN GENETICS, 2015, 60 (04) : 175 - 182
  • [33] Enhanced copy number variants detection from whole-exome sequencing data using EXCAVATOR2
    D'Aurizio, Romina
    Pippucci, Tommaso
    Tattini, Lorenzo
    Giusti, Betti
    Pellegrini, Marco
    Magi, Alberto
    NUCLEIC ACIDS RESEARCH, 2016, 44 (20)
  • [34] The combination of whole-exome sequencing and copy number variation sequencing enables the diagnosis of rare neurological disorders
    Jiao, Qingguo
    Sun, Haiming
    Zhang, Haoya
    Wang, Ran
    Li, Suting
    Sun, Dan
    Yang, Xiu-An
    Jin, Yan
    CLINICAL GENETICS, 2019, 96 (02) : 140 - 150
  • [35] Overcoming Challenges in Copy Number Estimation from Whole Exome Sequencing in Tumors
    Anderson, S.
    Che, Z.
    Keshavan, R.
    Venna, S.
    Collins, C.
    Shams, S.
    JOURNAL OF MOLECULAR DIAGNOSTICS, 2017, 19 (06): : 1035 - 1035
  • [36] Application of whole-exome sequencing for detecting copy number variants in CMT1A/HNPP
    Jo, H. -Y.
    Park, M. -H.
    Woo, H. -M.
    Han, M. H.
    Kim, B. -Y.
    Choi, B. -O.
    Chung, K. W.
    Koo, S. K.
    CLINICAL GENETICS, 2016, 90 (02) : 177 - 181
  • [37] Copy number variants in a large cohort analysed with whole-exome sequencing: lessons for genetic diagnosis
    Lopes, Fatima
    Lopes, Alexandra M.
    Silva, Paulo
    Sousa, Susana
    Morais, Sara
    Sa, Joana
    Brandao, Ana Filipa
    Lopes, Ana
    Bastos-Ferreira, Rita
    Freixo, Joao Parente
    Sequeiros, Jorge
    Oliveira, Jorge
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2022, 30 (SUPPL 1) : 470 - 470
  • [38] Discovery and Statistical Genotyping of Copy-Number Variation from Whole-Exome Sequencing Depth
    Fromer, Menachem
    Moran, Jennifer L.
    Chambert, Kimberly
    Banks, Eric
    Bergen, Sarah E.
    Ruderfer, Douglas M.
    Handsaker, Robert E.
    McCarroll, Steven A.
    O'Donovan, Michael C.
    Owen, Michael J.
    Kirov, George
    Sullivan, Patrick F.
    Hultman, Christina M.
    Sklar, Pamela
    Purcell, Shaun M.
    AMERICAN JOURNAL OF HUMAN GENETICS, 2012, 91 (04) : 597 - 607
  • [39] Adaptive Savitzky-Golay Filters for Analysis of Copy Number Variation Peaks from Whole-Exome Sequencing Data
    Ochieng, Peter Juma
    Maroti, Zoltan
    Dombi, Jozsef
    Kresz, Miklos
    Bekesi, Jozsef
    Kalmar, Tibor
    INFORMATION, 2023, 14 (02)
  • [40] Optimized pipeline of MuTect and GATK tools to improve the detection of somatic single nucleotide polymorphisms in whole-exome sequencing data
    Ítalo Faria do Valle
    Enrico Giampieri
    Giorgia Simonetti
    Antonella Padella
    Marco Manfrini
    Anna Ferrari
    Cristina Papayannidis
    Isabella Zironi
    Marianna Garonzi
    Simona Bernardi
    Massimo Delledonne
    Giovanni Martinelli
    Daniel Remondini
    Gastone Castellani
    BMC Bioinformatics, 17