Polishing copy number variant calls on exome sequencing data via deep learning

Cited by: 6
Authors
Ozden, Furkan [1]
Alkan, Can [1]
Cicek, A. Ercument [1, 2]
Affiliations
[1] Bilkent Univ, Dept Comp Engn, TR-06800 Ankara, Turkey
[2] Carnegie Mellon Univ, Computat Biol Dept, Pittsburgh, PA 15213 USA
Keywords
WHOLE-GENOME;
DOI
10.1101/gr.274845.120
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification codes
071010 ; 081704 ;
Abstract
Accurate and efficient detection of copy number variants (CNVs) is of critical importance owing to their significant association with complex genetic diseases. Although algorithms that use whole-genome sequencing (WGS) data provide stable results with mostly valid statistical assumptions, copy number detection on whole-exome sequencing (WES) data shows comparatively lower accuracy. This is unfortunate as WES data are cost-efficient, compact, and relatively ubiquitous. The bottleneck is primarily due to the noncontiguous nature of the targeted capture: biases in targeted genomic hybridization, GC content, targeting probes, and sample batching during sequencing. Here, we present a novel deep learning model, DECoNT, which uses the matched WES and WGS data, and learns to correct the copy number variations reported by any off-the-shelf WES-based germline CNV caller. We train DECoNT on the 1000 Genomes Project data, and we show that we can efficiently triple the duplication call precision and double the deletion call precision of the state-of-the-art algorithms. We also show that our model consistently improves the performance independent of (1) sequencing technology, (2) exome capture kit, and (3) CNV caller. Using DECoNT as a universal exome CNV call polisher has the potential to improve the reliability of germline CNV detection on WES data sets.
Pages: 1170-1182
Page count: 13