Polishing copy number variant calls on exome sequencing data via deep learning

被引:6
|
作者
Ozden, Furkan [1 ]
Alkan, Can [1 ]
Cicek, A. Ercument [1 ,2 ]
机构
[1] Bilkent Univ, Dept Comp Engn, TR-06800 Ankara, Turkey
[2] Carnegie Mellon Univ, Computat Biol Dept, Pittsburgh, PA 15213 USA
关键词
WHOLE-GENOME;
D O I
10.1101/gr.274845.120
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Accurate and efficient detection of copy number variants (CNVs) is of critical importance owing to their significant association with complex genetic diseases. Although algorithms that use whole-genome sequencing (WGS) data provide stable results with mostly valid statistical assumptions, copy number detection on whole-exome sequencing (WES) data shows comparatively lower accuracy. This is unfortunate as WES data are cost-efficient, compact, and relatively ubiquitous. The bottleneck is primarily due to the noncontiguous nature of the targeted capture: biases in targeted genomic hybridization, GC content, targeting probes, and sample batching during sequencing. Here, we present a novel deep learning model, DECoNT, which uses the matched WES and WGS data, and learns to correct the copy number variations reported by any off-the-shelf WES-based germline CNV caller. We train DECoNT on the 1000 Genomes Project data, and we show that we can efficiently triple the duplication call precision and double the deletion call precision of the state-of-the-art algorithms. We also show that our model consistently improves the performance independent of (1) sequencing technology, (2) exome capture kit, and (3) CNV caller. Using DECoNT as a universal exome CNV call polisher has the potential to improve the reliability of germline CNV detection on WES data sets.
引用
收藏
页码:1170 / 1182
页数:13
相关论文
共 50 条
  • [31] ERDS-pe: a paired hidden Markov model for copy number variant detection from whole-exome sequencing data
    Tan, Renjie
    Wang, Jixuan
    Wu, Xiaoliang
    Wan, Guoqiang
    Wang, Rongjie
    Ma, Rui
    Han, Zhijie
    Zhou, Wenyang
    Jin, Shuilin
    Jiang, Qinghua
    Wang, Yadong
    2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 141 - 144
  • [32] A machine-learning approach for accurate detection of copy number variants from exome sequencing
    Pounraja, Vijay Kumar
    Jayakar, Gopal
    Jensen, Matthew
    Kelkar, Neil
    Girirajan, Santhosh
    GENOME RESEARCH, 2019, 29 (07) : 1134 - 1143
  • [33] An evaluation of copy number variation detection tools for cancer using whole exome sequencing data
    Zare, Fatima
    Dow, Michelle
    Monteleone, Nicholas
    Hosny, Abdelrahman
    Nabavi, Sheida
    BMC BIOINFORMATICS, 2017, 18
  • [34] COMBINATORIAL ANALYSIS OF EXOME SEQUENCING DATA AND COPY NUMBER VARIANTS IN CONGENITAL HEART DISEASE PATIENTS
    Fotiou, Elisavet
    Williams, Simon
    Keavney, Bernard
    HEART, 2017, 103 : A115 - A116
  • [35] Combinatorial approach to estimate copy number genotype using whole-exome sequencing data
    Hwang, Mi Yeong
    Moon, Sanghoon
    Heo, Lyong
    Kim, Young Jin
    Oh, Ji Hee
    Kim, Yeon-Jung
    Kim, Yun Kyoung
    Lee, Juyoung
    Han, Bok-Ghee
    Kim, Bong-Jo
    GENOMICS, 2015, 105 (03) : 145 - 149
  • [36] Copy number detection from exome sequencing data for patients with neurodevelopmental disorder: an effective approach
    D'haenens, Erika
    Delbaere, Sarah
    Rosseel, Toon
    De Bruyne, Marieke
    Syryn, Hannes
    Callewaert, Bert
    Menten, Bjorn
    Dheedene, Annelies
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2023, 31 : 452 - 452
  • [37] An evaluation of copy number variation detection tools for cancer using whole exome sequencing data
    Fatima Zare
    Michelle Dow
    Nicholas Monteleone
    Abdelrahman Hosny
    Sheida Nabavi
    BMC Bioinformatics, 18
  • [38] Exome CNV Overlapping (ECO): an Integrative Copy Number Variation Caller for Exome Sequencing
    Zhang, Peng
    Ling, Hua
    Pugh, Elizabeth
    Doheny, Kim
    GENETIC EPIDEMIOLOGY, 2017, 41 (07) : 700 - 701
  • [39] Modeling exome sequencing data with generalized Gaussian distribution with application to copy number variation detection
    Duan, Junbo
    Wan, Mingxi
    Deng, Hong-Wen
    Wang, Yu-Ping
    2013 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2013,
  • [40] Evaluation of homologous recombination deficiency by copy number scar analysis with exome and target sequencing data
    Tatsuno, Kenji
    Tsutsumi, Shuichi
    Ueda, Hiroki
    Oda, Katsutoshi
    Aburatani, Hiroyuki
    CANCER SCIENCE, 2023, 114 : 1242 - 1242