cnvCapSeq: detecting copy number variation in long-range targeted resequencing data

被引:11
|
作者
Bellos, Evangelos [1 ]
Kumar, Vikrant [2 ]
Lin, Clarabelle [2 ]
Maggi, Jordi [3 ]
Phua, Zai Yang [2 ]
Cheng, Ching-Yu [4 ,5 ]
Cheung, Chui Ming Gemmy [4 ,5 ]
Hibberd, Martin L. [2 ,6 ]
Wong, Tien Yin [4 ,5 ]
Coin, Lachlan J. M. [1 ,7 ]
Davila, Sonia [2 ]
机构
[1] Univ London Imperial Coll Sci Technol & Med, Sch Publ Hlth, Dept Genom Common Dis, London W12 0NN, England
[2] Genome Inst Singapore, Singapore 138672, Singapore
[3] Univ Zurich, Inst Med Mol Genet, CH-8952 Schlieren, Switzerland
[4] Singapore Natl Eye Ctr, Singapore Eye Res Inst, Singapore 168751, Singapore
[5] Natl Univ Singapore, Dept Ophthalmol, Singapore 119228, Singapore
[6] Univ London London Sch Hyg & Trop Med, Fac Infect & Trop Dis, London WC1E 7HT, England
[7] Univ Queensland, Inst Mol Biosci, Brisbane, Qld 4072, Australia
基金
澳大利亚研究理事会;
关键词
DNA-SEQUENCING DATA; DE-NOVO MUTATIONS; READ ALIGNMENT; EXOME; DISCOVERY; SUSCEPTIBILITY; ASSOCIATION; FRAMEWORK; VARIANTS; DELETION;
D O I
10.1093/nar/gku849
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Targeted resequencing technologies have allowed for efficient and cost-effective detection of genomic variants in specific regions of interest. Although capture sequencing has been primarily used for investigating single nucleotide variants and indels, it has the potential to elucidate a broader spectrum of genetic variation, including copy number variants (CNVs). Various methods exist for detecting CNV in whole-genome and exome sequencing datasets. However, no algorithms have been specifically designed for contiguous target sequencing, despite its increasing importance in clinical and research applications. We have developed cnvCapSeq, a novel method for accurate and sensitive CNV discovery and genotyping in long-range targeted resequencing. cnvCapSeq was benchmarked using a simulated contiguous capture sequencing dataset comprising 21 genomic loci of various lengths. cnvCapSeq was shown to outperform the best existing exome CNV method by a wide margin both in terms of sensitivity (92.0 versus 48.3%) and specificity (99.8 versus 70.5%). We also applied cnvCapSeq to a real capture sequencing cohort comprising a contiguous 358 kb region that contains the Complement Factor H gene cluster. In this dataset, cnvCapSeq identified 41 samples with CNV, including two with duplications, with a genotyping accuracy of 99%, as ascertained by quantitative real-time PCR.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Analysis of multiple resistance genes of C. albicans and C. glabrata using targeted resequencing with multiplex long-range PCR
    Spettel, K.
    Reiser, M.
    Barousch, W.
    Nehr, M.
    Selitsch, B.
    Willinger, B.
    [J]. MYCOSES, 2018, 61 : 27 - 28
  • [42] ExomeHMM: A Hidden Markov Model for Detecting Copy Number Variation Using Whole-Exome Sequencing Data
    Guo, Cheng
    Yu, Zhenhua
    Wang, Minghui
    Li, Ao
    [J]. CURRENT BIOINFORMATICS, 2017, 12 (02) : 147 - 155
  • [43] Detecting long-range dependence with truncated ratios of periodogram ordinates
    Reschenhofer, Erhard
    Mangat, Manveer Kaur
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2021, 50 (15) : 3645 - 3660
  • [44] Detecting long-range correlations in time series of neuronal discharges
    Blesic, S
    Milosevic, S
    Stratimirovic, D
    Ljubisavljevic, M
    [J]. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2003, 330 (3-4) : 391 - 399
  • [45] CGH for detecting copy number variation (vol 27, pg 34, 2007)
    Glaser, V
    [J]. GENETIC ENGINEERING & BIOTECHNOLOGY NEWS, 2007, 27 (09): : 68 - 68
  • [46] A Hidden Markov Model for Copy Number Variant prediction from whole genome resequencing data
    Yufeng Shen
    Yiwei Gu
    Itsik Pe’er
    [J]. BMC Bioinformatics, 12
  • [47] Detecting copy number variation in the human genome using comparative genomic hybridization
    Tchinda, Joelle
    Lee, Charles
    [J]. BIOTECHNIQUES, 2006, 41 (04) : 385 - +
  • [48] A Hidden Markov Model for Copy Number Variant prediction from whole genome resequencing data
    Shen, Yufeng
    Gu, Yiwei
    Pe'er, Itsik
    [J]. BMC BIOINFORMATICS, 2011, 12
  • [49] US inflation dynamics on long-range data
    Plakandaras, Vasilios
    Gogas, Periklis
    Gupta, Rangan
    Papadimitriou, Theophilos
    [J]. APPLIED ECONOMICS, 2015, 47 (36) : 3874 - 3890
  • [50] Copy number variation analysis in 83 children with early-onset developmental and epileptic encephalopathy after targeted resequencing of a 109-epilepsy gene panel
    Hirabayashi, Kyoko
    Uehara, Daniela Tiaki
    Abe, Hidetoshi
    Ishii, Atsushi
    Moriyama, Keiji
    Hirose, Shinichi
    Inazawa, Johji
    [J]. JOURNAL OF HUMAN GENETICS, 2019, 64 (11) : 1097 - 1106