Statistical challenges associated with detecting copy number variations with next-generation sequencing

被引:161
|
作者
Teo, Shu Mei [1 ,2 ,3 ]
Pawitan, Yudi [3 ]
Ku, Chee Seng [3 ]
Chia, Kee Seng [1 ,2 ]
Salim, Agus [1 ]
机构
[1] Natl Univ Singapore, Saw Swee Hock Sch Publ Hlth, Singapore 117597, Singapore
[2] Natl Univ Singapore, NUS Grad Sch Integrat Sci & Engn, Singapore 117456, Singapore
[3] Karolinska Inst, Dept Med Epidemiol & Biostat, S-17177 Stockholm, Sweden
关键词
STRUCTURAL VARIATION; PAIRED-END; SHORT-READ; ALGORITHMS; VARIANTS; MODEL; TOOL;
D O I
10.1093/bioinformatics/bts535
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Analysing next-generation sequencing (NGS) data for copy number variations (CNVs) detection is a relatively new and challenging field, with no accepted standard protocols or quality control measures so far. There are by now several algorithms developed for each of the four broad methods for CNV detection using NGS, namely the depth of coverage (DOC), read-pair, split-read and assembly-based methods. However, because of the complexity of the genome and the short read lengths from NGS technology, there are still many challenges associated with the analysis of NGS data for CNVs, no matter which method or algorithm is used. Results: In this review, we describe and discuss areas of potential biases in CNV detection for each of the four methods. In particular, we focus on issues pertaining to (i) mappability, (ii) GC-content bias, (iii) quality control measures of reads and (iv) difficulty in identifying duplications. To gain insights to some of the issues discussed, we also download real data from the 1000 Genomes Project and analyse its DOC data. We show examples of how reads in repeated regions can affect CNV detection, demonstrate current GC-correction algorithms, investigate sensitivity of DOC algorithm before and after quality control of reads and discuss reasons for which duplications are harder to detect than deletions.
引用
收藏
页码:2711 / 2718
页数:8
相关论文
共 50 条
  • [21] Chromosomal copy number variations in products of conception from spontaneous abortion by next-generation sequencing technology
    Dai, Rulin
    Xi, Qi
    Wang, Ruixue
    Zhang, Hongguo
    Jiang, Yuting
    Li, Leilei
    Liu, Ruizhi
    MEDICINE, 2019, 98 (47)
  • [22] Detection of copy number variations with next generation sequencing: laboratory validation
    Klancar, G.
    Dragos, V. Setrajcic
    Novakovic, S.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2018, 26 : 639 - 639
  • [23] A Cluster-Based Approach for the Discovery of Copy Number Variations From Next-Generation Sequencing Data
    Liu, Guojun
    Zhang, Junying
    FRONTIERS IN GENETICS, 2021, 12
  • [24] Validation of copy number variation analysis for next-generation sequencing diagnostics
    Jamie M Ellingford
    Christopher Campbell
    Stephanie Barton
    Sanjeev Bhaskar
    Saurabh Gupta
    Rachel L Taylor
    Panagiotis I Sergouniotis
    Bradley Horn
    Janine A Lamb
    Michel Michaelides
    Andrew R Webster
    William G Newman
    Binay Panda
    Simon C Ramsden
    Graeme CM Black
    European Journal of Human Genetics, 2017, 25 : 719 - 724
  • [25] Validation of copy number variation analysis for next-generation sequencing diagnostics
    Ellingford, Jamie M.
    Campbell, Christopher
    Barton, Stephanie
    Bhaskar, Sanjeev
    Gupta, Saurabh
    Taylor, Rachel L.
    Sergouniotis, Panagiotis I.
    Horn, Bradley
    Lamb, Janine A.
    Michaelides, Michel
    Webster, Andrew R.
    Newman, William G.
    Panda, Binay
    Ramsden, Simon C.
    Black, Graeme C. M.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2017, 25 (06) : 719 - 724
  • [26] Next-generation sequencing identifies recurrent copy number variations in invasive breast carcinomas from Ghana
    Anwar, Talha
    Rufail, Miguel L.
    Djomehri, Sabra I.
    Gonzalez, Maria E.
    Lazo de la Vega, Lorena
    Tomlins, Scott A.
    Newman, Lisa A.
    Kleer, Celina G.
    MODERN PATHOLOGY, 2020, 33 (08) : 1537 - 1545
  • [27] Copy Number Variant Detection by Targeted Gene Next-Generation Sequencing
    Myers, C. E.
    Nguyen, H. L.
    Hauenstein, J.
    Saxe, D. F.
    Smith, G. H.
    Hill, C. E.
    Zhang, L.
    Deeb, K. K.
    JOURNAL OF MOLECULAR DIAGNOSTICS, 2019, 21 (06): : 1150 - 1151
  • [28] Next-Generation Sequencing Challenges
    Baker S.C.
    2017, Mary Ann Liebert Inc. (37): : 1and14 - 15
  • [29] A Rapid PCR-Free Next-Generation Sequencing Method for the Detection of Copy Number Variations in Prenatal Samples
    Zhou, Xiya
    Chen, Xiangbin
    Jiang, Yulin
    Qi, Qingwei
    Hao, Na
    Liu, Chengkun
    Xu, Mengnan
    Cram, David S.
    Liu, Juntao
    LIFE-BASEL, 2021, 11 (02): : 1 - 14
  • [30] Detection of copy number variations by pair analysis using next-generation sequencing data in inherited kidney diseases
    China Nagano
    Kandai Nozu
    Naoya Morisada
    Masahiko Yazawa
    Daisuke Ichikawa
    Keita Numasawa
    Hiroyo Kourakata
    Chieko Matsumura
    Satoshi Tazoe
    Ryojiro Tanaka
    Tomohiko Yamamura
    Shogo Minamikawa
    Tomoko Horinouchi
    Keita Nakanishi
    Junya Fujimura
    Nana Sakakibara
    Yoshimi Nozu
    Ming Juan Ye
    Hiroshi Kaito
    Kazumoto Iijima
    Clinical and Experimental Nephrology, 2018, 22 : 881 - 888