Aspects of coverage in medical DNA sequencing

被引:22
|
作者
Wendl, Michael C. [1 ]
Wilson, Richard K.
机构
[1] Washington Univ, Genome Sequencing Ctr, St Louis, MO 63108 USA
[2] Washington Univ, Dept Genet, St Louis, MO 63108 USA
关键词
D O I
10.1186/1471-2105-9-239
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: DNA sequencing is now emerging as an important component in biomedical studies of diseases like cancer. Short-read, highly parallel sequencing instruments are expected to be used heavily for such projects, but many design specifications have yet to be conclusively established. Perhaps the most fundamental of these is the redundancy required to detect sequence variations, which bears directly upon genomic coverage and the consequent resolving power for discerning somatic mutations. Results: We address the medical sequencing coverage problem via an extension of the standard mathematical theory of haploid coverage. The expected diploid multi-fold coverage, as well as its generalization for aneuploidy are derived and these expressions can be readily evaluated for any project. The resulting theory is used as a scaling law to calibrate performance to that of standard BAC sequencing at 8 x to 10 x redundancy, i.e. for expected coverages that exceed 99% of the unique sequence. A differential strategy is formalized for tumor/normal studies wherein tumor samples are sequenced more deeply than normal ones. In particular, both tumor alleles should be detected at least twice, while both normal alleles are detected at least once. Our theory predicts these requirements can be met for tumor and normal redundancies of approximately 26 x and 21 x, respectively. We explain why these values do not differ by a factor of 2, as might intuitively be expected. Future technology developments should prompt even deeper sequencing of tumors, but the 21 x value for normal samples is essentially a constant. Conclusion: Given the assumptions of standard coverage theory, our model gives pragmatic estimates for required redundancy. The differential strategy should be an efficient means of identifying potential somatic mutations for further study.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Cover Your Bases: How to Minimize the Sequencing Coverage in DNA Storage Systems
    Bar-Lev, Daniella
    Sabary, Omer
    Gabrys, Ryan
    Yaakobi, Eitan
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2025, 71 (01) : 192 - 218
  • [22] Investigating and Correcting Plasma DNA Sequencing Coverage Bias to Enhance Aneuploidy Discovery
    Chandrananda, Dineika
    Thorne, Natalie P.
    Ganesamoorthy, Devika
    Bruno, Damien L.
    Benjamini, Yuval
    Speed, Terence P.
    Slater, Howard R.
    Bahlo, Melanie
    PLOS ONE, 2014, 9 (01):
  • [23] Kun Zhou, 33 Using fast DNA sequencing for medical tests
    Fairley, Peter
    TECHNOLOGY REVIEW, 2011, 114 (05) : 46 - 46
  • [24] Yemi Adesokan, 34 Using fast DNA sequencing for medical tests
    Singer, Emily
    TECHNOLOGY REVIEW, 2011, 114 (05) : 46 - 46
  • [25] Lightweight Pattern Matching Method for DNA Sequencing in Internet of Medical Things
    Rexie, J. A. M.
    Raimond, Kumudha
    Murugaaboopathy, Mythily
    Brindha, D.
    Mulugeta, Henock
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [26] Low coverage sequencing for repetitive DNA analysis in Passiflora edulis Sims: citogenomic characterization of transposable elements and satellite DNA
    Cayres Pamponet, Vanessa Carvalho
    Souza, Margarete Magalhaes
    Silva, Goncalo Santos
    Micheli, Fabienne
    Ferreira de Melo, Clausio Antonio
    de Oliveira, Sarah Gomes
    Costa, Eduardo Almeida
    Correa, Ronan Xavier
    BMC GENOMICS, 2019, 20 (1)
  • [27] Low coverage sequencing for repetitive DNA analysis in Passiflora edulis Sims: citogenomic characterization of transposable elements and satellite DNA
    Vanessa Carvalho Cayres Pamponét
    Margarete Magalhães Souza
    Gonçalo Santos Silva
    Fabienne Micheli
    Cláusio Antônio Ferreira de Melo
    Sarah Gomes de Oliveira
    Eduardo Almeida Costa
    Ronan Xavier Corrêa
    BMC Genomics, 20
  • [28] Theoretical aspects of the adsorption of normal and modified base pairs of DNA on graphene models toward DNA sequencing
    Radhika, R.
    Shankar, R.
    JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2024, 42 (23): : 13059 - 13073
  • [29] High coverage sequencing of DNA from microorganisms living in an oil reservoir 2.5 kilometres subsurface
    Kotlar, Hans K.
    Lewin, Anna
    Johansen, Jostein
    Throne-Holst, Mimmi
    Haverkamp, Thomas
    Markussen, Sidsel
    Winnberg, Asgeir
    Ringrose, Philip
    Aakvik, Trine
    Ryeng, Einar
    Jakobsen, Kjetill
    Drablos, Finn
    Valla, Svein
    ENVIRONMENTAL MICROBIOLOGY REPORTS, 2011, 3 (06): : 674 - 681
  • [30] Replacing Sanger with Next Generation Sequencing to improve coverage and quality of reference DNA barcodes for plants
    Wilkinson, Mike J.
    Szabo, Claudia
    Ford, Caroline S.
    Yarom, Yuval
    Croxford, Adam E.
    Camp, Amanda
    Gooding, Paul
    SCIENTIFIC REPORTS, 2017, 7