Short read fragment assembly of bacterial genomes

Cited by: 275
Authors
Chaisson, Mark J. [2]
Pevzner, Pavel A. [1]
Affiliations
[1] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
[2] Univ Calif San Diego, Bioinformat Program, La Jolla, CA 92093 USA
DOI
10.1101/gr.7088808
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
In the last year, high-throughput sequencing technologies have progressed from proof-of-concept to production quality. While these methods produce high-quality reads, they have yet to produce reads comparable in length to Sanger-based sequencing. Current fragment assembly algorithms have been implemented and optimized for mate-paired Sanger-based reads, and thus do not perform well on short reads produced by short read technologies. We present a new Eulerian assembler that generates nearly optimal short read assemblies of bacterial genomes and describe an approach to assemble reads in the case of the popular hybrid protocol when short and long Sanger-based reads are combined.
Pages: 324-330
Page count: 7