Enhanced De Novo Assembly of High Throughput Pyrosequencing Data Using Whole Genome Mapping

Cited by: 28
Authors
Onmus-Leone, Fatma [1]
Hang, Jun [2]
Clifford, Robert J. [1]
Yang, Yu [2]
Riley, Matthew C. [1]
Kuschner, Robert A. [2]
Waterman, Paige E. [1]
Lesho, Emil P. [1]
Affiliations
[1] Walter Reed Army Inst Res, Multidrug Resistant Organism Surveillance Network, Silver Spring, MD USA
[2] Walter Reed Army Inst Res, Viral Dis Branch, Silver Spring, MD USA
Source
PLOS ONE, 2013, 8 (4)
Keywords
PROVIDENCIA-STUARTII; CLINICAL ISOLATE; SEQUENCE; STRAIN; VALIDATION; BLA(NDM-1); DIVERSITY; OUTBREAK; ERA
DOI
10.1371/journal.pone.0061762
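Chinese Library Classification
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences]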
Subject classification codes
07; 0710; 09
Abstract
Despite major advances in next-generation sequencing, assembly of sequencing data, especially data from novel microorganisms or re-emerging pathogens, remains constrained by the lack of suitable reference sequences. De novo assembly is the best approach to achieve an accurate finished sequence, but multiple sequencing platforms or paired-end libraries are often required to achieve full genome coverage. In this study, we demonstrate a method to assemble complete bacterial genome sequences by integrating shotgun Roche 454 pyrosequencing with optical whole genome mapping (WGM). The whole genome restriction map (WGRM) was used as the reference to scaffold de novo assembled sequence contigs through a stepwise process. Large de novo contigs were placed in the correct order and orientation through alignment to the WGRM. De novo contigs that did not align to the WGRM were merged into scaffolds using contig branching structure information. These extended scaffolds were then aligned to the WGRM to identify the overlaps to be eliminated and the gaps and mismatches to be resolved with unused contigs. The process was repeated until a sequence with full coverage of, and alignment with, the whole genome map was achieved. Using this method, we were able to achieve 100% WGRM coverage without a paired-end library. We assembled complete sequences for three distinct genetic components of a clinical isolate of Providencia stuartii: a bacterial chromosome, a novel bla(NDM-1) plasmid, and a novel bacteriophage, without separately purifying them to homogeneity.
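The contig placement step named in the abstract (ordering and orienting large contigs by alignment to the WGRM) can be illustrated with a toy sketch. This is not the authors' pipeline or any optical mapping software. The function names `digest` and `place_contig`, the recognition site `GAATTC`, the 10% size tolerance, and the fragment sizes are all assumptions chosen for illustration. The sketch digests a contig in silico into restriction fragment lengths, then slides the interior fragments along the map in both orientations. It assumes a palindromic recognition site, so that reversing the fragment order stands in for the reverse strand.

```python
from typing import List, Optional, Tuple


def digest(seq: str, site: str) -> List[int]:
    """In silico digest: fragment lengths when cutting at the start of each site."""
    cuts = []
    pos = seq.find(site)
    while pos != -1:
        cuts.append(pos)
        pos = seq.find(site, pos + 1)
    bounds = [0] + cuts + [len(seq)]
    # Drop zero-length fragments (e.g. a site at position 0).
    return [b - a for a, b in zip(bounds, bounds[1:]) if b > a]


def place_contig(
    map_frags: List[int], contig_frags: List[int], tol: float = 0.1
) -> Optional[Tuple[int, str]]:
    """Return (map fragment index, strand) of the best placement, or None.

    The first and last contig fragments are ignored because contig ends
    are not restriction sites, so those fragments are incomplete.
    """
    interior = contig_frags[1:-1]
    if not interior:
        return None
    best = None
    for strand, frags in (("+", interior), ("-", interior[::-1])):
        for offset in range(len(map_frags) - len(frags) + 1):
            window = map_frags[offset:offset + len(frags)]
            errs = [abs(m - c) / m for m, c in zip(window, frags)]
            # Accept only if every fragment is within the size tolerance.
            if max(errs) <= tol:
                score = sum(errs)
                if best is None or score < best[0]:
                    best = (score, offset, strand)
    return None if best is None else (best[1], best[2])


# Hypothetical map and contig fragment sizes in base pairs.
wgrm = [5000, 12000, 3000, 8000, 15000, 4000]
contig = [700, 3000, 8000, 15000, 900]
placement = place_contig(wgrm, contig)
```

Contigs for which `place_contig` returns `None` correspond to the unaligned contigs that the abstract describes as being merged into scaffolds by other means before another round of map alignment.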
Pages: 9