Analysis of High-Throughput Sequencing and Annotation Strategies for Phage Genomes

Cited by: 63
Authors
Henn, Matthew R. [1]
Sullivan, Matthew B. [2]
Stange-Thomann, Nicole [1]
Osburne, Marcia S. [2]
Berlin, Aaron M. [1]
Kelly, Libusha [2]
Yandava, Chandri [1]
Kodira, Chinnappa [1]
Zeng, Qiandong [1]
Weiand, Michael [1]
Sparrow, Todd [1]
Saif, Sakina [1]
Giannoukos, Georgia [1]
Young, Sarah K. [1]
Nusbaum, Chad [1]
Birren, Bruce W. [1]
Chisholm, Sallie W. [2]
Affiliations
[1] MIT & Harvard, Broad Inst, Cambridge, MA 02139 USA
[2] MIT, Dept Civil & Environm Engn, Cambridge, MA 02139 USA
Source
PLOS ONE | 2010 / Volume 5 / Issue 02
Keywords
MARINE SYNECHOCOCCUS STRAINS; PHOTOSYNTHESIS GENES; MICROBIAL GENOMES; PROTEIN FAMILIES; RNA GENES; VIRUSES; PROCHLOROCOCCUS; IDENTIFICATION; BACTERIOPHAGE; DATABASE;
DOI
10.1371/journal.pone.0009083
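Chinese Library Classification number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code
07; 0710; 09;
Abstract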
Background: Bacterial viruses (phages) play a critical role in shaping microbial populations as they influence both host mortality and horizontal gene transfer. As such, they have a significant impact on local and global ecosystem function and human health. Despite their importance, little is known about the genomic diversity harbored in phages, as methods to capture complete phage genomes have been hampered by the lack of knowledge about the target genomes, and difficulties in generating sufficient quantities of genomic DNA for sequencing. Of the approximately 550 phage genomes currently available in the public domain, fewer than 5% are marine phage.
Methodology/Principal Findings: To advance the study of phage biology through comparative genomic approaches we used marine cyanophage as a model system. We compared DNA preparation methodologies (DNA extraction directly from either phage lysates or CsCl purified phage particles), and sequencing strategies that utilize either Sanger sequencing of a linker amplification shotgun library (LASL) or of a whole genome shotgun library (WGSL), or 454 pyrosequencing methods. We demonstrate that genomic DNA sample preparation directly from a phage lysate, combined with 454 pyrosequencing, is best suited for phage genome sequencing at scale, as this method is capable of capturing complete continuous genomes with high accuracy. In addition, we describe an automated annotation informatics pipeline that delivers high-quality annotation and yields few false positives and negatives in ORF calling.
Conclusions/Significance: These DNA preparation, sequencing and annotation strategies enable a high-throughput approach to the burgeoning field of phage genomics.
Pages: 12