Assembling the 20 Gb white spruce (Picea glauca) genome from whole-genome shotgun sequencing data

Cited by: 262
Authors
Birol, Inanc [1, 2, 3]
Raymond, Anthony [1]
Jackman, Shaun D. [1]
Pleasance, Stephen [1]
Coope, Robin [1]
Taylor, Greg A. [1]
Saint Yuen, Macaire Man [4]
Keeling, Christopher I. [4]
Brand, Dana [1]
Vandervalk, Benjamin P. [1]
Kirk, Heather [1]
Pandoh, Pawan [1]
Moore, Richard A. [1]
Zhao, Yongjun [1]
Mungall, Andrew J. [1]
Jaquish, Barry [5]
Yanchuk, Alvin [5]
Ritland, Carol [4, 6]
Boyle, Brian [7]
Bousquet, Jean [7, 8]
Ritland, Kermit [6]
MacKay, John [7, 8]
Bohlmann, Joerg [4, 6]
Jones, Steven J. M. [1, 2, 9]
Affiliations
[1] British Columbia Canc Agcy, Genome Sci Ctr, Vancouver, BC V5Z 4S6, Canada
[2] Univ British Columbia, Dept Med Genet, Vancouver, BC V6H 3N1, Canada
[3] Simon Fraser Univ, Sch Comp Sci, Burnaby, BC V5A 1S6, Canada
[4] Univ British Columbia, Michael Smith Labs, Vancouver, BC V6T 1Z4, Canada
[5] British Columbia Minist Forests, Lands & Nat Resou, Victoria, BC V8W 9C2, Canada
[6] Univ British Columbia, Dept Forest Sci, Vancouver, BC V6T 1Z4, Canada
[7] Univ Laval, Inst Syst & Integrat Biol, Quebec City, PQ G1K 7P4, Canada
[8] Univ Laval, Dept Wood & Forest Sci, Quebec City, PQ G1V 0A6, Canada
[9] Simon Fraser Univ, Dept Mol Biol & Biochem, Burnaby, BC V5A 1S6, Canada
Keywords
MOUNTAIN PINE-BEETLE; NOVO; IDENTIFICATION; SYNTHASE; GENES;
DOI
10.1093/bioinformatics/btt178
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
White spruce (Picea glauca) is a dominant conifer of the boreal forests of North America, and providing genomics resources for this commercially valuable tree will help improve forest management and conservation efforts. Sequencing and assembling the large and highly repetitive spruce genome, however, pushes the boundaries of current technology. Here, we describe a whole-genome shotgun sequencing strategy using two Illumina sequencing platforms and an assembly approach using the ABySS software. We report a 20.8 giga base pair (Gbp) draft genome in 4.9 million scaffolds, with a scaffold N50 of 20 356 bp. We demonstrate how recent improvements in sequencing technology, especially increasing read lengths and paired-end reads from longer fragments, have a major impact on assembly contiguity. We also note that scalable bioinformatics tools are instrumental in providing rapid draft assemblies.
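The abstract summarizes contiguity with a scaffold N50. As a reading aid, the following is a minimal sketch of how an N50 is commonly computed from a list of scaffold lengths: the length of the scaffold at which the running total, taken from longest to shortest, first reaches half of the total assembly size. The function name `n50` and the toy lengths are illustrative assumptions and are not taken from the paper or from ABySS.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths (in bp).

    Hypothetical helper for illustration only: sort lengths from longest
    to shortest and return the length at which the cumulative sum first
    reaches at least half of the total.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Toy scaffold lengths (made up for the example, not data from the paper).
toy_scaffolds = [2, 3, 4, 5, 6, 7, 8, 9, 10]
toy_n50 = n50(toy_scaffolds)
print(toy_n50)
```

Under this definition, a larger N50 means that more of the assembly sits in long scaffolds, which is why the statistic is used as a contiguity measure.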
Pages: 1492-1497
Number of pages: 6