Improving mammalian genome scaffolding using large insert mate-pair next-generation sequencing

Cited by: 27
Authors
van Heesch, Sebastiaan [1,2]
Kloosterman, Wigard P. [3]
Lansu, Nico [1,2]
Ruzius, Frans-Paul [1,2]
Levandowsky, Elizabeth [4]
Lee, Clarence C. [4]
Zhou, Shiguo [5]
Goldstein, Steve [5]
Schwartz, David C. [5]
Harkins, Timothy T. [4]
Guryev, Victor [1,2,6]
Cuppen, Edwin [1,2,3]
Affiliations
[1] Hubrecht Inst KNAW, NL-3584 CT Utrecht, Netherlands
[2] Univ Med Ctr Utrecht, NL-3584 CT Utrecht, Netherlands
[3] UMC Utrecht, Dept Med Genet, NL-3584 GG Utrecht, Netherlands
[4] Life Technol Inc, Adv Applicat Grp, Cummings Ctr 500, Beverly, MA 01915 USA
[5] Univ Wisconsin, Dept Chem, UW Biotechnol Ctr, Lab Mol & Computat Genom,Lab Genet, Madison, WI 53706 USA
[6] Univ Groningen, Univ Med Ctr Groningen, European Res Inst Biol Ageing, Lab Genome Struct & Ageing, NL-9713 AV Groningen, Netherlands
Source
BMC GENOMICS | 2013 / Vol. 14
Funding
National Institutes of Health (NIH), USA;
Keywords
Genome structure; Genome scaffolding; Mate-pair next-generation sequencing; Contig assembly; Rat genome; STRUCTURAL VARIATION; GEL-ELECTROPHORESIS; CANCER GENOMES; CHROMOTHRIPSIS; DNA; REARRANGEMENTS; ASSEMBLIES; RESOLUTION; EVOLUTION; PATTERNS;
DOI
10.1186/1471-2164-14-257
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Subject classification codes
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
Background: Paired-tag sequencing approaches are commonly used for the analysis of genome structure. However, mammalian genomes have a complex organization with a variety of repetitive elements that complicate comprehensive genome-wide analyses.
Results: Here, we systematically assessed the utility of paired-end and mate-pair (MP) next-generation sequencing libraries with insert sizes ranging from 170 bp to 25 kb for genome coverage and for improving scaffolding of a mammalian genome (Rattus norvegicus). Despite a lower library complexity, large-insert MP libraries (20 or 25 kb) provided very high physical genome coverage and were found to efficiently span repeat elements in the genome. Medium-sized (5, 8 or 15 kb) MP libraries were much more efficient for genome structure analysis than the more commonly used shorter-insert paired-end and 3 kb MP libraries. Furthermore, the combination of medium- and large-insert libraries resulted in a 3-fold increase in N50 in scaffolding processes. Finally, we show that our data can be used to evaluate and improve contig order and orientation in the current rat reference genome assembly.
Conclusions: We conclude that applying combinations of mate-pair libraries with insert sizes that match the distributions of repetitive elements improves contig scaffolding and can contribute to the finishing of draft genomes.
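The abstract reports results in terms of N50 and physical genome coverage. As a rough illustration of what these two metrics measure, the sketch below computes them with their commonly used definitions. It is not the authors' pipeline: the function names `n50` and `physical_coverage` are hypothetical, and the input numbers are invented for the example rather than taken from the paper.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold or contig lengths.

    N50 is the length L such that sequences of length >= L together
    contain at least half of the total assembled bases.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("lengths must be non-empty")


def physical_coverage(n_pairs, insert_size, genome_size):
    """Approximate physical coverage: bases spanned by inserts / genome size.

    Assumes every pair spans one insert of the nominal size; real libraries
    have a distribution of insert sizes and duplicate pairs.
    """
    return n_pairs * insert_size / genome_size


# Hypothetical scaffold lengths (arbitrary units), not data from the paper.
example_lengths = [2, 3, 4, 5, 6, 7, 8, 9, 10]
example_n50 = n50(example_lengths)

# Hypothetical library: 1 million pairs, 20 kb inserts, 2 Gb genome.
example_cov = physical_coverage(1_000_000, 20_000, 2_000_000_000)
```

Because physical coverage scales with insert size rather than read length, a large-insert library can span much of a genome even with relatively few distinct pairs, which is consistent with the abstract's observation about the 20 and 25 kb libraries.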
Pages: 11