Improving mammalian genome scaffolding using large insert mate-pair next-generation sequencing

Cited by: 27
Authors
van Heesch, Sebastiaan [1 ,2 ]
Kloosterman, Wigard P. [3 ]
Lansu, Nico [1 ,2 ]
Ruzius, Frans-Paul [1 ,2 ]
Levandowsky, Elizabeth [4 ]
Lee, Clarence C. [4 ]
Zhou, Shiguo [5 ]
Goldstein, Steve [5 ]
Schwartz, David C. [5 ]
Harkins, Timothy T. [4 ]
Guryev, Victor [1 ,2 ,6 ]
Cuppen, Edwin [1 ,2 ,3 ]
Affiliations
[1] Hubrecht Inst KNAW, NL-3584 CT Utrecht, Netherlands
[2] Univ Med Ctr Utrecht, NL-3584 CT Utrecht, Netherlands
[3] UMC Utrecht, Dept Med Genet, NL-3584 GG Utrecht, Netherlands
[4] Life Technol Inc, Adv Applicat Grp, Cummings Ctr 500, Beverly, MA 01915 USA
[5] Univ Wisconsin, Dept Chem, UW Biotechnol Ctr, Lab Mol & Computat Genom,Lab Genet, Madison, WI 53706 USA
[6] Univ Groningen, Univ Med Ctr Groningen, European Res Inst Biol Ageing, Lab Genome Struct & Ageing, NL-9713 AV Groningen, Netherlands
Source
BMC GENOMICS | 2013 / Volume 14
Funding
National Institutes of Health (USA);
Keywords
Genome structure; Genome scaffolding; Mate-pair next-generation sequencing; Contig assembly; Rat genome; STRUCTURAL VARIATION; GEL-ELECTROPHORESIS; CANCER GENOMES; CHROMOTHRIPSIS; DNA; REARRANGEMENTS; ASSEMBLIES; RESOLUTION; EVOLUTION; PATTERNS;
DOI
10.1186/1471-2164-14-257
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Discipline classification code
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
Background: Paired-tag sequencing approaches are commonly used for the analysis of genome structure. However, mammalian genomes have a complex organization with a variety of repetitive elements that complicate comprehensive genome-wide analyses.
Results: Here, we systematically assessed the utility of paired-end and mate-pair (MP) next-generation sequencing libraries with insert sizes ranging from 170 bp to 25 kb, for genome coverage and for improving scaffolding of a mammalian genome (Rattus norvegicus). Despite a lower library complexity, large-insert MP libraries (20 or 25 kb) provided very high physical genome coverage and were found to efficiently span repeat elements in the genome. Medium-sized (5, 8 or 15 kb) MP libraries were much more efficient for genome structure analysis than the more commonly used shorter-insert paired-end and 3 kb MP libraries. Furthermore, the combination of medium- and large-insert libraries resulted in a 3-fold increase in N50 in scaffolding processes. Finally, we show that our data can be used to evaluate and improve contig order and orientation in the current rat reference genome assembly.
Conclusions: We conclude that applying combinations of mate-pair libraries with insert sizes that match the distributions of repetitive elements improves contig scaffolding and can contribute to the finishing of draft genomes.
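The abstract reports results in terms of scaffold N50 and physical genome coverage. As a minimal sketch of how these two quantities are conventionally computed (not the authors' pipeline), the Python below uses the standard definitions: N50 is the length at which the cumulative sum of lengths, sorted from longest to shortest, first reaches half the total; physical coverage is approximated as number of pairs times insert size divided by genome size. The function names and all input numbers are hypothetical illustrations, not values from the paper.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths.

    N50 is the length L such that sequences of length >= L together
    cover at least half of the total assembled length.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    return 0  # empty input


def physical_coverage(num_pairs, insert_size, genome_size):
    """Approximate physical coverage: each pair spans one full insert."""
    return num_pairs * insert_size / genome_size


# Hypothetical toy lengths (not data from the paper).
toy_scaffolds = [2, 3, 4, 5, 6, 7, 8, 9, 10]
print(n50(toy_scaffolds))

# Hypothetical library: 1 million pairs, 20 kb inserts, 2 Gb genome.
print(physical_coverage(1_000_000, 20_000, 2_000_000_000))
```

Because physical coverage scales with insert size, a large-insert library can span much of the genome even when it contains relatively few distinct pairs, which is consistent with the abstract's observation about the 20 and 25 kb libraries.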
Pages: 11
Related papers
50 in total
  • [1] Improving mammalian genome scaffolding using large insert mate-pair next-generation sequencing
    Sebastiaan van Heesch
    Wigard P Kloosterman
    Nico Lansu
    Frans-Paul Ruzius
    Elizabeth Levandowsky
    Clarence C Lee
    Shiguo Zhou
    Steve Goldstein
    David C Schwartz
    Timothy T Harkins
    Victor Guryev
    Edwin Cuppen
    [J]. BMC Genomics, 14
  • [2] Improving Bacillus Altitudinis B-388 Genome Scaffolding Using Mate-Pair Next-Generation Sequencing
    Ulyanova V.
    Shah Mahmud R.
    Malanin S.
    Vershinina V.
    Ilinskaya O.
    [J]. Ulyanova, Vera (ulyanova.vera@gmail.com), 1600, Springer Science and Business Media, LLC (07): : 85 - 87
  • [3] Long-span, mate-pair scaffolding and other methods for faster next-generation sequencing library creation
    Cheng-Cang Wu
    Rosa Ye
    Svetlana Jasinovica
    Megan Wagner
    Ronald Godiska
    Amy Hin-Yan Tong
    Si Lok
    Amanda Krerowicz
    Curtis Knox
    David Mead
    Michael Lodes
    [J]. Nature Methods, 2012, 9 (9) : i - ii
  • [4] A scaffold analysis tool using mate-pair information in genome sequencing
    Kim, Pan-Gyu
    Cho, Hwan-Gue
    Park, Kiejung
    [J]. JOURNAL OF BIOMEDICINE AND BIOTECHNOLOGY, 2008,
  • [5] Conversion of Mate-Pair Reads into Long Sequences for Improving Assembly Scaffolding
    Lee, Chao-Hung
    Tsai, Cheng-Wei
    Huang, Yao-Ting
    [J]. 2016 INTERNATIONAL COMPUTER SYMPOSIUM (ICS), 2016, : 44 - 48
  • [6] ConPath: Scaffold analysis tool using mate-pair information for genome sequencing
    Kim, Pan-Gyu
    Cho, Hwan-Gue
    Park, Kiejung
    [J]. PROCEEDINGS OF THE FRONTIERS IN THE CONVERGENCE OF BIOSCIENCE AND INFORMATION TECHNOLOGIES, 2007, : 55 - +
  • [7] Next-generation sequencing and large genome assemblies
    Henson, Joseph
    Tischler, German
    Ning, Zemin
    [J]. PHARMACOGENOMICS, 2012, 13 (08) : 901 - 915
  • [8] Copy number variant analysis using genome-wide mate-pair sequencing
    Smadbeck, James B.
    Johnson, Sarah H.
    Smoley, Stephanie A.
    Gaitatzes, Athanasios
    Drucker, Travis M.
    Zenka, Roman M.
    Kosari, Farhad
    Murphy, Stephen J.
    Hoppman, Nicole
    Aypar, Umut
    Sukov, William R.
    Jenkins, Robert B.
    Kearney, Hutton M.
    Feldman, Andrew L.
    Vasmatzis, George
    [J]. GENES CHROMOSOMES & CANCER, 2018, 57 (09): : 459 - 470
  • [9] Mate-pair genome sequencing reveals structural variants for idiopathic male infertility
    Dong, Zirui
    Qian, Jicheng
    Law, Tracy Sze Man
    Chau, Matthew Hoi Kin
    Cao, Ye
    Xue, Shuwen
    Tong, Steve
    Zhao, Yilin
    Kwok, Yvonne K.
    Ng, Karen
    Chan, David Yiu Leung
    Chiu, Peter K. -F.
    Ng, Chi-Fai
    Chung, Cathy Hoi Sze
    Mak, Jennifer Sze Man
    Leung, Tak Yeung
    Chung, Jacqueline Pui Wah
    Morton, Cynthia C.
    Choy, Kwong Wai
    [J]. HUMAN GENETICS, 2023, 142 (03) : 363 - 377
  • [10] Mate-pair genome sequencing reveals structural variants for idiopathic male infertility
    Zirui Dong
    Jicheng Qian
    Tracy Sze Man Law
    Matthew Hoi Kin Chau
    Ye Cao
    Shuwen Xue
    Steve Tong
    Yilin Zhao
    Yvonne K. Kwok
    Karen Ng
    David Yiu Leung Chan
    Peter K.-F. Chiu
    Chi-Fai Ng
    Cathy Hoi Sze Chung
    Jennifer Sze Man Mak
    Tak Yeung Leung
    Jacqueline Pui Wah Chung
    Cynthia C. Morton
    Kwong Wai Choy
    [J]. Human Genetics, 2023, 142 : 363 - 377