Sequencing smart: De novo sequencing and assembly approaches for a non-model mammal

被引:12
|
作者
Etherington, Graham J. [1 ]
Heavens, Darren [1 ]
Baker, David [1 ]
Lister, Ashleigh [1 ]
McNelly, Rose [1 ]
Garcia, Gonzalo [1 ]
Clavijo, Bernardo [1 ]
Macaulay, Iain [1 ]
Haerty, Wilfried [1 ]
Di Palma, Federica [1 ]
机构
[1] Norwich Res Pk, Earlham Inst, Norwich NR4 7UZ, Norfolk, England
来源
GIGASCIENCE | 2020年 / 9卷 / 05期
基金
英国生物技术与生命科学研究理事会;
关键词
polecat; vertebrate; non-model organism; Illumina; chromium; Bionano; assembly; sequencing; POLECAT MUSTELA-PUTORIUS; CONSERVATION; GENOMICS; ANNOTATION; BIOLOGY;
D O I
10.1093/gigascience/giaa045
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Whilst much sequencing effort has focused on key mammalian model organisms such as mouse and human, little is known about the relationship between genome sequencing techniques for non-model mammals and genome assembly quality. This is especially relevant to non-model mammals, where the samples to be sequenced are often degraded and of low quality. A key aspect when planning a genome project is the choice of sequencing data to generate. This decision is driven by several factors, including the biological questions being asked, the quality of DNA available, and the availability of funds. Cutting-edge sequencing technologies now make it possible to achieve highly contiguous, chromosome-level genome assemblies, but rely on high-quality high molecular weight DNA. However, funding is often insufficient for many independent research groups to use these techniques. Here we use a range of different genomic technologies generated from a roadkill European polecat (Mustela putorius) to assess various assembly techniques on this low-quality sample. We evaluated different approaches for de novo assemblies and discuss their value in relation to biological analyses. Results: Generally, assemblies containing more data types achieved better scores in our ranking system. However, when accounting for misassemblies, this was not always the case for Bionano and low-coverage 10x Genomics (for scaffolding only). We also find that the extra cost associated with combining multiple data types is not necessarily associated with better genome assemblies. Conclusions: The high degree of variability between each de novo assembly method (assessed from the 7 key metrics) highlights the importance of carefully devising the sequencing strategy to be able to carry out the desired analysis. Adding more data to genome assemblies does not always result in better assemblies, so it is important to understand the nuances of genomic data integration explained here, in order to obtain cost-effective value for money when sequencing genomes.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] A scalable and memory-efficient algorithm for de novo transcriptome assembly of non-model organisms
    Sze, Sing-Hoi
    Pimsler, Meaghan L.
    Tomberlin, Jeffery K.
    Jones, Corbin D.
    Tarone, Aaron M.
    BMC GENOMICS, 2017, 18
  • [22] De novo transcriptome sequencing and comparative analysis of midgut tissues of four non-model insects pertaining to Hemiptera, Coleoptera, Diptera and Lepidoptera
    Gazara, Rajesh K.
    Cardoso, Christiane
    Bellieny-Rabelo, Daniel
    Ferreira, Clelia
    Terra, Walter R.
    Venancio, Thiago M.
    GENE, 2017, 627 : 85 - 93
  • [23] Assembly and annotation of a non-model gastropod (Nerita melanotragus) transcriptome: A comparison of de novo assemblers
    Amin S.
    Prentis P.J.
    Gilding E.K.
    Pavasovic A.
    BMC Research Notes, 7 (1)
  • [24] A scalable and memory-efficient algorithm for de novo transcriptome assembly of non-model organisms
    Sing-Hoi Sze
    Meaghan L. Pimsler
    Jeffery K. Tomberlin
    Corbin D. Jones
    Aaron M. Tarone
    BMC Genomics, 18
  • [25] A model of random sequences for de novo peptide sequencing
    Jarman, KD
    Cannon, WR
    Jarman, KH
    Heredia-Langner, A
    THIRD IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING - BIBE 2003, PROCEEDINGS, 2003, : 206 - 213
  • [26] A Primer for Single-Cell Sequencing in Non-Model Organisms
    Alfieri, James M.
    Wang, Guosong
    Jonika, Michelle M.
    Gill, Clare A.
    Blackmon, Heath
    Athrey, Giridhar N.
    GENES, 2022, 13 (02)
  • [27] Genome sequencing of bacteria: sequencing, de novo assembly and rapid analysis using open source tools
    Kisand, Veljo
    Lettieri, Teresa
    BMC GENOMICS, 2013, 14
  • [28] Long-read sequencing and de novo assembly of a Chinese genome
    Shi, Lingling
    Guo, Yunfei
    Dong, Chengliang
    Huddleston, John
    Yang, Hui
    Han, Xiaolu
    Fu, Aisi
    Li, Quan
    Li, Na
    Gong, Siyi
    Lintner, Katherine E.
    Ding, Qiong
    Wang, Zou
    Hu, Jiang
    Wang, Depeng
    Wang, Feng
    Wang, Lin
    Lyon, Gholson J.
    Guan, Yongtao
    Shen, Yufeng
    Evgrafov, Oleg V.
    Knowles, James A.
    Thibaud-Nissen, Francoise
    Schneider, Valerie
    Yu, Chack-Yung
    Zhou, Libing
    Eichler, Evan E.
    So, Kwok-Fai
    Wang, Kai
    NATURE COMMUNICATIONS, 2016, 7
  • [29] Long-read sequencing and de novo assembly of a Chinese genome
    Lingling Shi
    Yunfei Guo
    Chengliang Dong
    John Huddleston
    Hui Yang
    Xiaolu Han
    Aisi Fu
    Quan Li
    Na Li
    Siyi Gong
    Katherine E. Lintner
    Qiong Ding
    Zou Wang
    Jiang Hu
    Depeng Wang
    Feng Wang
    Lin Wang
    Gholson J. Lyon
    Yongtao Guan
    Yufeng Shen
    Oleg V. Evgrafov
    James A. Knowles
    Francoise Thibaud-Nissen
    Valerie Schneider
    Chack-Yung Yu
    Libing Zhou
    Evan E. Eichler
    Kwok-Fai So
    Kai Wang
    Nature Communications, 7
  • [30] Next generation shotgun sequencing and the challenges of de novo genome assembly
    Schlebusch, Stephen
    Illing, Nicola
    SOUTH AFRICAN JOURNAL OF SCIENCE, 2012, 108 (11-12) : 37 - 44