Sequencing smart: De novo sequencing and assembly approaches for a non-model mammal

被引:12
|
作者
Etherington, Graham J. [1 ]
Heavens, Darren [1 ]
Baker, David [1 ]
Lister, Ashleigh [1 ]
McNelly, Rose [1 ]
Garcia, Gonzalo [1 ]
Clavijo, Bernardo [1 ]
Macaulay, Iain [1 ]
Haerty, Wilfried [1 ]
Di Palma, Federica [1 ]
机构
[1] Norwich Res Pk, Earlham Inst, Norwich NR4 7UZ, Norfolk, England
来源
GIGASCIENCE | 2020年 / 9卷 / 05期
基金
英国生物技术与生命科学研究理事会;
关键词
polecat; vertebrate; non-model organism; Illumina; chromium; Bionano; assembly; sequencing; POLECAT MUSTELA-PUTORIUS; CONSERVATION; GENOMICS; ANNOTATION; BIOLOGY;
D O I
10.1093/gigascience/giaa045
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Whilst much sequencing effort has focused on key mammalian model organisms such as mouse and human, little is known about the relationship between genome sequencing techniques for non-model mammals and genome assembly quality. This is especially relevant to non-model mammals, where the samples to be sequenced are often degraded and of low quality. A key aspect when planning a genome project is the choice of sequencing data to generate. This decision is driven by several factors, including the biological questions being asked, the quality of DNA available, and the availability of funds. Cutting-edge sequencing technologies now make it possible to achieve highly contiguous, chromosome-level genome assemblies, but rely on high-quality high molecular weight DNA. However, funding is often insufficient for many independent research groups to use these techniques. Here we use a range of different genomic technologies generated from a roadkill European polecat (Mustela putorius) to assess various assembly techniques on this low-quality sample. We evaluated different approaches for de novo assemblies and discuss their value in relation to biological analyses. Results: Generally, assemblies containing more data types achieved better scores in our ranking system. However, when accounting for misassemblies, this was not always the case for Bionano and low-coverage 10x Genomics (for scaffolding only). We also find that the extra cost associated with combining multiple data types is not necessarily associated with better genome assemblies. Conclusions: The high degree of variability between each de novo assembly method (assessed from the 7 key metrics) highlights the importance of carefully devising the sequencing strategy to be able to carry out the desired analysis. Adding more data to genome assemblies does not always result in better assemblies, so it is important to understand the nuances of genomic data integration explained here, in order to obtain cost-effective value for money when sequencing genomes.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] De novo sequencing, assembly and functional annotation of Armillaria borealis genome
    Vasilina S. Akulova
    Vadim V. Sharov
    Anastasiya I. Aksyonova
    Yuliya A. Putintseva
    Natalya V. Oreshkova
    Sergey I. Feranchuk
    Dmitry A. Kuzmin
    Igor N. Pavlov
    Yulia A. Litovka
    Konstantin V. Krutovsky
    BMC Genomics, 21
  • [42] Sequencing and de novo assembly of a near complete indica rice genome
    Du, Huilong
    Yu, Ying
    Ma, Yanfei
    Gao, Qiang
    Cao, Yinghao
    Chen, Zhuo
    Ma, Bin
    Qi, Ming
    Li, Yan
    Zhao, Xianfeng
    Wang, Jing
    Liu, Kunfan
    Qin, Peng
    Yang, Xin
    Zhu, Lihuang
    Li, Shigui
    Liang, Chengzhi
    NATURE COMMUNICATIONS, 2017, 8
  • [43] High quality de novo sequencing and assembly of the Saccharomyces arboricolus genome
    Liti, Gianni
    Ba, Alex N. Nguyen
    Blythe, Martin
    Mueller, Carolin A.
    Bergstroem, Anders
    Cubillos, Francisco A.
    Dafhnis-Calas, Felix
    Khoshraftar, Shima
    Malla, Sunir
    Mehta, Neel
    Siow, Cheuk C.
    Warringer, Jonas
    Moses, Alan M.
    Louis, Edward J.
    Nieduszynski, Conrad A.
    BMC GENOMICS, 2013, 14
  • [44] Comparing de novo transcriptome assembly tools in di- and autotetraploid non-model plant species
    Silvia Madritsch
    Agnes Burg
    Eva M. Sehr
    BMC Bioinformatics, 22
  • [45] Comparing de novo transcriptome assembly tools in di- and autotetraploid non-model plant species
    Madritsch, Silvia
    Burg, Agnes
    Sehr, Eva M.
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [46] Enhancement of de novo sequencing, assembly and annotation of the Mongolian gerbil genome with transcriptome sequencing and assembly from several different tissues
    Shifeng Cheng
    Yuan Fu
    Yaolei Zhang
    Wenfei Xian
    Hongli Wang
    Benedikt Grothe
    Xin Liu
    Xun Xu
    Achim Klug
    Elizabeth A. McCullagh
    BMC Genomics, 20
  • [47] Enhancement of de novo sequencing, assembly and annotation of the Mongolian gerbil genome with transcriptome sequencing and assembly from several different tissues
    Cheng, Shifeng
    Fu, Yuan
    Zhang, Yaolei
    Xian, Wenfei
    Wang, Hongli
    Grothe, Benedikt
    Liu, Xin
    Xu, Xun
    Klug, Achim
    McCullagh, Elizabeth A.
    BMC GENOMICS, 2019, 20 (01)
  • [48] Application of next-generation sequencing to the study of non-model insects
    Wachi, Nakatada
    Matsubayashi, Kei W.
    Maeto, Kaoru
    ENTOMOLOGICAL SCIENCE, 2018, 21 (01) : 3 - 11
  • [49] De Novo Mass Spectrometry Peptide Sequencing with a Transformer Model
    Yilmaz, Melih
    Fondrie, William E.
    Bittremieux, Wout
    Oh, Sewoong
    Noble, William Stafford
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [50] NovoHMM: A hidden Markov model for de novo peptide sequencing
    Fischer, B
    Roth, V
    Roos, F
    Grossmann, J
    Baginsky, S
    Widmayer, P
    Gruissem, W
    Buhmann, JM
    ANALYTICAL CHEMISTRY, 2005, 77 (22) : 7265 - 7273