Sequencing smart: De novo sequencing and assembly approaches for a non-model mammal

被引:12
|
作者
Etherington, Graham J. [1 ]
Heavens, Darren [1 ]
Baker, David [1 ]
Lister, Ashleigh [1 ]
McNelly, Rose [1 ]
Garcia, Gonzalo [1 ]
Clavijo, Bernardo [1 ]
Macaulay, Iain [1 ]
Haerty, Wilfried [1 ]
Di Palma, Federica [1 ]
机构
[1] Norwich Res Pk, Earlham Inst, Norwich NR4 7UZ, Norfolk, England
来源
GIGASCIENCE | 2020年 / 9卷 / 05期
基金
英国生物技术与生命科学研究理事会;
关键词
polecat; vertebrate; non-model organism; Illumina; chromium; Bionano; assembly; sequencing; POLECAT MUSTELA-PUTORIUS; CONSERVATION; GENOMICS; ANNOTATION; BIOLOGY;
D O I
10.1093/gigascience/giaa045
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Whilst much sequencing effort has focused on key mammalian model organisms such as mouse and human, little is known about the relationship between genome sequencing techniques for non-model mammals and genome assembly quality. This is especially relevant to non-model mammals, where the samples to be sequenced are often degraded and of low quality. A key aspect when planning a genome project is the choice of sequencing data to generate. This decision is driven by several factors, including the biological questions being asked, the quality of DNA available, and the availability of funds. Cutting-edge sequencing technologies now make it possible to achieve highly contiguous, chromosome-level genome assemblies, but rely on high-quality high molecular weight DNA. However, funding is often insufficient for many independent research groups to use these techniques. Here we use a range of different genomic technologies generated from a roadkill European polecat (Mustela putorius) to assess various assembly techniques on this low-quality sample. We evaluated different approaches for de novo assemblies and discuss their value in relation to biological analyses. Results: Generally, assemblies containing more data types achieved better scores in our ranking system. However, when accounting for misassemblies, this was not always the case for Bionano and low-coverage 10x Genomics (for scaffolding only). We also find that the extra cost associated with combining multiple data types is not necessarily associated with better genome assemblies. Conclusions: The high degree of variability between each de novo assembly method (assessed from the 7 key metrics) highlights the importance of carefully devising the sequencing strategy to be able to carry out the desired analysis. Adding more data to genome assemblies does not always result in better assemblies, so it is important to understand the nuances of genomic data integration explained here, in order to obtain cost-effective value for money when sequencing genomes.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Sequencing and de novo assembly of a near complete indica rice genome
    Huilong Du
    Ying Yu
    Yanfei Ma
    Qiang Gao
    Yinghao Cao
    Zhuo Chen
    Bin Ma
    Ming Qi
    Yan Li
    Xianfeng Zhao
    Jing Wang
    Kunfan Liu
    Peng Qin
    Xin Yang
    Lihuang Zhu
    Shigui Li
    Chengzhi Liang
    Nature Communications, 8
  • [32] A survey on de novo assembly methods for single-molecular sequencing
    Chen, Ying
    Xiao, Chuan-Le
    QUANTITATIVE BIOLOGY, 2020, 8 (03) : 203 - 215
  • [33] Sequencing, de novo assembly and comparative analysis of Raphanus sativus transcriptome
    Wu, Gang
    Zhang, Libin
    Yin, Yongtai
    Wu, Jiangsheng
    Yu, Longjiang
    Zhou, Yanhong
    Li, Maoteng
    FRONTIERS IN PLANT SCIENCE, 2015, 6
  • [34] The effect of variant interference on de novo assembly for viral deep sequencing
    Castro, Christina J.
    Marine, Rachel L.
    Ramos, Edward
    Ng, Terry Fei Fan
    BMC GENOMICS, 2020, 21 (01)
  • [35] The effect of variant interference on de novo assembly for viral deep sequencing
    Christina J. Castro
    Rachel L. Marine
    Edward Ramos
    Terry Fei Fan Ng
    BMC Genomics, 21
  • [36] High quality de novo sequencing and assembly of the Saccharomyces arboricolus genome
    Gianni Liti
    Alex N Nguyen Ba
    Martin Blythe
    Carolin A Müller
    Anders Bergström
    Francisco A Cubillos
    Felix Dafhnis-Calas
    Shima Khoshraftar
    Sunir Malla
    Neel Mehta
    Cheuk C Siow
    Jonas Warringer
    Alan M Moses
    Edward J Louis
    Conrad A Nieduszynski
    BMC Genomics, 14
  • [37] Sequencing and De Novo Assembly of the Toxicodendron radicans (Poison Ivy) Transcriptome
    Weisberg, Alexandra J.
    Kim, Gunjune
    Westwood, James H.
    Jelesko, John G.
    GENES, 2017, 8 (11)
  • [38] De novo sequencing, assembly and functional annotation of Armillaria borealis genome
    Akulova, Vasilina S.
    Sharov, Vadim V.
    Aksyonova, Anastasiya I.
    Putintseva, Yuliya A.
    Oreshkova, Natalya V.
    Feranchuk, Sergey I.
    Kuzmin, Dmitry A.
    Pavlov, Igor N.
    Litovka, Yulia A.
    Krutovsky, Konstantin V.
    BMC GENOMICS, 2020, 21 (Suppl 7)
  • [39] NEXT-GENERATION DNA SEQUENCING FOR DE NOVO GENOME ASSEMBLY
    Hiatt, J.
    Turner, E.
    Patwardhan, R.
    Lee, C.
    Shendure, J.
    JOURNAL OF INVESTIGATIVE MEDICINE, 2009, 57 (01) : 99 - 100
  • [40] A survey on de novo assembly methods for single-molecular sequencing
    Ying Chen
    Chuan-Le Xiao
    Quantitative Biology, 2020, 8 (03) : 203 - 215