A comparison across non-model animals suggests an optimal sequencing depth for de novo transcriptome assembly

被引:66
|
作者
Francis, Warren R. [1 ,2 ]
Christianson, Lynne M. [1 ]
Kiko, Rainer [3 ]
Powers, Meghan L. [1 ,2 ]
Shaner, Nathan C. [4 ]
Haddock, Steven H. D. [1 ]
机构
[1] Monterey Bay Aquarium Res Inst, Moss Landing, CA 95039 USA
[2] Univ Calif Santa Cruz, Dept Ocean Sci, Santa Cruz, CA 95064 USA
[3] GEOMAR, Helmholtz Ctr Ocean Res Kiel, D-24105 Kiel, Germany
[4] Scintillon Inst, San Diego, CA 92121 USA
来源
BMC GENOMICS | 2013年 / 14卷
关键词
RNA-SEQ DATA; DIFFERENTIAL EXPRESSION; GENES; NORMALIZATION;
D O I
10.1186/1471-2164-14-167
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: The lack of genomic resources can present challenges for studies of non-model organisms. Transcriptome sequencing offers an attractive method to gather information about genes and gene expression without the need for a reference genome. However, it is unclear what sequencing depth is adequate to assemble the transcriptome de novo for these purposes. Results: We assembled transcriptomes of animals from six different phyla (Annelids, Arthropods, Chordates, Cnidarians, Ctenophores, and Molluscs) at regular increments of reads using Velvet/Oases and Trinity to determine how read count affects the assembly. This included an assembly of mouse heart reads because we could compare those against the reference genome that is available. We found qualitative differences in the assemblies of whole-animals versus tissues. With increasing reads, whole-animal assemblies show rapid increase of transcripts and discovery of conserved genes, while single-tissue assemblies show a slower discovery of conserved genes though the assembled transcripts were often longer. A deeper examination of the mouse assemblies shows that with more reads, assembly errors become more frequent but such errors can be mitigated with more stringent assembly parameters. Conclusions: These assembly trends suggest that representative assemblies are generated with as few as 20 million reads for tissue samples and 30 million reads for whole-animals for RNA-level coverage. These depths provide a good balance between coverage and noise. Beyond 60 million reads, the discovery of new genes is low and sequencing errors of highly-expressed genes are likely to accumulate. Finally, siphonophores (polymorphic Cnidarians) are an exception and possibly require alternate assembly strategies.
引用
收藏
页码:1 / 12
页数:11
相关论文
共 50 条
  • [21] Challenges and advances for transcriptome assembly in non-model species
    Ungaro, Arnaud
    Pech, Nicolas
    Martin, Jean-Francois
    McCairns, R. J. Scott
    Mevy, Jean-Philippe
    Chappaz, Remi
    Gilles, Andre
    PLOS ONE, 2017, 12 (09):
  • [22] De Novo Sequencing and Assembly Analysis of the Pseudostellaria heterophylla Transcriptome
    Li, Jun
    Zhen, Wei
    Long, Dengkai
    Ding, Ling
    Gong, Anhui
    Xiao, Chenghong
    Jiang, Weike
    Liu, Xiaoqing
    Zhou, Tao
    Huang, Luqi
    PLOS ONE, 2016, 11 (10):
  • [23] De novo transcriptome sequencing and comparative analysis of midgut tissues of four non-model insects pertaining to Hemiptera, Coleoptera, Diptera and Lepidoptera
    Gazara, Rajesh K.
    Cardoso, Christiane
    Bellieny-Rabelo, Daniel
    Ferreira, Clelia
    Terra, Walter R.
    Venancio, Thiago M.
    GENE, 2017, 627 : 85 - 93
  • [24] Normalized Workflow to Optimize Hybrid De Novo Transcriptome Assembly for Non-Model Species: A Case Study in Lilium ledebourii (Baker) Boiss
    Sheikh-Assadi, Morteza
    Naderi, Roohangiz
    Salami, Seyed Alireza
    Kafi, Mohsen
    Fatahi, Reza
    Shariati, Vahid
    Martinelli, Federico
    Cicatelli, Angela
    Triassi, Maria
    Guarino, Francesco
    Improta, Giovanni
    Claros, Manuel Gonzalo
    PLANTS-BASEL, 2022, 11 (18):
  • [25] Using de novo genome assembly and high-throughput sequencing to characterize the MHC region in a non-model bird, the Eurasian coot
    Pikus, Ewa
    Minias, Piotr
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [26] Sequencing, de novo assembly and comparative analysis of Raphanus sativus transcriptome
    Wu, Gang
    Zhang, Libin
    Yin, Yongtai
    Wu, Jiangsheng
    Yu, Longjiang
    Zhou, Yanhong
    Li, Maoteng
    FRONTIERS IN PLANT SCIENCE, 2015, 6
  • [27] Sequencing and De Novo Assembly of the Toxicodendron radicans (Poison Ivy) Transcriptome
    Weisberg, Alexandra J.
    Kim, Gunjune
    Westwood, James H.
    Jelesko, John G.
    GENES, 2017, 8 (11)
  • [28] Using de novo genome assembly and high-throughput sequencing to characterize the MHC region in a non-model bird, the Eurasian coot
    Ewa Pikus
    Piotr Minias
    Scientific Reports, 12
  • [29] Comparative performance of transcriptome assembly methods for non-model organisms
    Xin Huang
    Xiao-Guang Chen
    Peter A. Armbruster
    BMC Genomics, 17
  • [30] Comparative performance of transcriptome assembly methods for non-model organisms
    Huang, Xin
    Chen, Xiao-Guang
    Armbruster, Peter A.
    BMC GENOMICS, 2016, 17