A comparison across non-model animals suggests an optimal sequencing depth for de novo transcriptome assembly

被引:66
|
作者
Francis, Warren R. [1 ,2 ]
Christianson, Lynne M. [1 ]
Kiko, Rainer [3 ]
Powers, Meghan L. [1 ,2 ]
Shaner, Nathan C. [4 ]
Haddock, Steven H. D. [1 ]
机构
[1] Monterey Bay Aquarium Res Inst, Moss Landing, CA 95039 USA
[2] Univ Calif Santa Cruz, Dept Ocean Sci, Santa Cruz, CA 95064 USA
[3] GEOMAR, Helmholtz Ctr Ocean Res Kiel, D-24105 Kiel, Germany
[4] Scintillon Inst, San Diego, CA 92121 USA
来源
BMC GENOMICS | 2013年 / 14卷
关键词
RNA-SEQ DATA; DIFFERENTIAL EXPRESSION; GENES; NORMALIZATION;
D O I
10.1186/1471-2164-14-167
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: The lack of genomic resources can present challenges for studies of non-model organisms. Transcriptome sequencing offers an attractive method to gather information about genes and gene expression without the need for a reference genome. However, it is unclear what sequencing depth is adequate to assemble the transcriptome de novo for these purposes. Results: We assembled transcriptomes of animals from six different phyla (Annelids, Arthropods, Chordates, Cnidarians, Ctenophores, and Molluscs) at regular increments of reads using Velvet/Oases and Trinity to determine how read count affects the assembly. This included an assembly of mouse heart reads because we could compare those against the reference genome that is available. We found qualitative differences in the assemblies of whole-animals versus tissues. With increasing reads, whole-animal assemblies show rapid increase of transcripts and discovery of conserved genes, while single-tissue assemblies show a slower discovery of conserved genes though the assembled transcripts were often longer. A deeper examination of the mouse assemblies shows that with more reads, assembly errors become more frequent but such errors can be mitigated with more stringent assembly parameters. Conclusions: These assembly trends suggest that representative assemblies are generated with as few as 20 million reads for tissue samples and 30 million reads for whole-animals for RNA-level coverage. These depths provide a good balance between coverage and noise. Beyond 60 million reads, the discovery of new genes is low and sequencing errors of highly-expressed genes are likely to accumulate. Finally, siphonophores (polymorphic Cnidarians) are an exception and possibly require alternate assembly strategies.
引用
收藏
页码:1 / 12
页数:11
相关论文
共 50 条
  • [31] De Novo Assembly of the Transcriptome of the Non-Model Plant Streptocarpus rexii Employing a Novel Heuristic to Recover Locus-Specific Transcript Clusters
    Chiara, Matteo
    Horner, David S.
    Spada, Alberto
    PLOS ONE, 2013, 8 (12):
  • [32] De novo sequencing and assembly analysis of transcriptome in the Sodom apple (Calotropis gigantea)
    Muriira, Nkatha G.
    Xu, Wei
    Muchugi, Alice
    Xu, Jianchu
    Liu, Aizhong
    BMC GENOMICS, 2015, 16
  • [33] High-Throughput Sequencing and De Novo Assembly of the Isatis indigotica Transcriptome
    Tang, Xiaoqing
    Xiao, Yunhua
    Lv, Tingting
    Wang, Fangquan
    Zhu, QianHao
    Zheng, Tianqing
    Yang, Jie
    PLOS ONE, 2014, 9 (09):
  • [34] De novo assembly of transcriptome from next-generation sequencing data
    Xuan Li
    Yimeng Kong
    QiongYi Zhao
    YuanYuan Li
    Pei Hao
    Quantitative Biology, 2016, 4 (02) : 94 - 105
  • [35] Sequencing, de novo assembly and annotation of a pink bollworm larval midgut transcriptome
    Tassone, Erica E.
    Zastrow-Hayes, Gina
    Mathis, John
    Nelson, Mark E.
    Wu, Gusui
    Flexner, J. Lindsey
    Carriere, Yves
    Tabashnik, Bruce E.
    Fabrick, Jeffrey A.
    GIGASCIENCE, 2016, 5
  • [36] De novo sequencing and assembly analysis of transcriptome in the Sodom apple (Calotropis gigantea)
    Nkatha G. Muriira
    Wei Xu
    Alice Muchugi
    Jianchu Xu
    Aizhong Liu
    BMC Genomics, 16
  • [37] A Pipeline for Non-model Organisms for de novo Transcriptome Assembly, Annotation, and Gene Ontology Analysis Using Open Tools: Case Study with Scots Pine
    Duarte, Gustavo T.
    Volkova, Polina Yu
    Geras'kin, Stanislav A.
    BIO-PROTOCOL, 2021, 11 (03):
  • [38] Transcriptome Sequencing and De Novo Assembly of Golden Cuttlefish Sepia esculenta Hoyle
    Liu, Changlin
    Zhao, Fazhen
    Yan, Jingping
    Liu, Chunsheng
    Liu, Siwei
    Chen, Siqing
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2016, 17 (10)
  • [39] MobiSeq: De novo SNP discovery in model and non-model species through sequencing the flanking region of transposable elements
    Rey-Iglesia, Alba
    Gopalakrishan, Shyam
    Caroe, Christian
    Alquezar-Planas, David E.
    Ahlmann Nielsen, Anne
    Roder, Timo
    Pedersen, Lene Bruhn
    Naesborg-Nielsen, Christina
    Sinding, Mikkel-Holger S.
    Rath, Martin Fredensborg
    Li, Zhipeng
    Petersen, Bent
    Gilbert, M. Thomas P.
    Bunce, Michael
    Mourier, Tobias
    Hansen, Anders Johannes
    MOLECULAR ECOLOGY RESOURCES, 2019, 19 (02) : 512 - 525
  • [40] Next generation sequencing and de novo transcriptome analysis of Costus pictus D. Don, a non-model plant with potent anti-diabetic properties
    Annadurai, Ramasamy S.
    Jayakumar, Vasanthan
    Mugasimangalam, Raja C.
    Katta, Mohan A. V. S. K.
    Anand, Sanchita
    Gopinathan, Sreeja
    Sarma, Santosh Prasad
    Fernandes, Sunjay Jude
    Mullapudi, Nandita
    Murugesan, S.
    Rao, Sudha Narayana
    BMC GENOMICS, 2012, 13 : 1 - 15