A comparison across non-model animals suggests an optimal sequencing depth for de novo transcriptome assembly

Cited by: 66
Authors
Francis, Warren R. [1, 2]
Christianson, Lynne M. [1 ]
Kiko, Rainer [3 ]
Powers, Meghan L. [1, 2]
Shaner, Nathan C. [4 ]
Haddock, Steven H. D. [1 ]
Affiliations
[1] Monterey Bay Aquarium Res Inst, Moss Landing, CA 95039 USA
[2] Univ Calif Santa Cruz, Dept Ocean Sci, Santa Cruz, CA 95064 USA
[3] GEOMAR, Helmholtz Ctr Ocean Res Kiel, D-24105 Kiel, Germany
[4] Scintillon Inst, San Diego, CA 92121 USA
Source
BMC GENOMICS | 2013 / Vol. 14
Keywords
RNA-SEQ DATA; DIFFERENTIAL EXPRESSION; GENES; NORMALIZATION
DOI
10.1186/1471-2164-14-167
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification code
071005; 0836; 090102; 100705
Abstract
Background: The lack of genomic resources can present challenges for studies of non-model organisms. Transcriptome sequencing offers an attractive method to gather information about genes and gene expression without the need for a reference genome. However, it is unclear what sequencing depth is adequate to assemble the transcriptome de novo for these purposes.

Results: We assembled transcriptomes of animals from six different phyla (Annelids, Arthropods, Chordates, Cnidarians, Ctenophores, and Molluscs) at regular increments of reads using Velvet/Oases and Trinity to determine how read count affects the assembly. This included an assembly of mouse heart reads because we could compare those against the reference genome that is available. We found qualitative differences in the assemblies of whole-animals versus tissues. With increasing reads, whole-animal assemblies show rapid increase of transcripts and discovery of conserved genes, while single-tissue assemblies show a slower discovery of conserved genes though the assembled transcripts were often longer. A deeper examination of the mouse assemblies shows that with more reads, assembly errors become more frequent but such errors can be mitigated with more stringent assembly parameters.

Conclusions: These assembly trends suggest that representative assemblies are generated with as few as 20 million reads for tissue samples and 30 million reads for whole-animals for RNA-level coverage. These depths provide a good balance between coverage and noise. Beyond 60 million reads, the discovery of new genes is low and sequencing errors of highly-expressed genes are likely to accumulate. Finally, siphonophores (polymorphic Cnidarians) are an exception and possibly require alternate assembly strategies.
Pages: 1 / 12
Page count: 11