Transcriptome deep-sequencing and clustering of expressed isoforms from Favia corals

Cited by: 18
Authors
Mehr, Shaadi F. Pooyaei [1, 2]
DeSalle, Rob [2]
Kao, Hung-Teh [3]
Narechania, Apurva [2]
Han, Zhou [4]
Tchernov, Dan [5]
Pieribone, Vincent [4]
Gruber, David F. [1, 2, 6]
Affiliations
[1] CUNY, Grad Ctr, New York, NY 10065 USA
[2] Amer Museum Nat Hist, Sackler Inst Comparat Genom, New York, NY 10024 USA
[3] Brown Univ, Warren Alpert Med Sch, Dept Psychiat & Human Behav, Div Biol & Med, Providence, RI 02912 USA
[4] Yale Univ, John B Pierce Lab, New Haven, CT 06519 USA
[5] Univ Haifa, Leon H Charney Sch Marine Sci, Dept Marine Biol, IL-31905 Haifa, Israel
[6] CUNY, Baruch Coll, Dept Nat Sci, New York, NY 10010 USA
Source
BMC GENOMICS | 2013 / Vol. 14
Funding
National Science Foundation (USA)
Keywords
K-mer; Contig; Open reading frame; Fluorescent protein; Blast; Clustering; High-throughput sequencing; Illumina paired-end; Coral; GREEN FLUORESCENT PROTEINS; RNA-SEQ; SCLERACTINIAN CORALS; DNA-SEQUENCES; ALIGNMENT; GENOME; PHYLOGENY; EVOLUTION; RESPONSES; SELECTION;
DOI
10.1186/1471-2164-14-546
Chinese Library Classification number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification codes
071005; 0836; 090102; 100705
Abstract
Background: Genomic and transcriptomic sequence data are essential tools for tackling ecological problems. Using an approach that combines next-generation sequencing, de novo transcriptome assembly, gene annotation and synthetic gene construction, we identify and cluster the protein families from Favia corals from the northern Red Sea.
Results: We obtained 80 million 75 bp paired-end cDNA reads on the Illumina GA platform from two adult Favia samples (Fav1, Fav2) collected at a depth of 65 m, and generated two de novo assemblies using ABySS and CAP3. After removing redundancy and filtering out low-quality reads, our transcriptome datasets contained 58,268 (Fav1) and 62,469 (Fav2) contigs longer than 100 bp, with N50 values of 1,665 bp and 1,439 bp, respectively. Using the proteome of the sea anemone Nematostella vectensis as a reference, we were able to annotate almost 20% of each dataset using reciprocal homology searches. Homologous clustering of these annotated transcripts allowed us to divide them into 7,186 (Fav1) and 6,862 (Fav2) homologous transcript clusters (E-value ≤ 2e-30). Functional annotation categories were assigned to homologous clusters using the functional annotation of Nematostella vectensis. General annotation of the assembled transcripts was improved by 1-3% using the Acropora digitifera proteome. In addition, we screened these transcript isoform clusters for fluorescent protein (FP) homologs and identified seven potential FP homologs in Fav1 and four in Fav2. These transcripts were validated as bona fide FP transcripts by robust fluorescence upon heterologous expression. Annotation of the assembled contigs revealed that 1.34% (Fav1) and 1.61% (Fav2) of the total assembled contigs likely originated from the corals' algal symbiont, Symbiodinium spp.
Conclusions: Here we present a study to identify the homologous transcript isoform clusters from the transcriptome of Favia corals using a distantly related reference proteome. Furthermore, the symbiont-derived transcripts were isolated from the datasets and their contribution quantified. This is the first annotated transcriptome of the genus Favia, a major increase in the genomic resources available for this important family of corals.
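The N50 values quoted in the abstract summarize assembly contiguity: N50 is the contig length at which contigs of that length or longer hold at least half of the assembled bases. The following is a minimal sketch of that calculation, not the authors' pipeline; the function name `n50` and the `min_length` parameter are assumptions for illustration, with the default chosen to mirror the "longer than 100 bp" filter mentioned above.

```python
def n50(lengths, min_length=100):
    """Return the N50 of the contigs longer than min_length (hypothetical helper)."""
    # Keep only contigs strictly longer than the cutoff, longest first.
    kept = sorted((n for n in lengths if n > min_length), reverse=True)
    half_total = sum(kept) / 2
    running = 0
    for length in kept:
        running += length
        # The first contig that pushes the running sum to half the total is the N50.
        if running >= half_total:
            return length
    raise ValueError("no contigs pass the length filter")


if __name__ == "__main__":
    # Toy contig lengths in bp, not data from the study.
    print(n50([50, 150, 200, 300, 1000]))
```

Sorting in descending order means the loop stops at the shortest contig still needed to cover half of the filtered assembly, which is the usual definition of N50.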
Pages: 13