Multiple model species selection for transcriptomics analysis of non-model organisms

被引:5
|
作者
Pai, Tun-Wen [1 ,2 ]
Li, Kuan-Hung [1 ]
Yang, Cing-Han [1 ]
Hu, Chin-Hwa [3 ]
Lin, Han-Jia [3 ]
Wang, Wen-Der [4 ]
Chen, Yet-Ran [5 ]
机构
[1] Natl Taiwan Ocean Univ, Dept Comp Sci & Engn, Keelung, Taiwan
[2] Natl Taipei Univ Technol, Dept Comp Sci & Informat Engn, Taipei, Taiwan
[3] Natl Taiwan Ocean Univ, Dept Biosci & Biotechnol, Keelung, Taiwan
[4] Natl Chiayi Univ, Dept Bioagr Sci, Chiayi, Taiwan
[5] Acad Sinica, Agr Biotechnol Res Ctr, Taipei, Taiwan
来源
BMC BIOINFORMATICS | 2018年 / 19卷
关键词
RNA-seq; Reference model species; Differential expression analysis; Ultra-conserved orthologous gene; Gene ontology; Biological pathway; RNA-SEQ; CELLS; GENES; TOOL;
D O I
10.1186/s12859-018-2278-z
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Transcriptomic sequencing (RNA-seq) related applications allow for rapid explorations due to their high-throughput and relatively fast experimental capabilities, providing unprecedented progress in gene functional annotation, gene regulation analysis, and environmental factor verification. However, with increasing amounts of sequenced reads and reference model species, the selection of appropriate reference species for gene annotation has become a new challenge. Methods: We proposed a novel approach for finding the most effective reference model species through taxonomic associations and ultra-conserved orthologous (UCO) gene comparisons among species. An online system for multiple species selection (MSS) for RNA-seq differential expression analysis was developed, and comprehensive genomic annotations from 291 reference model eukaryotic species were retrieved from the RefSeq, KEGG, and UniProt databases. Results: Using the proposed MSS pipeline, gene ontology and biological pathway enrichment analysis can be efficiently achieved, especially in the case of transcriptomic analysis of non-model organisms. The results showed that the proposed method solved problems related to limitations in annotation information and provided a roughly twenty-fold reduction in computational time, resulting in more accurate results than those of traditional approaches of using a single model reference species or the large non-redundant reference database. Conclusions: Selection of appropriate reference model species helps to reduce missing annotation information, allowing for more comprehensive results than those obtained with a single model reference species. In addition, adequate model species selection reduces the computational time significantly while retaining the same order of accuracy. The proposed system indeed provides superior performance by selecting appropriate multiple species for transcriptomic analysis compared to traditional approaches.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] CRISPR in non-model organisms through ReMOT Control
    Rasgon, Jason
    [J]. INTEGRATIVE AND COMPARATIVE BIOLOGY, 2023, 63 : S376 - S376
  • [22] Amplified fragment length homoplasy: in silico analysis for model and non-model species
    Paris, Margot
    Bonnes, Benjamin
    Ficetola, Gentile Francesco
    Poncet, Benedicte N.
    Despres, Laurence
    [J]. BMC GENOMICS, 2010, 11
  • [23] Amplified fragment length homoplasy: in silico analysis for model and non-model species
    Margot Paris
    Benjamin Bonnes
    Gentile Francesco Ficetola
    Bénédicte N Poncet
    Laurence Després
    [J]. BMC Genomics, 11
  • [24] Issues with RNA-seq analysis in non-model organisms: A salmonid example
    Sundaram, Arvind
    Tengs, Torstein
    Grimholt, Unni
    [J]. DEVELOPMENTAL AND COMPARATIVE IMMUNOLOGY, 2017, 75 : 38 - 47
  • [25] Cell classification of non-model organisms challenged by the approach of the single cell analysis
    Koiwai, Keiichiro
    [J]. NIPPON SUISAN GAKKAISHI, 2022, 88 (06) : 540 - 540
  • [26] Homeolog expression analysis in an allotriploid non-model crop via integration of transcriptomics and proteomics
    van Wesemael, Jelle
    Hueber, Yann
    Kissel, Ewaut
    Campos, Nadia
    Swennen, Rony
    Carpentier, Sebastien
    [J]. SCIENTIFIC REPORTS, 2018, 8
  • [27] Homeolog expression analysis in an allotriploid non-model crop via integration of transcriptomics and proteomics
    Jelle van Wesemael
    Yann Hueber
    Ewaut Kissel
    Nádia Campos
    Rony Swennen
    Sebastien Carpentier
    [J]. Scientific Reports, 8
  • [28] Detecting signatures of positive selection in non-model species using genomic data
    Weigand, Hannah
    Leese, Florian
    [J]. ZOOLOGICAL JOURNAL OF THE LINNEAN SOCIETY, 2018, 184 (02) : 528 - 583
  • [29] The proteomic response of model and non-model marine organisms to thermal and oxidative stress
    Tomanek, L.
    [J]. COMPARATIVE BIOCHEMISTRY AND PHYSIOLOGY A-MOLECULAR & INTEGRATIVE PHYSIOLOGY, 2009, 154 (01): : S7 - S7
  • [30] TEENA: an integrated web server for transposable element enrichment analysis in various model and non-model organisms
    Li, Yuzhuo
    Lyu, Renzhe
    Chen, Shuai
    Wang, Yejun
    Sun, Ming-an
    [J]. NUCLEIC ACIDS RESEARCH, 2024, 52 (W1) : W126 - W131