Multiple model species selection for transcriptomics analysis of non-model organisms

Cited by: 5
Authors
Pai, Tun-Wen [1 ,2 ]
Li, Kuan-Hung [1 ]
Yang, Cing-Han [1 ]
Hu, Chin-Hwa [3 ]
Lin, Han-Jia [3 ]
Wang, Wen-Der [4 ]
Chen, Yet-Ran [5 ]
Affiliations
[1] Natl Taiwan Ocean Univ, Dept Comp Sci & Engn, Keelung, Taiwan
[2] Natl Taipei Univ Technol, Dept Comp Sci & Informat Engn, Taipei, Taiwan
[3] Natl Taiwan Ocean Univ, Dept Biosci & Biotechnol, Keelung, Taiwan
[4] Natl Chiayi Univ, Dept Bioagr Sci, Chiayi, Taiwan
[5] Acad Sinica, Agr Biotechnol Res Ctr, Taipei, Taiwan
Source
BMC BIOINFORMATICS | 2018 / Vol. 19
Keywords
RNA-seq; Reference model species; Differential expression analysis; Ultra-conserved orthologous gene; Gene ontology; Biological pathway; RNA-SEQ; CELLS; GENES; TOOL;
DOI
10.1186/s12859-018-2278-z
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: Transcriptomic sequencing (RNA-seq) related applications allow for rapid explorations due to their high-throughput and relatively fast experimental capabilities, providing unprecedented progress in gene functional annotation, gene regulation analysis, and environmental factor verification. However, with increasing amounts of sequenced reads and reference model species, the selection of appropriate reference species for gene annotation has become a new challenge.
Methods: We proposed a novel approach for finding the most effective reference model species through taxonomic associations and ultra-conserved orthologous (UCO) gene comparisons among species. An online system for multiple species selection (MSS) for RNA-seq differential expression analysis was developed, and comprehensive genomic annotations from 291 reference model eukaryotic species were retrieved from the RefSeq, KEGG, and UniProt databases.
Results: Using the proposed MSS pipeline, gene ontology and biological pathway enrichment analysis can be efficiently achieved, especially in the case of transcriptomic analysis of non-model organisms. The results showed that the proposed method solved problems related to limitations in annotation information and provided a roughly twenty-fold reduction in computational time, resulting in more accurate results than those of traditional approaches of using a single model reference species or the large non-redundant reference database.
Conclusions: Selection of appropriate reference model species helps to reduce missing annotation information, allowing for more comprehensive results than those obtained with a single model reference species. In addition, adequate model species selection reduces the computational time significantly while retaining the same order of accuracy. The proposed system indeed provides superior performance by selecting appropriate multiple species for transcriptomic analysis compared to traditional approaches.
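The abstract does not give the scoring details, so the following is only a minimal sketch of the general idea of ranking candidate reference species by combining UCO gene similarity with taxonomic closeness. The function name `rank_reference_species`, the equal weighting, the linear scoring formula, and all numeric values are hypothetical illustrations, not the authors' MSS implementation or measured data.

```python
from statistics import mean


def rank_reference_species(uco_identity, taxonomic_distance, top_k=3, weight=0.5):
    """Rank candidate reference species for a non-model organism.

    uco_identity: {species: {uco_gene_id: identity in [0, 1]}}
    taxonomic_distance: {species: non-negative distance to the query organism}
    The weighted-sum score is an assumed formula for illustration only.
    """
    # Normalise distances so the most distant candidate gets closeness 0.
    max_dist = max(taxonomic_distance.values()) or 1
    scores = {}
    for species, identities in uco_identity.items():
        if not identities:
            continue  # no UCO genes compared, so the species cannot be scored
        similarity = mean(identities.values())
        closeness = 1 - taxonomic_distance[species] / max_dist
        scores[species] = weight * similarity + (1 - weight) * closeness
    # Highest combined score first; keep only the top_k species.
    ranked = sorted(scores, key=scores.get, reverse=True)
    return ranked[:top_k]


# Toy, made-up inputs (not results from the paper).
uco_identity = {
    "Danio rerio": {"uco1": 0.92, "uco2": 0.88},
    "Homo sapiens": {"uco1": 0.70, "uco2": 0.65},
    "Drosophila melanogaster": {"uco1": 0.45, "uco2": 0.50},
}
taxonomic_distance = {
    "Danio rerio": 2,
    "Homo sapiens": 5,
    "Drosophila melanogaster": 9,
}

selected = rank_reference_species(uco_identity, taxonomic_distance, top_k=2)
```

Restricting annotation to a few top-ranked species, rather than a single model species or the full non-redundant database, is the trade-off the abstract describes: broader annotation coverage than one species, with far less search time than the full database.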
Pages: 14