Integrating phenotype and gene expression data for predicting gene function

Cited by: 6
Authors
Malone, Brandon M. [1, 2]
Perkins, Andy D. [1, 2]
Bridges, Susan M. [1, 2]
Affiliations
[1] Mississippi State Univ, Dept Comp Sci & Engn, Mississippi State, MS 39762 USA
[2] Mississippi State Univ, Inst Digital Biol, Mississippi State, MS 39762 USA
Source
BMC BIOINFORMATICS | 2009 / Vol. 10
Keywords
ONTOLOGY ANNOTATION; PHENOMICDB
DOI
10.1186/1471-2105-10-S11-S20
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: This paper presents a framework for integrating disparate data sets to predict gene function. The algorithm constructs a graph, called an integrated similarity graph, by computing similarities based upon both gene expression and textual phenotype data. This integrated graph is then used to make predictions about whether individual genes should be assigned a particular annotation from the Gene Ontology.
Results: A combined graph was generated from publicly-available gene expression data and phenotypic information from Saccharomyces cerevisiae. This graph was used to assign annotations to genes, as were graphs constructed from gene expression data and textual phenotype information alone. While the F-measure appeared similar for all three methods, annotations based upon the integrated similarity graph exhibited a better overall precision than gene expression or phenotype information alone can generate. The integrated approach was also able to assign almost as many annotations as the gene expression method alone, and generated significantly more total and correct assignments than the phenotype information could provide.
Conclusion: These results suggest that augmenting standard gene expression data sets with publicly-available textual phenotype data can help generate more precise functional annotation predictions while mitigating the weaknesses of a standard textual phenotype approach.
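The abstract does not state which similarity measures, combination rule, or prediction rule the authors used, so the following is only a minimal sketch of the general idea under assumed choices: Pearson correlation for expression profiles, bag-of-words cosine similarity for phenotype text, a weighted average to combine them, and a weighted neighbor vote to predict an annotation. Every function name, parameter (`weight`, `threshold`, `min_support`), and data value below is hypothetical and not taken from the paper.

```python
import math
from collections import Counter


def pearson(x, y):
    """Pearson correlation of two equal-length expression profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else 0.0


def cosine(text_a, text_b):
    """Cosine similarity of two phenotype descriptions (bag of words)."""
    a, b = Counter(text_a.lower().split()), Counter(text_b.lower().split())
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def integrated_graph(expression, phenotype, weight=0.5, threshold=0.5):
    """Build a weighted graph; an edge exists when combined similarity >= threshold.

    Assumption: the two similarities are combined by a weighted average and
    negative correlations are clipped to zero.
    """
    genes = sorted(expression)
    graph = {g: {} for g in genes}
    for i, g in enumerate(genes):
        for h in genes[i + 1:]:
            expr_sim = max(pearson(expression[g], expression[h]), 0.0)
            text_sim = cosine(phenotype.get(g, ""), phenotype.get(h, ""))
            sim = weight * expr_sim + (1 - weight) * text_sim
            if sim >= threshold:
                graph[g][h] = sim
                graph[h][g] = sim
    return graph


def predict(graph, annotated, gene, min_support=0.5):
    """Assign the annotation if enough edge weight points at annotated neighbors."""
    neighbors = graph[gene]
    total = sum(neighbors.values())
    if total == 0:
        return False  # isolated gene: no evidence either way
    support = sum(w for n, w in neighbors.items() if n in annotated)
    return support / total >= min_support


def f_measure(predicted, actual):
    """Return (precision, recall, F-measure) for two sets of genes."""
    tp = len(predicted & actual)
    precision = tp / len(predicted) if predicted else 0.0
    recall = tp / len(actual) if actual else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    return precision, recall, 2 * precision * recall / (precision + recall)


# Toy data, invented purely for illustration.
expression = {
    "g1": [1, 2, 3, 4],
    "g2": [2, 4, 6, 8],
    "g3": [4, 3, 2, 1],
    "g4": [1, 3, 2, 4],
}
phenotype = {
    "g1": "slow growth heat sensitive",
    "g2": "slow growth heat sensitive",
    "g3": "abnormal budding",
    "g4": "heat sensitive",
}

graph = integrated_graph(expression, phenotype)
print(predict(graph, {"g1", "g2"}, "g4"))  # g4 is linked only to annotated genes
print(predict(graph, {"g1", "g2"}, "g3"))  # g3 has no neighbors in the graph
print(f_measure({"g1", "g4"}, {"g1", "g2"}))
```

Setting `weight` to 1.0 or 0.0 reduces the sketch to an expression-only or phenotype-only graph, which mirrors the three-way comparison the abstract describes, though the paper's actual construction may differ.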
Pages: 12