Gene function prediction using protein domain probability and hierarchical Gene Ontology information

Cited by: 0
Authors
Jung, Jaehee [1]
Thon, Michael R. [2]
Affiliations
[1] Texas A&M Univ, Dept Comp Sci, College Stn, TX 77843 USA
[2] Univ Salamanca, Dept Gen Microbiol, E-37008 Salamanca, Spain
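Funding
US Department of Agriculture
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract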
The Gene Ontology (GO) is a controlled vocabulary of terms that describe protein functions. It also describes the relationships among those terms hierarchically, in the form of a directed acyclic graph (DAG). Several systems have been developed that use pattern recognition to assign gene function from a variety of features, including sequence similarity, the presence of protein functional domains, and gene expression patterns, but most of these approaches have not considered the hierarchical structure of the GO. Because the DAG represents the functional relationships between GO terms, it should be an important component of an automated annotation system. We propose a Bayesian multi-label classifier that incorporates the relationships among GO terms found in the GO DAG. A comparison of our method with previously described annotation systems shows that it provides improved annotation accuracy when performance is compared on individual GO terms. More importantly, our method assigns significantly more GO terms to more proteins than was previously possible.
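To make the role of the DAG concrete, here is a minimal Python sketch of one common way to keep per-term scores consistent with a term hierarchy: every ancestor receives at least the score of its highest-scoring descendant. This is not the authors' Bayesian classifier, which the abstract does not specify in detail. The term names, the `PARENTS` mapping, and the `ancestors` and `propagate` helpers are hypothetical and exist only for illustration.

```python
# Hypothetical toy DAG (not real GO data): each term maps to its direct parents.
# "nuclease" has two parents, which is what makes this a DAG rather than a tree.
PARENTS = {
    "root": (),
    "binding": ("root",),
    "catalysis": ("root",),
    "dna_binding": ("binding",),
    "nuclease": ("catalysis", "dna_binding"),
}


def ancestors(term, parents=PARENTS):
    """Return every term reachable from `term` by following parent links."""
    seen = set()
    stack = list(parents[term])
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(parents[current])
    return seen


def propagate(scores, parents=PARENTS):
    """Raise each ancestor's score to at least that of any scored descendant.

    Assumption: a protein annotated with a term is implicitly annotated
    with all of that term's ancestors, so an ancestor should never score
    lower than one of its descendants.
    """
    out = {term: scores.get(term, 0.0) for term in parents}
    for term, score in scores.items():
        for anc in ancestors(term, parents):
            out[anc] = max(out[anc], score)
    return out


if __name__ == "__main__":
    raw = {"nuclease": 0.8, "binding": 0.3}
    for term, score in sorted(propagate(raw).items()):
        print(term, score)
```

In this toy case a confident prediction for the most specific term lifts the scores of all four of its ancestors, so the final set of predicted terms respects the hierarchy instead of treating each term as an independent label.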
Pages: 2161-2164
Number of pages: 4