Evaluating the Significance of Protein Functional Similarity Based on Gene Ontology

被引:3
|
作者
Konopka, Bogumil M. [1 ]
Golda, Tomasz [1 ]
Kotulska, Malgorzata [1 ]
机构
[1] Wroclaw Univ Technol, Inst Biomed Engn & Instrumentat, PL-50370 Wroclaw, Poland
关键词
gene ontology; protein function; semantic similarity; SEMANTIC SIMILARITY; PREDICTION; TOOL;
D O I
10.1089/cmb.2014.0181
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Gene ontology is among the most successful ontologies in the biomedical domain. It is used to describe, unambiguously, protein molecular functions, cellular localizations, and processes in which proteins participate. The hierarchical structure of gene ontology allows quantifying protein functional similarity by application of algorithms that calculate semantic similarities. The scores, however, are meaningless without a given context. Here, we propose how to evaluate the significance of protein function semantic similarity scores by comparing them to reference distributions calculated for randomly chosen proteins. In the study, thresholds for significant functional semantic similarity, in four representative annotation corpuses, were estimated. We also show that the score significance is influenced by the number and specificity of gene ontology terms that are annotated to compared proteins. While proteins with a greater number of terms tend to yield higher similarity scores, proteins with more specific terms produce lower scores. The estimated significance thresholds were validated using protein sequence-function and structure-function relationships. Taking into account the term number and term specificity improves the distinction between significant and insignificant semantic similarity comparisons.
引用
收藏
页码:809 / 822
页数:14
相关论文
共 50 条
  • [41] MEGO: gene functional module expression based on gene ontology
    Tu, K
    Yu, H
    Zhu, MZ
    BIOTECHNIQUES, 2005, 38 (02) : 277 - 283
  • [42] Cluster analysis of protein array results via similarity of Gene Ontology annotation
    Wolting, Cheryl
    McGlade, C. Jane
    Tritchler, David
    BMC BIOINFORMATICS, 2006, 7 (1)
  • [43] Exploring the Application of Gene Ontology Semantic Similarity Measure for Identifying Protein Complexes
    Luo, Jiawei
    Yu, Lingyao
    Dang, Qian
    2014 11TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2014, : 502 - 507
  • [44] Assessing protein similarity with Gene Ontology and its use in subnuclear localization prediction
    Lei, Zhengdeng
    Dai, Yang
    BMC BIOINFORMATICS, 2006, 7 (1)
  • [45] Assessing protein similarity with Gene Ontology and its use in subnuclear localization prediction
    Zhengdeng Lei
    Yang Dai
    BMC Bioinformatics, 7
  • [46] Cluster analysis of protein array results via similarity of Gene Ontology annotation
    Cheryl Wolting
    C Jane McGlade
    David Tritchler
    BMC Bioinformatics, 7
  • [47] Filtering Gene Ontology semantic similarity for identifying protein complexes in large protein interaction networks
    Wang, Jian
    Xie, Dong
    Lin, Hongfei
    Yang, Zhihao
    Zhang, Yijia
    PROTEOME SCIENCE, 2012, 10
  • [48] Filtering Gene Ontology semantic similarity for identifying protein complexes in large protein interaction networks
    Jian Wang
    Dong Xie
    Hongfei Lin
    Zhihao Yang
    Yijia Zhang
    Proteome Science, 10
  • [49] Protein function classification based on gene ontology
    Park, DW
    Heo, HS
    Kwon, HC
    Chung, HY
    INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2005, 3689 : 691 - 696
  • [50] Protein Function Prediction With Functional and Topological Knowledge of Gene Ontology
    Zhao, Yingwen
    Yang, Zhihao
    Hong, Yongkai
    Yang, Yumeng
    Wang, Lei
    Zhang, Yin
    Lin, Hongfei
    Wang, Jian
    IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2023, 22 (04) : 755 - 762