A New Path Based Hybrid Measure for Gene Ontology Similarity

被引:20
|
作者
Bandyopadhyay, Sanghamitra [1 ]
Mallick, Koushik [2 ]
机构
[1] Indian Stat Inst, Machine Intelligence Unit, Kolkata 700108, W Bengal, India
[2] RCC Inst Informat Technol, CSE Dept, Kolkata 700015, W Bengal, India
关键词
Gene ontology similarity; semantic similarity; term similarity; information content; protein interaction prediction; functional classification of genes; microRNA; SEMANTIC SIMILARITY; PROTEIN-INTERACTION; SACCHAROMYCES-CEREVISIAE; FUNCTIONAL SIMILARITY; R PACKAGE; DATABASE; GO; SEQUENCE; NETWORK; TOOLS;
D O I
10.1109/TCBB.2013.149
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Gene Ontology (GO) consists of a controlled vocabulary of terms, annotating a gene or gene product, structured in a directed acyclic graph. In the graph, semantic relations connect the terms, that represent the knowledge of functional description and cellular component information of gene products. GO similarity gives us a numerical representation of biological relationship between a gene set, which can be used to infer various biological facts such as protein interaction, structural similarity, gene clustering, etc. Here we introduce a new shortest path based hybrid measure of ontological similarity between two terms which combines both structure of the GO graph and information content of the terms. Here the similarity between two terms t(1) and t(2), referred to as GOSim(PBHM)(t(1), t(2)), has two components; one obtained from the common ancestors of t(1) and t(2). The other from their remaining ancestors. The proposed path based hybrid measure does not suffer from the well-known shallow annotation problem. Its superiority with respect to some other popular measures is established for protein protein interaction prediction, correlation with gene expression and functional classification of genes in a biological pathway. Finally, the proposed measure is utilized to compute the average GO similarity score among the genes that are experimentally validated targets of some microRNAs. Results demonstrate that the targets of a given miRNA have a high degree of similarity in the biological process category of GO.
引用
收藏
页码:116 / 127
页数:12
相关论文
共 50 条
  • [41] GOGO: An improved algorithm to measure the semantic similarity between gene ontology terms
    Zhao, Chenguang
    Wang, Zheng
    SCIENTIFIC REPORTS, 2018, 8
  • [42] Exploring the Application of Gene Ontology Semantic Similarity Measure for Identifying Protein Complexes
    Luo, Jiawei
    Yu, Lingyao
    Dang, Qian
    2014 11TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2014, : 502 - 507
  • [43] GOGO: An improved algorithm to measure the semantic similarity between gene ontology terms
    Chenguang Zhao
    Zheng Wang
    Scientific Reports, 8
  • [44] Web Service Clustering using a Hybrid Term-Similarity Measure with Ontology Learning
    Kumara, Banage T. G. S.
    Paik, Incheon
    Chen, Wuhui
    Ryu, Keun Ho
    INTERNATIONAL JOURNAL OF WEB SERVICES RESEARCH, 2014, 11 (02) : 24 - 45
  • [45] An Ontology Concept Update Method Based on Hybrid Semantic Similarity
    Zhang, Peng
    Qi, Jiahui
    Wu, Min
    2019 2ND INTERNATIONAL CONFERENCE ON MECHANICAL ENGINEERING, INDUSTRIAL MATERIALS AND INDUSTRIAL ELECTRONICS (MEIMIE 2019), 2019, : 232 - 240
  • [46] IWD towards Semantic similarity measure in ontology
    Rathee, Preeti
    Malik, Sanjay Kumar
    JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2020, 41 (07): : 1561 - 1577
  • [47] Word and Sentence Embedding Tools to Measure Semantic Similarity of Gene Ontology Terms by Their Definitions
    Duong, Dat
    Ahmad, Wasi Uddin
    Eskin, Eleazar
    Chang, Kai-Wei
    Li, Jingyi Jessica
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2019, 26 (01) : 38 - 52
  • [48] Sigmoid similarity - a new feature-based similarity measure
    Likavec, Silvia
    Lombardi, Ilaria
    Cena, Federica
    INFORMATION SCIENCES, 2019, 481 : 203 - 218
  • [49] Evaluating the Significance of Protein Functional Similarity Based on Gene Ontology
    Konopka, Bogumil M.
    Golda, Tomasz
    Kotulska, Malgorzata
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2014, 21 (11) : 809 - 822
  • [50] Gene ontology similarity measures based on linear order statistics
    Keller, James M.
    Bezdek, James C.
    Popescu, Mihail
    Pal, Nikhil R.
    Mitchell, Joyce A.
    Huband, Jacalyn M.
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2006, 14 (06) : 639 - 661