A New Path Based Hybrid Measure for Gene Ontology Similarity

被引:20
|
作者
Bandyopadhyay, Sanghamitra [1 ]
Mallick, Koushik [2 ]
机构
[1] Indian Stat Inst, Machine Intelligence Unit, Kolkata 700108, W Bengal, India
[2] RCC Inst Informat Technol, CSE Dept, Kolkata 700015, W Bengal, India
关键词
Gene ontology similarity; semantic similarity; term similarity; information content; protein interaction prediction; functional classification of genes; microRNA; SEMANTIC SIMILARITY; PROTEIN-INTERACTION; SACCHAROMYCES-CEREVISIAE; FUNCTIONAL SIMILARITY; R PACKAGE; DATABASE; GO; SEQUENCE; NETWORK; TOOLS;
D O I
10.1109/TCBB.2013.149
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Gene Ontology (GO) consists of a controlled vocabulary of terms, annotating a gene or gene product, structured in a directed acyclic graph. In the graph, semantic relations connect the terms, that represent the knowledge of functional description and cellular component information of gene products. GO similarity gives us a numerical representation of biological relationship between a gene set, which can be used to infer various biological facts such as protein interaction, structural similarity, gene clustering, etc. Here we introduce a new shortest path based hybrid measure of ontological similarity between two terms which combines both structure of the GO graph and information content of the terms. Here the similarity between two terms t(1) and t(2), referred to as GOSim(PBHM)(t(1), t(2)), has two components; one obtained from the common ancestors of t(1) and t(2). The other from their remaining ancestors. The proposed path based hybrid measure does not suffer from the well-known shallow annotation problem. Its superiority with respect to some other popular measures is established for protein protein interaction prediction, correlation with gene expression and functional classification of genes in a biological pathway. Finally, the proposed measure is utilized to compute the average GO similarity score among the genes that are experimentally validated targets of some microRNAs. Results demonstrate that the targets of a given miRNA have a high degree of similarity in the biological process category of GO.
引用
收藏
页码:116 / 127
页数:12
相关论文
共 50 条
  • [31] A Semantic Similarity Measure for Ontology-Based Information
    Stuckenschmidt, Heiner
    FLEXIBLE QUERY ANSWERING SYSTEMS: 8TH INTERNATIONAL CONFERENCE, FQAS 2009, 2009, 5822 : 406 - 417
  • [32] Exploring protein networks with a semantic similarity measure across Gene Ontology
    Lubovac, Z
    Gamalielsson, J
    Olsson, B
    Lindlöf, A
    PROCEEDINGS OF THE 8TH JOINT CONFERENCE ON INFORMATION SCIENCES, VOLS 1-3, 2005, : 1203 - 1208
  • [33] An ontology-based measure to compute semantic similarity in biomedicine
    Batet, Montserrat
    Sanchez, David
    Valls, Aida
    JOURNAL OF BIOMEDICAL INFORMATICS, 2011, 44 (01) : 118 - 125
  • [34] Ontology-based Measure of Semantic Similarity between Concepts
    Shi Bin
    Fang Liying
    Yan Jianzhuo
    Wang Pu
    Zhao Zhongcheng
    2009 WRI WORLD CONGRESS ON SOFTWARE ENGINEERING, VOL 2, PROCEEDINGS, 2009, : 109 - 112
  • [35] Ontology-based Semantic Similarity Measure with Concept Lattice
    Song, Huazhu
    Xiao, Cong
    Xu, Lu
    INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY II, PTS 1-4, 2013, 411-414 : 177 - 181
  • [36] A semantic similarity measure based on information distance for ontology alignment
    Jiang, Yong
    Wang, Xinmin
    Zheng, Hai-Tao
    INFORMATION SCIENCES, 2014, 278 : 76 - 87
  • [37] A New Semantic Functional Similarity over Gene Ontology
    Jeong, Jong Cheol
    Chen, Xuewen
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2015, 12 (02) : 322 - 334
  • [38] A Rough Similarity Measure for Ontology Mapping
    Zhao, Yi
    Halang, Wolfgang
    Wang, Xia
    2008 3RD INTERNATIONAL CONFERENCE ON INTERNET AND WEB APPLICATIONS AND SERVICES (ICIW 2008), 2008, : 136 - +
  • [39] A New Method for Measuring the Semantic Similarity on Gene Ontology
    Shen, Ying
    Zhang, Shaohong
    Wong, Hau-San
    2010 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2010, : 533 - 538
  • [40] A new metric to measure gene product similarity
    Mathur, Sachin
    Dinakarpandian, Deendayal
    2007 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, PROCEEDINGS, 2007, : 333 - 338