An ontology-based measure to compute semantic similarity in biomedicine

被引:143
|
作者
Batet, Montserrat [1 ]
Sanchez, David [1 ]
Valls, Aida [1 ]
机构
[1] Univ Rovira & Virgili, Dept Engn Informat & Matemat, Intelligent Technol Adv Knowledge Acquisit ITAKA, Tarragona 43007, Catalonia, Spain
关键词
Semantic similarity; Ontologies; SNOMED CT; Biomedicine; Data mining; INFORMATION-RETRIEVAL; CONCEPTUAL DISTANCE; CONTEXT;
D O I
10.1016/j.jbi.2010.09.002
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Proper understanding of textual data requires the exploitation and integration of unstructured and heterogeneous clinical sources, healthcare records or scientific literature, which are fundamental aspects in clinical and translational research. The determination of semantic similarity between word pairs is an important component of text understanding that enables the processing, classification and structuring of textual resources. In the past, several approaches for assessing word similarity by exploiting different knowledge sources (ontologies, thesauri, domain corpora, etc.) have been proposed. Some of these measures have been adapted to the biomedical field by incorporating domain information extracted from clinical data or from medical ontologies (such as MeSH or SNOMED CT). In this paper, these approaches are introduced and analyzed in order to determine their advantages and limitations with respect to the considered knowledge bases. After that, a new measure based on the exploitation of the taxonomical structure of a biomedical ontology is proposed. Using SNOMED CT as the input ontology, the accuracy of our proposal is evaluated and compared against other approaches according to a standard benchmark of manually ranked medical terms. The correlation between the results of the evaluated measures and the human experts' ratings shows that our proposal outperforms most of the previous measures avoiding, at the same time, some of their limitations. (C) 2010 Elsevier Inc. All rights reserved.
引用
收藏
页码:118 / 125
页数:8
相关论文
共 50 条
  • [1] An Ontology-Based Semantic Similarity Measure Considering Multi-Inheritance in Biomedicine
    Yang, Fengqin
    Xing, Yuanyuan
    Sun, Hongguang
    Sun, Tieli
    Chen, Siya
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
  • [2] A Semantic Similarity Measure for Ontology-Based Information
    Stuckenschmidt, Heiner
    FLEXIBLE QUERY ANSWERING SYSTEMS: 8TH INTERNATIONAL CONFERENCE, FQAS 2009, 2009, 5822 : 406 - 417
  • [3] Ontology-based Measure of Semantic Similarity between Concepts
    Shi Bin
    Fang Liying
    Yan Jianzhuo
    Wang Pu
    Zhao Zhongcheng
    2009 WRI WORLD CONGRESS ON SOFTWARE ENGINEERING, VOL 2, PROCEEDINGS, 2009, : 109 - 112
  • [4] Ontology-based Semantic Similarity Measure with Concept Lattice
    Song, Huazhu
    Xiao, Cong
    Xu, Lu
    INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY II, PTS 1-4, 2013, 411-414 : 177 - 181
  • [5] New ontology-based semantic similarity measure for the biomedical domain
    Nguyen, Hoa A.
    Al-Mubaid, Hisham
    2006 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, 2006, : 623 - +
  • [6] From Ontology to Semantic Similarity: Calculation of Ontology-Based Semantic Similarity
    Gan, Mingxin
    Dou, Xue
    Jiang, Rui
    SCIENTIFIC WORLD JOURNAL, 2013,
  • [7] Ontology-based approach for measuring semantic similarity
    Taieb, Mohamed Ali Hadj
    Ben Aouicha, Mohamed
    Ben Hamadou, Abdelmajid
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2014, 36 : 238 - 261
  • [8] An Ontology-Based Semantic Similarity Computation Model
    Yang, Yuehua
    Ping, Yuan
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2018, : 561 - 564
  • [9] Research on Ontology-Based Measuring Semantic Similarity
    Yin Guisheng
    Sheng Qiuyan
    ICICSE: 2008 INTERNATIONAL CONFERENCE ON INTERNET COMPUTING IN SCIENCE AND ENGINEERING, PROCEEDINGS, 2008, : 250 - 253
  • [10] An Improvement on the Model of Ontology-Based Semantic Similarity Computation
    Wang, Xiaoyun
    Zhou, Jianfeng
    FIRST INTERNATIONAL WORKSHOP ON DATABASE TECHNOLOGY AND APPLICATIONS, PROCEEDINGS, 2009, : 509 - 512