Measures of semantic similarity and relatedness in the biomedical domain

被引:316
|
作者
Pedersen, Ted
Pakhomov, Serguei V. S.
Patwardhan, Siddharth
Chute, Christopher G.
机构
[1] Univ Minnesota, Dept Comp Sci, Duluth, MN 55812 USA
[2] Mayo Coll Med, Div Biomed Informat, Rochester, MN USA
[3] Univ Utah, Sch Comp, Salt Lake City, UT 84112 USA
关键词
semantic similarity; path based measures; information content; context vectors; SNOMED-CT;
D O I
10.1016/j.jbi.2006.06.004
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Measures of semantic similarity between concepts are widely used in Natural Language Processing. In this article, we show how six existing domain-independent measures can be adapted to the biomedical domain. These measures were originally based on WordNet, an English lexical database of concepts and relations. In this research, we adapt these measures to the SNOMED-CT (R) ontology of medical concepts. The measures include two path-based measures, and three measures that augment path-based measures with information content statistics from corpora. We also derive a context vector measure based on medical corpora that can be used as a measure of semantic relatedness. These six measures are evaluated against a newly created test bed of 30 medical concept pairs scored by three physicians and nine medical coders. We find that the medical coders and physicians differ in their ratings, and that the context vector measure correlates most closely with the physicians, while the path-based measures and one of the information content measures correlates most closely with the medical coders. We conclude that there is a role both for more flexible measures of relatedness based on information derived from corpora, as well as for measures that rely on existing ontological structures. (C) 2006 Elsevier Inc. All rights reserved.
引用
收藏
页码:288 / 299
页数:12
相关论文
共 50 条
  • [1] Association measures for estimating semantic similarity and relatedness between biomedical concepts
    Henry, Sam
    McQuilkin, Alex
    McInnes, Bridget T.
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2019, 93 : 1 - 10
  • [2] Evaluating measures of semantic similarity and relatedness to disambiguate terms in biomedical text
    McInnes, Bridget T.
    Pedersen, Ted
    JOURNAL OF BIOMEDICAL INFORMATICS, 2013, 46 (06) : 1116 - 1124
  • [3] The semantic measures library and toolkit: fast computation of semantic similarity and relatedness using biomedical ontologies
    Harispe, Sebastien
    Ranwez, Sylvie
    Janaqi, Stefan
    Montmain, Jacky
    BIOINFORMATICS, 2014, 30 (05) : 740 - 742
  • [4] Adapting Gloss Vector Semantic Relatedness Measure for Semantic Similarity Estimation: An Evaluation in the Biomedical Domain
    Pesaranghader, Ahmad
    Rezaei, Azadeh
    Pesaranghader, Ali
    SEMANTIC TECHNOLOGY, 2014, 8388 : 129 - 145
  • [5] Effects of domain on measures of semantic relatedness
    Macias-Galindo, Daniel
    Cavedon, Lawrence
    Thangarajah, John
    Wong, Wilson
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2015, 66 (10) : 2116 - 2131
  • [6] Semantic Similarity Measures in the Biomedical Domain by Leveraging a Web Search Engine
    Hsieh, Sheau-Ling
    Chang, Wen-Yung
    Chen, Chi-Huang
    Weng, Yung-Ching
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2013, 17 (04) : 853 - 861
  • [7] SISR: System for integrating semantic relatedness and similarity measures
    Mohamed Ben Aouicha
    Mohamed Ali Hadj Taieb
    Abdelmajid Ben Hamadou
    Soft Computing, 2018, 22 : 1855 - 1879
  • [8] SISR: System for integrating semantic relatedness and similarity measures
    Ben Aouicha, Mohamed
    Taieb, Mohamed Ali Hadj
    Ben Hamadou, Abdelmajid
    SOFT COMPUTING, 2018, 22 (06) : 1855 - 1879
  • [9] A framework for unifying ontology-based semantic similarity measures: A study in the biomedical domain
    Harispe, Sebastien
    Sanchez, David
    Ranwez, Sylvie
    Janaqi, Stefan
    Montmain, Jacky
    JOURNAL OF BIOMEDICAL INFORMATICS, 2014, 48 : 38 - 53
  • [10] A framework for unifying ontology-based semantic similarity measures: A study in the biomedical domain
    Harispe, Sébastien
    Sánchez, David
    Ranwez, Sylvie
    Janaqi, Stefan
    Montmain, Jacky
    Journal of Biomedical Informatics, 2014, 48 : 38 - 53