Association measures for estimating semantic similarity and relatedness between biomedical concepts

被引:9
|
作者
Henry, Sam [1 ]
McQuilkin, Alex [1 ]
McInnes, Bridget T. [1 ]
机构
[1] Virginia Commonwealth Univ, Richmond, VA 23284 USA
关键词
Natural language processing; Association measures; Semantic similarity; Semantic relatedness; NETWORK; INTERACTOME; PHENOME; UMLS;
D O I
10.1016/j.artmed.2018.08.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Association measures quantify the observed likelihood a term pair co-occurs versus their predicted co-occurrence together if by chance. This is based both on the terms' individual occurrence frequencies, and their mutual co-occurrence frequencies. One application of association scores is estimating semantic relatedness, which is critical for many natural language processing applications, such as clustering of biomedical and clinical documents and the development of biomedical terminologies and ontololgies. In this paper we propose a method of generating association scores between biomedical concepts to estimate semantic relatedness. We use co-occurrence statistics between Unified Medical Language System (UMLS) concepts to account for lexical variation at the synonymous level, and introduce a process of concept expansion that exploits hierarchical information from the UMLS to account for lexical variation at the hyponymous level. State of the art results are achieved on several standard evaluation datasets, and an in depth analysis of hyper-parameters is presented.
引用
收藏
页码:1 / 10
页数:10
相关论文
共 50 条
  • [1] Measures of semantic similarity and relatedness in the biomedical domain
    Pedersen, Ted
    Pakhomov, Serguei V. S.
    Patwardhan, Siddharth
    Chute, Christopher G.
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2007, 40 (03) : 288 - 299
  • [2] Evaluating measures of semantic similarity and relatedness to disambiguate terms in biomedical text
    McInnes, Bridget T.
    Pedersen, Ted
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2013, 46 (06) : 1116 - 1124
  • [3] The semantic measures library and toolkit: fast computation of semantic similarity and relatedness using biomedical ontologies
    Harispe, Sebastien
    Ranwez, Sylvie
    Janaqi, Stefan
    Montmain, Jacky
    [J]. BIOINFORMATICS, 2014, 30 (05) : 740 - 742
  • [4] Measuring Semantic Similarity Between Biomedical Concepts Within Multiple Ontologies
    Al-Mubaid, Hisham
    Nguyen, Hoa A.
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2009, 39 (04): : 389 - 398
  • [5] SISR: System for integrating semantic relatedness and similarity measures
    Mohamed Ben Aouicha
    Mohamed Ali Hadj Taieb
    Abdelmajid Ben Hamadou
    [J]. Soft Computing, 2018, 22 : 1855 - 1879
  • [6] SISR: System for integrating semantic relatedness and similarity measures
    Ben Aouicha, Mohamed
    Taieb, Mohamed Ali Hadj
    Ben Hamadou, Abdelmajid
    [J]. SOFT COMPUTING, 2018, 22 (06) : 1855 - 1879
  • [7] Semantic classification of biomedical concepts using distributional similarity
    Fan, Jung-Wei
    Friedman, Carol
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2007, 14 (04) : 467 - 477
  • [8] Evaluating semantic similarity and relatedness between concepts by combining taxonomic and non-taxonomic semantic features of WordNet and Wikipedia
    Hussain, Muhammad Jawad
    Bai, Heming
    Wasti, Shahbaz Hassan
    Huang, Guangjian
    Jiang, Yuncheng
    [J]. INFORMATION SCIENCES, 2023, 625 : 673 - 699
  • [9] Computing semantic similarity between biomedical concepts using new information content approach
    Ben Aouicha, Mohamed
    Taieb, Mohamed Ali Hadj
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2016, 59 : 258 - 275
  • [10] Adapting Gloss Vector Semantic Relatedness Measure for Semantic Similarity Estimation: An Evaluation in the Biomedical Domain
    Pesaranghader, Ahmad
    Rezaei, Azadeh
    Pesaranghader, Ali
    [J]. SEMANTIC TECHNOLOGY, 2014, 8388 : 129 - 145