A hybrid approach for measuring semantic similarity based on IC-weighted path distance in WordNet

被引:32
|
作者
Cai, Yuanyuan [1 ]
Zhang, Qingchuan [1 ]
Lu, Wei [2 ]
Che, Xiaoping [2 ]
机构
[1] Beijing Technol & Business Univ, Sch Comp & Informat Engn, Beijing Key Lab Big Data Technol Food Safety, Beijing 100048, Peoples R China
[2] Beijing Jiaotong Univ, Sch Software Engn, Beijing 100044, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Concept semantic similarity; Intrinsic information content; WordNet; Edge distance; INFORMATION-CONTENT; BIOMEDICAL DOMAIN; LEXICAL CHAINS; CONTEXT; ONTOLOGIES; SEARCH; VECTOR;
D O I
10.1007/s10844-017-0479-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a valuable tool for text understanding, semantic similarity measurement enables discriminative semantic-based applications in the fields of natural language processing, information retrieval, computational linguistics and artificial intelligence. Most of the existing studies have used structured taxonomies such as WordNet to explore the lexical semantic relationship, however, the improvement of computation accuracy is still a challenge for them. To address this problem, in this paper, we propose a hybrid WordNet-based approach CSSM-ICSP to measuring concept semantic similarity, which leverage the information content(IC) of concepts to weight the shortest path distance between concepts. To improve the performance of IC computation, we also develop a novel model of the intrinsic IC of concepts, where a variety of semantic properties involved in the structure of WordNet are taken into consideration. In addition, we summarize and classify the technical characteristics of previous WordNet-based approaches, as well as evaluate our approach against these approaches on various benchmarks. The experimental results of the proposed approaches are more correlated with human judgment of similarity in term of the correlation coefficient, which indicates that our IC model and similarity detection approach are comparable or even better for semantic similarity measurement as compared to others.
引用
收藏
页码:23 / 47
页数:25
相关论文
共 50 条
  • [21] Measuring Word Semantic Relatedness Using WordNet-Based Approach
    Wei, Tingting
    Chang, Huiyou
    [J]. JOURNAL OF COMPUTERS, 2015, 10 (04) : 252 - 259
  • [22] Efficient Hybrid Semantic Text Similarity using Wordnet and a Corpus
    Atoum, Issa
    Otoom, Ahmed
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (09) : 124 - 130
  • [23] Sentence Semantic Similarity based on Word FiImbedding and WordNet
    Farouk, Mamdouh
    [J]. PROCEEDINGS OF 2018 13TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND SYSTEMS (ICCES), 2018, : 33 - 37
  • [24] An algorithm for semantic similarity of short text based on WordNet
    Zhai, Yan-Dong
    Wang, Kang-Ping
    Zhang, Dong-Na
    Hunag, Lan
    Zhou, Chun-Guang
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2012, 40 (03): : 617 - 620
  • [25] Semantic similarity-based PageRank using wordnet
    Poomagal, S.
    Hamsapriya, T.
    Visalakshi, P.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2013, 46 (02) : 101 - 112
  • [26] Ontology-based approach for measuring semantic similarity
    Taieb, Mohamed Ali Hadj
    Ben Aouicha, Mohamed
    Ben Hamadou, Abdelmajid
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2014, 36 : 238 - 261
  • [27] A New Model of Information Content Based on Concept's Topology for Measuring Semantic Similarity in WordNet
    Meng, Lingling
    Gu, Junzhong
    Zhou, Zili
    [J]. INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2012, 5 (03): : 81 - 93
  • [28] A Novel Information Theoretic Approach for Finding Semantic Similarity in WordNet
    Adhikari, Abhijit
    Singh, Shivang
    Dutta, Animesh
    Dutta, Biswanath
    [J]. TENCON 2015 - 2015 IEEE REGION 10 CONFERENCE, 2015,
  • [29] EXPANDING APPROACH TO INFORMATION RETRIEVAL USING SEMANTIC SIMILARITY ANALYSIS BASED ON WORDNET AND WIKIPEDIA
    Zhao, Feng
    Fang, Fei
    Yan, Fengwei
    Jin, Hai
    Zhang, Qin
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2012, 22 (02) : 305 - 322
  • [30] Exploiting non-taxonomic relations for measuring semantic similarity and relatedness in WordNet
    AlMousa, Mohannad
    Benlamri, Rachid
    Khoury, Richard
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 212