A hybrid approach for measuring semantic similarity based on IC-weighted path distance in WordNet

被引:32
|
作者
Cai, Yuanyuan [1 ]
Zhang, Qingchuan [1 ]
Lu, Wei [2 ]
Che, Xiaoping [2 ]
机构
[1] Beijing Technol & Business Univ, Sch Comp & Informat Engn, Beijing Key Lab Big Data Technol Food Safety, Beijing 100048, Peoples R China
[2] Beijing Jiaotong Univ, Sch Software Engn, Beijing 100044, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Concept semantic similarity; Intrinsic information content; WordNet; Edge distance; INFORMATION-CONTENT; BIOMEDICAL DOMAIN; LEXICAL CHAINS; CONTEXT; ONTOLOGIES; SEARCH; VECTOR;
D O I
10.1007/s10844-017-0479-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a valuable tool for text understanding, semantic similarity measurement enables discriminative semantic-based applications in the fields of natural language processing, information retrieval, computational linguistics and artificial intelligence. Most of the existing studies have used structured taxonomies such as WordNet to explore the lexical semantic relationship, however, the improvement of computation accuracy is still a challenge for them. To address this problem, in this paper, we propose a hybrid WordNet-based approach CSSM-ICSP to measuring concept semantic similarity, which leverage the information content(IC) of concepts to weight the shortest path distance between concepts. To improve the performance of IC computation, we also develop a novel model of the intrinsic IC of concepts, where a variety of semantic properties involved in the structure of WordNet are taken into consideration. In addition, we summarize and classify the technical characteristics of previous WordNet-based approaches, as well as evaluate our approach against these approaches on various benchmarks. The experimental results of the proposed approaches are more correlated with human judgment of similarity in term of the correlation coefficient, which indicates that our IC model and similarity detection approach are comparable or even better for semantic similarity measurement as compared to others.
引用
收藏
页码:23 / 47
页数:25
相关论文
共 50 条
  • [1] A hybrid approach for measuring semantic similarity based on IC-weighted path distance in WordNet
    Yuanyuan Cai
    Qingchuan Zhang
    Wei Lu
    Xiaoping Che
    [J]. Journal of Intelligent Information Systems, 2018, 51 : 23 - 47
  • [2] A Hybrid Approach for Measuring Semantic Similarity between Ontologies Based on WordNet
    He, Wei
    Yang, Xiaoping
    Huang, Dupei
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, 2011, 7091 : 68 - +
  • [3] Measuring Semantic Similarity Based On WordNet
    Zhao, Zhongcheng
    Yan, Jianzhuo
    Fang, Liying
    Wang, Pu
    [J]. 2009 SIXTH WEB INFORMATION SYSTEMS AND APPLICATIONS CONFERENCE, PROCEEDINGS, 2009, : 89 - 92
  • [4] A novel wordnet-based approach for measuring semantic similarity
    Zhu, Xinhua
    Li, Fei
    Chen, Hongchao
    Mao, Junqing
    [J]. Journal of Information and Computational Science, 2015, 12 (13): : 4919 - 4927
  • [5] Measuring semantic similarity in WordNet
    Liu, Xiao-Ying
    Zhou, Yi-Ming
    Zheng, Ruo-Shi
    [J]. PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 3431 - +
  • [6] An information Content-Based Approach for Measuring Concept Semantic Similarity in WordNet
    Zhang, Xiaogang
    Sun, Shouqian
    Zhang, Kejun
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2018, 103 (01) : 117 - 132
  • [7] An information Content-Based Approach for Measuring Concept Semantic Similarity in WordNet
    Xiaogang Zhang
    Shouqian Sun
    Kejun Zhang
    [J]. Wireless Personal Communications, 2018, 103 : 117 - 132
  • [8] A fuzzy approach for measuring the semantic similarity between words in WordNet
    Song, Ling
    Ma, Jun
    Lei, Jingsheng
    Li, Chao
    [J]. Journal of Information and Computational Science, 2009, 6 (03): : 1673 - 1680
  • [9] An Efficient Approach for Measuring Semantic Similarity Combining WordNet and Wikipedia
    Li, Fei
    Liao, Lejian
    Zhang, Lanfang
    Zhu, Xinhua
    Zhang, Bo
    Wang, Zheng
    [J]. IEEE ACCESS, 2020, 8 : 184318 - 184338
  • [10] A New Hybrid Semantic Similarity Measure Based on WordNet
    Meng, Lingling
    Gu, Junzhong
    Zhou, Zili
    [J]. NETWORK COMPUTING AND INFORMATION SECURITY, 2012, 345 : 739 - +