Assessing Semantic Similarity Between Concepts Using Wikipedia Based on Nonlinear Fitting

被引:1
|
作者
Huang, Guangjian [1 ]
Jiang, Yuncheng [1 ]
Ma, Wenjun [1 ]
Liu, Weiru [2 ]
机构
[1] South China Normal Univ, Sch Comp Sci, Guangzhou, Peoples R China
[2] Univ Bristol, Sch Comp Sci Elect & Elect Engn & Engn Maths, Bristol, Avon, England
基金
中国国家自然科学基金;
关键词
Semantic similarity; Wikipedia; Nonlinear fitting; INFORMATION-CONTENT; REPRESENTATION; KNOWLEDGE; MODELS;
D O I
10.1007/978-3-030-29563-9_16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature-based methods of semantic similarity with Wikipedia achieve fruitful performances on measuring the "likeness" between objects in many research fields. However, since Wikipedia is created and edited by volunteers around the world, the preciseness of these methods more or less are influenced by the incompleteness, invalidity and inconsistency of the knowledge in Wikipedia. Unfortunately, this problem has not got enough attention in the existing work. To address this issue, this paper proposes a novel feature-based method for semantic similarity, which has three parts: low frequency features removal, the similarities of generalized synonyms computing, and weighted feature-based methods based on nonlinear fitting. Moreover, we show that our new method can always get a better Pearson correlation coefficient on one or more benchmarks through a set of experimental evaluations.
引用
收藏
页码:159 / 171
页数:13
相关论文
共 50 条
  • [1] An approach for measuring semantic similarity between Wikipedia concepts using multiple inheritances
    Hussain, Muhammad Jawad
    Wasti, Shahbaz Hassan
    Huang, Guangjian
    Wei, Lina
    Jiang, Yuncheng
    Tang, Yong
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (03)
  • [2] Feature-based approaches to semantic similarity assessment of concepts using Wikipedia
    Jiang, Yuncheng
    Zhang, Xiaopei
    Tang, Yong
    Nie, Ruihua
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2015, 51 (03) : 215 - 234
  • [3] Measuring Semantic Similarity between Words Using Wikipedia
    Lu Zhiqiang
    Shao Werimin
    Yu Zhenhua
    [J]. WISM: 2009 INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND MINING, PROCEEDINGS, 2009, : 251 - +
  • [4] Assessing semantic similarity between concepts: A weighted-feature-based approach
    Wasti, Shahbaz Hassan
    Hussain, Muhammad Jawad
    Huang, Guangjian
    Akram, Aftab
    Jiang, Yuncheng
    Tang, Yong
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (07):
  • [5] Experimental data for computing semantic similarity between concepts using multiple inheritances in Wikipedia category graph
    Hussain, Muhammad Jawad
    Wasti, Shahbaz Hassan
    Huang, Guangjian
    Jiang, Yuncheng
    [J]. DATA IN BRIEF, 2020, 30
  • [6] Computing semantic similarity based on novel models of semantic representation using Wikipedia
    Qu, Rong
    Fang, Yongyi
    Bai, Wen
    Jiang, Yuncheng
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2018, 54 (06) : 1002 - 1021
  • [7] Semantic Similarity between Concepts based on OWL Ontologies
    Xiao Min
    Zhong Luo
    Xiong Qianxing
    [J]. WKDD: 2009 SECOND INTERNATIONAL WORKSHOP ON KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, : 749 - +
  • [8] Semantic Annotation of Documents Based on Wikipedia Concepts
    Brank, Janez
    Leban, Gregor
    Grobelnik, Marko
    [J]. INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2018, 42 (01): : 23 - 32
  • [9] Semantic annotation of documents based on wikipedia concepts
    Brank, Janez
    Leban, Gregor
    Grobelnik, Marko
    [J]. Informatica (Slovenia), 2018, 42 (01): : 23 - 32
  • [10] Evaluating semantic similarity and relatedness between concepts by combining taxonomic and non-taxonomic semantic features of WordNet and Wikipedia
    Hussain, Muhammad Jawad
    Bai, Heming
    Wasti, Shahbaz Hassan
    Huang, Guangjian
    Jiang, Yuncheng
    [J]. INFORMATION SCIENCES, 2023, 625 : 673 - 699