Evaluating semantic similarity and relatedness between concepts by combining taxonomic and non-taxonomic semantic features of WordNet and Wikipedia

被引:9
|
作者
Hussain, Muhammad Jawad [1 ,2 ]
Bai, Heming [1 ,2 ]
Wasti, Shahbaz Hassan [4 ]
Huang, Guangjian [3 ]
Jiang, Yuncheng [5 ,6 ]
机构
[1] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Peoples R China
[2] Nantong Univ, Res Ctr Intelligent Informat Technol, Nantong 226019, Peoples R China
[3] Jiangxi Univ Sci & Technol, Sch Informat Engn, Ganzhou, Peoples R China
[4] Univ Educ, Dept Informat Sci, Div Sci & Technol, Lahore, Pakistan
[5] South China Normal Univ, Sch Comp Sci, Guangzhou 510631, Peoples R China
[6] South China Normal Univ, Sch Sci, Guangzhou 510631, Peoples R China
基金
中国国家自然科学基金;
关键词
Information content; Semantic relatedness; Semantic similarity; Vector space; Wikipedia; WordNet; INFORMATION-CONTENT; EFFICIENT APPROACH; MODELS;
D O I
10.1016/j.ins.2023.01.007
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many applications in cognitive science and artificial intelligence utilize semantic similarity and relatedness to solve difficult tasks such as information retrieval, word sense disam-biguation, and text classification. Previously, several approaches for evaluating concept similarity and relatedness based on WordNet or Wikipedia have been proposed. WordNet-based methods rely on highly precise knowledge but have limited lexical cover-age. In contrast, Wikipedia-based models achieve more coverage but sacrifice knowledge quality. Therefore, in this paper, we focus on developing a comprehensive semantic simi-larity and relatedness method based on WordNet and Wikipedia. To improve the accuracy of existing measures, we combine various taxonomic and non-taxonomic features of WordNet, including gloss, lemmas, examples, sister-terms, derivations, holonyms/mero-nyms, and hypernyms/hyponyms, with Wikipedia gloss and hyperlinks, to describe con-cepts. We present a novel technique for extracting 'is-a' and 'part-whole' relationships between concepts using the Wikipedia link structure. The suggested technique identifies taxonomic and non-taxonomic relationships between concepts and offers dense vector representations of concepts. To fully exploit WordNet and Wikipedia's semantic attributes, the proposed method integrates their semantic knowledge at feature-level, combining semantic similarity and relatedness into a single comprehensive measure. The experimen-tal results demonstrate the effectiveness of the proposed method over state-of-the-art measures on various gold standard benchmarks. (c) 2023 Elsevier Inc. All rights reserved.
引用
收藏
页码:673 / 699
页数:27
相关论文
共 13 条
  • [1] Exploiting non-taxonomic relations for measuring semantic similarity and relatedness in WordNet
    AlMousa, Mohannad
    Benlamri, Rachid
    Khoury, Richard
    KNOWLEDGE-BASED SYSTEMS, 2021, 212
  • [2] An Efficient Approach for Measuring Semantic Similarity Combining WordNet and Wikipedia
    Li, Fei
    Liao, Lejian
    Zhang, Lanfang
    Zhu, Xinhua
    Zhang, Bo
    Wang, Zheng
    IEEE ACCESS, 2020, 8 : 184318 - 184338
  • [3] Combining and learning word embedding with WordNet for semantic relatedness and similarity measurement
    Lee, Yang-Yin
    Ke, Hao
    Yen, Ting-Yu
    Huang, Hen-Hsen
    Chen, Hsin-Hsi
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2020, 71 (06) : 657 - 670
  • [4] A graph-based semantic relatedness assessment method combining wikipedia features
    Li, Pu
    Xiao, Bao
    Ma, Wenjun
    Jiang, Yuncheng
    Zhang, Zhifeng
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 65 : 268 - 281
  • [5] EVALUATING SEMANTIC RELATEDNESS USING WIKIPEDIA-BASED REPRESENTATIVE FEATURES ANALYSIS
    Cui, Qing-jun
    Zhang, Hui
    Liu, Rui
    2011 INTERNATIONAL CONFERENCE ON INSTRUMENTATION, MEASUREMENT, CIRCUITS AND SYSTEMS (ICIMCS 2011), VOL 3: COMPUTER-AIDED DESIGN, MANUFACTURING AND MANAGEMENT, 2011, : 467 - 472
  • [6] Study on Non-Taxonomic Relation Extraction between Concepts from Ontology Learning
    Lin, Junping
    Lin, Shiping
    PROCEEDINGS OF 2010 ASIA-PACIFIC YOUTH CONFERENCE ON COMMUNICATION, VOLS 1 AND 2, 2010, : 573 - 576
  • [7] A Non-Taxonomic Relation Extraction Method between Concepts from Ontology Learning
    Lin, Shiping
    Lin, Junping
    ASIA-PACIFIC YOUTH CONFERENCE ON COMMUNICATION TECHNOLOGY 2010 (APYCCT 2010), 2010, : 574 - 577
  • [8] Evaluating Taxonomic Relationships Using Semantic Similarity Measures on Sensor Domain Ontologies
    Tovar Vidal, Mireya
    Hernandez Garcia, Aimee Cecilia
    Lavalle Martinez, Jose de Jesus
    Reyes-Ortiz, Jose A.
    Vilarino Ayala, Dames
    ADVANCES IN INFORMATION AND COMMUNICATION, VOL 2, 2020, 1130 : 282 - 294
  • [9] Association measures for estimating semantic similarity and relatedness between biomedical concepts
    Henry, Sam
    McQuilkin, Alex
    McInnes, Bridget T.
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2019, 93 : 1 - 10
  • [10] An approach for measuring semantic similarity between Wikipedia concepts using multiple inheritances
    Hussain, Muhammad Jawad
    Wasti, Shahbaz Hassan
    Huang, Guangjian
    Wei, Lina
    Jiang, Yuncheng
    Tang, Yong
    INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (03)