An efficient approach for measuring semantic relatedness using Wikipedia bidirectional links

被引:5
|
作者
Zhu, Xinhua [1 ]
Guo, Qingsong [1 ]
Zhang, Bo [1 ,2 ]
Li, Fei [3 ]
机构
[1] Guangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin 541004, Peoples R China
[2] Hezhou Univ, Sch Math & Comp Sci, Hezhou 542899, Peoples R China
[3] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantic relatedness; Link vector; Vector similarity metric; Disambiguation; Wikipedia; INFORMATION-CONTENT; SIMILARITY; REPRESENTATION; ASSOCIATION;
D O I
10.1007/s10489-019-01452-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The measurement of the semantic relatedness between concepts is an important fundamental research topic in natural language processing. The link-based model is the most promising relatedness method in Wikipedia-based measures because its manually defined links in Wikipedia are refined and close to the semantics of humans. This paper proposes a Wikipedia two-way link model to extend the existing Wikipedia one-way out-link model, which has a low dimension and a high efficiency, as well as being easy to implement and repeat. First, this model utilizes the out-links and in-links of concepts in Wikipedia to combine into a bidirectional link vector for concept semantic interpreter and uses a TF*IDF-based bidirectional weight method to uniformly calculate the strength of the mutual association between a given concept and its out-link or in-link concept. Second, we propose a disambiguation strategy based on the social awareness of senses that directly sorts the out-links within a disambiguation page in the order in which they occur in the disambiguation page and adopts an adjustable threshold to determine how many senses will be selected. Moreover, we also propose new vector similarity metrics based on logarithm and exponent to improve the comprehensive performance of the semantic relatedness measurements based on Wikipedia links. The experimental results on some well-recognized datasets demonstrate that our model surpasses the existing popular Naive Explicit Semantic Analysis (Naive-ESA) and Wikipedia Out-Link vector-based Measure (WOLM) methods in the current Wikipedia versions and that our bidirectional link model significantly improves the performance of the existing one-way link model in practical applications.
引用
收藏
页码:3708 / 3730
页数:23
相关论文
共 50 条
  • [1] An efficient approach for measuring semantic relatedness using Wikipedia bidirectional links
    Xinhua Zhu
    Qingsong Guo
    Bo Zhang
    Fei Li
    Applied Intelligence, 2019, 49 : 3708 - 3730
  • [2] Measuring Semantic Relatedness using Wikipedia Signed Network
    Yang, Wen-Teng
    Kao, Hung-Yu
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2013, 29 (04) : 615 - 630
  • [3] Measuring semantic relatedness using wikipedia signed network
    1600, Institute of Information Science (29):
  • [4] Measuring Semantic Relatedness using Wikipedia Revision Information in a Signed Network
    Yang, Wen-Teng
    Kao, Hung-Yu
    2011 INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2011), 2011, : 69 - 74
  • [5] A New Approach for Computing Semantic Relatedness with Wikipedia
    Zhang, Xinye
    Li, Xiu
    Ruan, Zhijian
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER, NETWORKS AND COMMUNICATION ENGINEERING (ICCNCE 2013), 2013, 30 : 654 - 657
  • [6] An Efficient Approach for Measuring Semantic Similarity Combining WordNet and Wikipedia
    Li, Fei
    Liao, Lejian
    Zhang, Lanfang
    Zhu, Xinhua
    Zhang, Bo
    Wang, Zheng
    IEEE ACCESS, 2020, 8 : 184318 - 184338
  • [7] Computing semantic relatedness using Wikipedia features
    Taieb, Mohamed Ali Hadj
    Ben Aouicha, Mohamed
    Ben Hamadou, Abdelmajid
    KNOWLEDGE-BASED SYSTEMS, 2013, 50 : 260 - 278
  • [8] Discovering Cross-language Links in Wikipedia through Semantic Relatedness
    Penta, Antonio
    Quercini, Gianluca
    Reynaud, Chantal
    Shadbolt, Nigel
    20TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2012), 2012, 242 : 642 - +
  • [9] Wikipedia bi-linear link (WBLM) model: A new approach for measuring semantic similarity and relatedness between linguistic concepts using Wikipedia link structure
    Hussain, Muhammad Jawad
    Bai, Heming
    Jiang, Yuncheng
    INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (02)
  • [10] A Wikipedia Two-way Link Vector Model for Measuring Semantic Relatedness
    Zhu, Xinhua
    Guo, Qingsong
    Zhang, Bo
    2018 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2018, : 323 - 330