Context-enhanced concept disambiguation in Wikification

被引:0
|
作者
Saeidi, Mozhgan
Mahdaviani, Kaveh
Milios, Evangelos
Zeh, Norbert
机构
来源
关键词
Wikification; Word sense disambiguation; Text coherence; Wikipedia; Representation learning; WIKIPEDIA; BANDWIDTH; CODES;
D O I
10.1016/j.iswa.2023.200246
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Wikification is a method to automatically enrich a text with links to Wikipedia as a knowledge base. One step in Wikification is detecting ambiguous mentions, and one other step is disambiguating those mentions. In this paper, we worked on the mention disambiguation problem. Some state-of-the-art disambiguation approaches have divided long input document text into non-overlapping windows. Later, for each ambiguous mention, they pick the most similar sense to the chosen meaning of the key-entity (a word that helps disambiguation other words of the text). Partitioning the input into disjoint windows means that the most appropriate key-entity to disambiguate a given mention may be in an adjacent window. The disjoint windows negatively affect the accuracy of these methods. This work presents CACW (Context-Aware Concept Wikifier), a knowledge-based approach to produce the correct meaning for ambiguous mentions in the document. CACW incorporates two algorithms; the first uses co-occurring mentions in consecutive windows to augment the available contextual information to find the correct sense. The second algorithm ranks senses based on their context relevancy. We also define a new metric for disambiguation to measure the coherence of the whole text document. Comparing our approach with state-of-the-art methods shows the effectiveness of our method in terms of text coherence in the English Wikification task. We observed between 10-20 percent improvement in the F1 measure compared to the state-of-the-art techniques.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Open dataset discovery using context-enhanced similarity search
    Bernhauer, David
    Necasky, Martin
    Skoda, Petr
    Klimek, Jakub
    Skopal, Tomas
    KNOWLEDGE AND INFORMATION SYSTEMS, 2022, 64 (12) : 3265 - 3291
  • [22] Open dataset discovery using context-enhanced similarity search
    David Bernhauer
    Martin Nečaský
    Petr Škoda
    Jakub Klímek
    Tomáš Skopal
    Knowledge and Information Systems, 2022, 64 : 3265 - 3291
  • [23] Context-enhanced motion coherence modeling for global outlier rejection
    Li, Hongjie
    Dong, Mingyue
    Zheng, Xianwei
    Xu, Xiong
    Xie, Xiao
    Xiong, Hanjiang
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 202 : 69 - 86
  • [24] Integrating the physical world with the web to enable context-enhanced mobile services
    Debaty, P
    Goddi, P
    Vorbau, A
    MOBILE NETWORKS & APPLICATIONS, 2005, 10 (04): : 385 - 394
  • [25] A context-enhanced neural network model for biomedical event trigger detection
    Wang, Zilin
    Ren, Yafeng
    Peng, Qiong
    Ji, Donghong
    Information Sciences, 2025, 691
  • [26] Integrating the Physical World with the Web to Enable Context-Enhanced Mobile Services
    Philippe Debaty
    Patrick Goddi
    Alex Vorbau
    Mobile Networks and Applications, 2005, 10 : 385 - 394
  • [27] Context-Enhanced LLM-Based Framework for Automatic Test Refactoring
    Gao, Yi
    Hu, Xing
    Yang, Xiaohu
    Xia, Xin
    arXiv,
  • [28] A context-enhanced Dirichlet model for online clustering in short text streams
    Kumar, Jay
    Shao, Junming
    Kumar, Rajesh
    Din, Salah Ud
    Mawuli, Cobbinah B.
    Yang, Qinli
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 228
  • [29] Context-Enhanced Probabilistic Diffusion for Urban Point-of-Interest Recommendation
    Zhang, Zhipeng
    Dong, Mianxiong
    Ota, Kaoru
    Zhang, Yao
    Kudo, Yasuo
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2022, 15 (06) : 3156 - 3169
  • [30] A Context-Enhanced Generate-then-Evaluate Framework for Chinese Abbreviation Prediction
    Tong, Hanwen
    Xie, Chenhao
    Liang, Jiaqing
    He, Qianyu
    Yue, Zhiang
    Liu, Jingping
    Xiao, Yanghua
    Wang, Wenguang
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 1945 - 1954