Context-enhanced concept disambiguation in Wikification

Authors
Saeidi, Mozhgan
Mahdaviani, Kaveh
Milios, Evangelos
Zeh, Norbert
Keywords
Wikification; Word sense disambiguation; Text coherence; Wikipedia; Representation learning; BANDWIDTH; CODES
DOI
10.1016/j.iswa.2023.200246
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Wikification is the task of automatically enriching a text with links to Wikipedia, used as a knowledge base. One step in Wikification is detecting ambiguous mentions; another is disambiguating them. This paper addresses the mention disambiguation problem. Some state-of-the-art disambiguation approaches divide a long input document into non-overlapping windows and then, for each ambiguous mention, pick the sense most similar to the chosen meaning of the key-entity (a word that helps disambiguate the other words of the text). Because the windows are disjoint, the most appropriate key-entity for a given mention may lie in an adjacent window, which reduces the accuracy of these methods. This work presents CACW (Context-Aware Concept Wikifier), a knowledge-based approach for assigning the correct meaning to ambiguous mentions in a document. CACW incorporates two algorithms: the first uses co-occurring mentions in consecutive windows to augment the contextual information available for finding the correct sense, and the second ranks senses by their relevance to the context. We also define a new disambiguation metric that measures the coherence of the whole text document. A comparison with state-of-the-art methods shows the effectiveness of our approach in terms of text coherence on the English Wikification task: we observed an improvement of 10 to 20 percent in the F1 measure over state-of-the-art techniques.
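
The window-boundary problem described in the abstract can be illustrated with a minimal sketch. This is not the authors' CACW implementation: the function names, the window size, and the example mentions are all hypothetical, and the sketch only shows how a helpful key-entity can fall outside a mention's disjoint window yet become reachable once neighboring windows are taken into account.

```python
from typing import List


def disjoint_windows(tokens: List[str], size: int) -> List[List[str]]:
    """Split tokens into non-overlapping windows of at most `size` tokens."""
    return [tokens[i:i + size] for i in range(0, len(tokens), size)]


def context_with_neighbors(windows: List[List[str]], index: int) -> List[str]:
    """Return window `index` extended with its immediate left and right neighbors."""
    start = max(0, index - 1)
    end = min(len(windows), index + 2)
    return [tok for window in windows[start:end] for tok in window]


# Hypothetical mention sequence (an assumption for illustration only):
# "jaguar" is the ambiguous mention, "engine" is the key-entity that would help.
mentions = ["car", "speed", "jaguar", "engine", "road", "fuel"]
windows = disjoint_windows(mentions, size=3)

target = 0  # index of the window that contains "jaguar"
in_own_window = "engine" in windows[target]
in_extended_context = "engine" in context_with_neighbors(windows, target)
print(in_own_window, in_extended_context)
```

With disjoint windows alone, "jaguar" and "engine" end up in different windows; including the adjacent window restores the missing context. How CACW actually selects and weights co-occurring mentions is not specified in this record.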
Pages: 15