An Empirical Study of Word Sense Disambiguation for Biomedical Information Retrieval System

Cited by: 0
Authors
Rais, Mohammed [1 ]
Lachkar, Abdelmonaime [1 ]
Affiliations
[1] USMBA, Dept Elect & Comp Engn, LISA, ENSA, Fes, Morocco
Keywords
Natural language processing; Biomedical Information Retrieval; Word Sense Disambiguation; Biomedical indexing methods; Strategy of disambiguation; Conceptualization; Sense based indexing; Relatedness
DOI
10.1007/978-3-319-78723-7_27
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
Document representation is an important stage in indexing biomedical documents. The usual way to represent a text is as a bag of words (BoW). This representation ignores the semantics of the original text, so the resulting representations lack sense information; by contrast, conceptualization using background knowledge enriches document representation models. Three strategies can be used to carry out the conceptualization task: Adding Concept, Partial Conceptualization, and Complete Conceptualization. When the senses of a polysemous term are looked up in semantic resources, multiple matches are found, which introduces ambiguity into the final document representation. Three disambiguation strategies can be used: First Concept, All Concepts, and Context-Based. SenseRelate is a well-known Context-Based algorithm that uses a fixed window size and weights the terms in the context according to their distance from the target word. Because this weighting may negatively affect the resulting concepts or senses, we propose a simple modified version of the SenseRelate algorithm, NoDistanceSenseRelate, which ignores the distance; that is, all terms in the context receive the same distance weight. To evaluate the effect of the conceptualization and disambiguation strategies on the indexing process, several experiments were conducted on a biomedical information retrieval system using the OHSUMED corpus. The results show that the Context-Based methods (SenseRelate and NoDistanceSenseRelate) outperform the others when the Adding Concept conceptualization strategy is applied. The results support adding the senses of concepts to the term representation in the IR process.
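The three disambiguation strategies named in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the toy sense inventory `SENSES`, the `RELATEDNESS` scores, the function names, and in particular the `1 / distance` weight, which stands in for whatever distance weighting SenseRelate actually applies. Setting `use_distance=False` mimics the idea behind NoDistanceSenseRelate, where every context term gets the same weight.

```python
from typing import Dict, List, Sequence

# Hypothetical sense inventory: term -> candidate concept IDs (made up).
SENSES: Dict[str, List[str]] = {
    "cold": ["C_common_cold", "C_low_temperature"],
    "virus": ["C_virus"],
    "fever": ["C_fever"],
    "water": ["C_water"],
}

# Hypothetical symmetric relatedness scores between concepts (made up).
RELATEDNESS: Dict[frozenset, float] = {
    frozenset({"C_common_cold", "C_virus"}): 0.9,
    frozenset({"C_common_cold", "C_fever"}): 0.8,
    frozenset({"C_low_temperature", "C_water"}): 0.75,
}


def relatedness(a: str, b: str) -> float:
    """Look up the toy relatedness of two concepts (0.0 if unknown)."""
    return RELATEDNESS.get(frozenset({a, b}), 0.0)


def first_concept(term: str) -> List[str]:
    """First Concept strategy: keep only the first listed sense."""
    return SENSES[term][:1]


def all_concepts(term: str) -> List[str]:
    """All Concepts strategy: keep every candidate sense."""
    return list(SENSES[term])


def context_based(tokens: Sequence[str], index: int,
                  window: int = 2, use_distance: bool = True) -> str:
    """Context-Based strategy: pick the sense most related to the
    terms within a fixed window around the target word.

    use_distance=True  -> nearer terms count more (assumed 1/distance weight).
    use_distance=False -> all context terms get the same weight.
    """
    target = tokens[index]
    lo = max(0, index - window)
    hi = min(len(tokens), index + window + 1)
    best, best_score = SENSES[target][0], float("-inf")
    for candidate in SENSES[target]:
        score = 0.0
        for j in range(lo, hi):
            if j == index or tokens[j] not in SENSES:
                continue
            weight = 1.0 / abs(j - index) if use_distance else 1.0
            # Score against the best-matching sense of the context term.
            score += weight * max(relatedness(candidate, s)
                                  for s in SENSES[tokens[j]])
        if score > best_score:
            best, best_score = candidate, score
    return best


tokens = ["virus", "fever", "water", "cold"]
with_distance = context_based(tokens, 3, window=3, use_distance=True)
without_distance = context_based(tokens, 3, window=3, use_distance=False)
```

In this toy input the adjacent word "water" dominates when distance weighting is on, so `with_distance` resolves to the temperature sense, while equal weighting lets the two farther medical terms outvote it and `without_distance` resolves to the illness sense. This only illustrates how the weighting can change the selected sense; it says nothing about which variant performs better on real data.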
Pages: 314-326
Page count: 13
Related papers
50 in total
  • [21] Cross-language information retrieval using EuroWordNet and word sense disambiguation
    Clough, P
    Stevenson, M
    [J]. ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2004, 2997 : 327 - 337
  • [22] Developing a test collection for biomedical word sense disambiguation
    Weeber, M
    Mork, JG
    Aronson, AR
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2001, : 746 - 750
  • [23] Can multilinguality improve Biomedical Word Sense Disambiguation?
    Duque, Andres
    Martinez-Romo, Juan
    Araujo, Lourdes
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2016, 64 : 320 - 332
  • [24] Attention Neural Network for Biomedical Word Sense Disambiguation
    Zhang, Chun-Xiang
    Pang, Shu-Yang
    Gao, Xue-Yao
    Lu, Jia-Qi
    Yu, Bo
    [J]. DISCRETE DYNAMICS IN NATURE AND SOCIETY, 2022, 2022
  • [25] An empirical study of the domain dependence of supervised word sense disambiguation systems
    Escudero, G
    Màrquez, L
    Rigau, G
    [J]. PROCEEDINGS OF THE 2000 JOINT SIGDAT CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND VERY LARGE CORPORA, 2000, : 172 - 180
  • [26] Word-Sense Disambiguation for Ontology Mapping: Concept Disambiguation using Virtual Documents and Information Retrieval Techniques
    Schadd, Frederik C.
    Roos, Nico
    [J]. JOURNAL ON DATA SEMANTICS, 2015, 4 (03) : 167 - 186
  • [27] Evaluating the Contribution of EuroWordNet and Word Sense Disambiguation to Cross-language Information Retrieval
    Clough, Paul
    Stevenson, Mark
    [J]. GWC 2004: SECOND INTERNATIONAL WORDNET CONFERENCE, PROCEEDINGS, 2003, : 97 - 105
  • [28] Word Sense Disambiguation by Information Filtering and Extraction
    Jeremy Ellman
    Ian Klincke
    John Tait
    [J]. Computers and the Humanities, 2000, 34 : 127 - 134
  • [29] Spreading semantic information by Word Sense Disambiguation
    Gutierrez, Yoan
    Vazquez, Sonia
    Montoyo, Andres
    [J]. KNOWLEDGE-BASED SYSTEMS, 2017, 132 : 47 - 61
  • [30] Word sense disambiguation using implicit information
    Jain, Goonjan
    Lobiyal, D. K.
    [J]. NATURAL LANGUAGE ENGINEERING, 2020, 26 (04) : 413 - 432