An Empirical Study of Word Sense Disambiguation for Biomedical Information Retrieval System

Cited by: 0
Authors
Rais, Mohammed [1]
Lachkar, Abdelmonaime [1]
Affiliations
[1] USMBA, Dept Elect & Comp Engn, LISA, ENSA, Fes, Morocco
Keywords
Natural language processing; Biomedical Information Retrieval; Word Sense Disambiguation; Biomedical indexing methods; Strategy of disambiguation; Conceptualization; Sense based indexing; RELATEDNESS
DOI
10.1007/978-3-319-78723-7_27
Chinese Library Classification number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Document representation is an important stage in indexing biomedical documents. The usual way to represent a text is as a bag of words (BoW). This representation lacks sense information because it ignores the semantics of the original text; conceptualization using background knowledge, by contrast, enriches document representation models. Three strategies can be used to carry out the conceptualization task: Adding Concept, Partial Conceptualization, and Complete Conceptualization. When the senses of a polysemous term are looked up in semantic resources, multiple matches are found, which introduces ambiguity into the final document representation. Three disambiguation strategies can be used: First Concept, All Concepts, and Context-Based. SenseRelate is a well-known Context-Based algorithm that uses a fixed window size and applies a distance weight reflecting how far each context term is from the target word. Because this weighting may negatively affect the selected concepts or senses, we propose a simple modified version of the SenseRelate algorithm, NoDistanceSenseRelate, which ignores distance; that is, all terms in the context receive the same distance weight. To evaluate the effect of the conceptualization and disambiguation strategies on the indexing process, we conducted several experiments with a biomedical information retrieval system on the OHSUMED corpus. The results show that the Context-Based methods (SenseRelate and NoDistanceSenseRelate) outperform the other strategies when the Adding Concept conceptualization strategy is applied. These results provide evidence for the value of adding concept senses to the term representation in the IR process.
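The three disambiguation strategies named in the abstract, and the difference between a distance-weighted and a uniform context window, can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the toy sense inventory `SENSES`, the toy scores in `RELATEDNESS`, and in particular the inverse-distance weight, since the abstract does not state which weighting SenseRelate applies. It is not the authors' implementation.

```python
from typing import Callable, Dict, List, Sequence, Tuple

# Hypothetical toy sense inventory: term -> candidate concept identifiers.
SENSES: Dict[str, List[str]] = {
    "cold": ["low_temperature", "common_cold"],
}

# Hypothetical toy relatedness scores between a concept and a context term.
RELATEDNESS: Dict[Tuple[str, str], float] = {
    ("common_cold", "virus"): 0.9,
    ("common_cold", "winter"): 0.3,
    ("low_temperature", "winter"): 0.8,
    ("low_temperature", "virus"): 0.1,
}


def relatedness(concept: str, term: str) -> float:
    """Look up a toy relatedness score; unknown pairs score 0."""
    return RELATEDNESS.get((concept, term), 0.0)


def first_concept(term: str) -> List[str]:
    """First Concept strategy: keep only the first listed sense."""
    return SENSES[term][:1]


def all_concepts(term: str) -> List[str]:
    """All Concepts strategy: keep every candidate sense."""
    return list(SENSES[term])


def inverse_distance(distance: int) -> float:
    """Assumed distance weight: nearer context terms count more."""
    return 1.0 / distance


def no_distance(distance: int) -> float:
    """Uniform weight: every context term counts equally."""
    return 1.0


def context_based(
    tokens: Sequence[str],
    target_index: int,
    window: int,
    weight: Callable[[int], float],
) -> str:
    """Context-Based strategy: pick the sense most related to the window."""
    target = tokens[target_index]
    lo = max(0, target_index - window)
    hi = min(len(tokens), target_index + window + 1)
    scores: Dict[str, float] = {}
    for concept in SENSES[target]:
        total = 0.0
        for i in range(lo, hi):
            if i == target_index:
                continue
            total += weight(abs(i - target_index)) * relatedness(concept, tokens[i])
        scores[concept] = total
    return max(scores, key=scores.get)


tokens = ["winter", "cold", "patient", "fever", "virus"]
weighted_choice = context_based(tokens, 1, window=3, weight=inverse_distance)
uniform_choice = context_based(tokens, 1, window=3, weight=no_distance)
```

With this toy data the two weightings disagree: the distance-weighted variant favors the adjacent term "winter" and selects `low_temperature`, while the uniform variant lets the more distant "virus" count fully and selects `common_cold`. Which behavior is preferable on real data is the empirical question the study addresses.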
Pages: 314-326
Number of pages: 13
Related papers
50 in total
  • [1] Word sense disambiguation for Information Retrieval
    Uzuner, O
    Katz, B
    Yuret, D
    [J]. SIXTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-99)/ELEVENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE (IAAI-99), 1999, : 985 - 985
  • [2] Information Retrieval with Word Sense Disambiguation for Spanish
    Ledo Mezquita, Yoel
    [J]. COMPUTACION Y SISTEMAS, 2008, 11 (03): : 288 - 300
  • [3] Arabic Word Sense Disambiguation for Information Retrieval
    Abderrahim, Mohammed Alaeddine
    Abderrahim, Mohammed El-Amine
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (04)
  • [4] Information retrieval by means of word sense disambiguation
    Ureña, LA
    Hidalgo, JMG
    de Buenaga, M
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2000, 1902 : 93 - 98
  • [5] An Intelligent Information Retrieval System Using Automatic Word Sense Disambiguation
    Ramasubramanian, Prasanna G.
    Agah, Arvin
    Gauch, Susan E.
    [J]. JOURNAL OF INTELLIGENT SYSTEMS, 2007, 16 (02) : 135 - 166
  • [6] A word sense disambiguation algorithm for information retrieval applications
    Pascucci, G
    Spadaro, S
    [J]. ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS 2003: OTM 2003 WORKSHOPS, 2003, 2889 : 306 - 317
  • [7] Biomedical Word Sense Disambiguation with Word Embeddings
    Antunes, Rui
    Matos, Sergio
    [J]. 11TH INTERNATIONAL CONFERENCE ON PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY & BIOINFORMATICS, 2017, 616 : 273 - 279
  • [8] Word sense disambiguation for cross-language information retrieval
    Liu, MX
    Diamond, T
    Diekema, AR
    [J]. 6TH APPLIED NATURAL LANGUAGE PROCESSING CONFERENCE/1ST MEETING OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE AND PROCEEDINGS OF THE ANLP-NAACL 2000 STUDENT RESEARCH WORKSHOP, 2000, : B35 - B40
  • [9] Word Sense Disambiguation based on IDF applied to Information Retrieval
    Perea-Ortega, Jose M.
    Martinez-Santiago, Fernando
    Garcia-Cumbreras, Miguel A.
    Montejo-Raez, Arturo
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2011, (46): : 99 - 106
  • [10] Analysis of Word Sense Disambiguation-Based Information Retrieval
    Guyot, Jacques
    Falquet, Gilles
    Radhouani, Said
    Benzineb, Karim
    [J]. EVALUATING SYSTEMS FOR MULTILINGUAL AND MULTIMODAL INFORMATION ACCESS, 2009, 5706 : 146 - 154