An Improved Approach to Word Sense Disambiguation

被引:0
|
作者
Sachdeva, Pradeep [1 ]
Verma, Surbhi [1 ]
Singh, Sandeep Kumar [1 ]
机构
[1] JIlT, Dept Comp Sci & Informat Technol, Noida, India
关键词
WordNet; Word Sense Disambiguation; Intersection; Scoring of Senses; Distance measure;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Words in the English language often correspond to different meanings in different contexts. Such words are referred to as polysemous words i.e. words having more than one sense. This paper presents a knowledge based algorithm for disambiguating polysemous words using computational linguistics tool, WordNet. The task of word sense disambiguation requires finding out the similarity between the target word (word to be disambiguated) and the nearby words (words surrounding the target word in input text). Algorithms in the past have calculated similarity either by finding out the number of common words (intersection) between the glosses (definitions/meanings) of the target and nearby words, or by finding out the exact occurrence of the nearby word's sense in the hierarchy (hypernyms) of the target word's senses. This paper proposes an algorithm which modifies the above two parameters by computing intersection using not only the glosses but also by including the related words. Also the intersection is computed for the entire hierarchy of the target and nearby words. It also incorporates a third parameter 'distance' (between target and nearby words). The proposed approach incorporates more parameters for calculating similarity, which has not been attempted by any of the previous approaches. It scores the senses based on the overall impact of three parameters i.e. intersection, hierarchy and distance and then chooses the sense with the highest score. The algorithm has been evaluated on SemCor which is the largest available sense-tagged corpus. The proposed algorithm achieves a precision of 53.12% for Topl results, 59.91% for Top2 results and that of 62.13% for Top3 results which is better than other knowledge based approaches.
引用
收藏
页码:235 / 240
页数:6
相关论文
共 50 条
  • [21] An Improved supervised word sense disambiguation methods in the biomedical domain
    He, JiaCong
    Ren, Kai
    Yu, WeiJie
    [J]. INDUSTRIAL INSTRUMENTATION AND CONTROL SYSTEMS, PTS 1-4, 2013, 241-244 : 3103 - +
  • [22] AN IMPROVED UNSUPERVISED LEARNING PROBABILISTIC MODEL OF WORD SENSE DISAMBIGUATION
    Li, Xu
    Zhao, Xiuyan
    Ban, Fenglong
    Liu, Bai
    [J]. PROCEEDINGS OF THE 2012 WORLD CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGIES, 2012, : 1071 - 1075
  • [23] ShotgunWSD 2.0: An Improved Algorithm for Global Word Sense Disambiguation
    Butnaru, Andrei M.
    Ionescu, Radu Tudor
    [J]. IEEE ACCESS, 2019, 7 : 120961 - 120975
  • [24] Word sense disambiguation methods
    Turdakov, D. Yu.
    [J]. PROGRAMMING AND COMPUTER SOFTWARE, 2010, 36 (06) : 309 - 326
  • [25] Word sense disambiguation model
    Zhu, Jing-bo
    Yao, Tian-shun
    [J]. Dongbei Daxue Xuebao/Journal of Northeastern University, 2000, 21 (05): : 484 - 486
  • [26] Probabilistic word sense disambiguation
    Preiss, J
    [J]. COMPUTER SPEECH AND LANGUAGE, 2004, 18 (03): : 319 - 337
  • [27] ARABIC WORD SENSE DISAMBIGUATION
    Merhbene, Laroussi
    Zouaghi, Anis
    Zrigui, Mounir
    [J]. ICAART 2010: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1: ARTIFICIAL INTELLIGENCE, 2010, : 652 - 655
  • [28] Word sense disambiguation methods
    D. Yu. Turdakov
    [J]. Programming and Computer Software, 2010, 36 : 309 - 326
  • [29] Word Sense Disambiguation: A Survey
    Navigli, Roberto
    [J]. ACM COMPUTING SURVEYS, 2009, 41 (02)
  • [30] Multiwords and Word Sense Disambiguation
    Arranz, V
    Atserias, J
    Castillo, M
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2005, 3406 : 250 - 262