Word Sense Disambiguation in Bengali: a Knowledge based Approach using Bengali WordNet

Authors
Pal, Alok Ranjan [1 ]
Saha, Diganta [2 ]
Naskar, Sudip Kumar [2 ]
Affiliations
[1] Coll Engn & Mgmt, Kolaghat, India
[2] Jadavpur Univ, Kolkata, India
Keywords
Natural Language Processing; Word Sense Disambiguation; Knowledge base; WordNet; Maximum Overlap;
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Discipline classification code
081202 ;
Abstract
This paper presents a knowledge-based approach to Word Sense Disambiguation (WSD) in Bengali. The Bengali WordNet developed at ISI Kolkata serves as the knowledge base, and the input data set is prepared from the Bengali Text Corpus developed in the TDIL (Technology Development for Indian Languages) project of the Government of India. The proposed approach resolves the sense of an ambiguous Bengali word by finding the maximum overlap between the dictionary definitions of the ambiguous word and its collocating words in the sentence, together with the synonyms of those collocating words. The algorithm is tested on nine of the most frequently used ambiguous Bengali words and achieves 75% accuracy, as verified by an expert. The challenges and pitfalls of this approach are discussed in detail.
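The maximum-overlap idea described in the abstract can be illustrated with a minimal Lesk-style sketch. This is not the authors' implementation: the function names, the toy English glosses, the synonym table, and the stopword list are all hypothetical stand-ins for the Bengali WordNet and the TDIL corpus, and the first-sense tie-breaking rule is an assumption.

```python
from typing import Dict, FrozenSet, List, Set, Tuple

PUNCT = ".,;:!?\"'()"

def tokenize(text: str) -> Set[str]:
    """Lowercase, split on whitespace, strip surrounding punctuation."""
    return {w.strip(PUNCT).lower() for w in text.split() if w.strip(PUNCT)}

def disambiguate(
    target: str,
    sentence: str,
    glosses: Dict[str, str],          # sense label -> dictionary definition
    synonyms: Dict[str, List[str]],   # word -> synonyms (hypothetical lookup)
    stopwords: FrozenSet[str] = frozenset(),
) -> Tuple[str, int]:
    """Return the sense whose gloss overlaps most with the context words
    (collocating words plus their synonyms), and the overlap count."""
    # Collocating words: everything in the sentence except the target itself.
    context = tokenize(sentence) - {target.lower()} - stopwords
    # Expand the context with the synonyms of each collocating word.
    expanded = set(context)
    for word in context:
        expanded |= {s.lower() for s in synonyms.get(word, [])}
    best_sense, best_score = "", -1
    for sense, gloss in glosses.items():
        score = len((tokenize(gloss) - stopwords) & expanded)
        # Assumption: on a tie, the sense listed first is kept.
        if score > best_score:
            best_sense, best_score = sense, score
    return best_sense, best_score

# Toy English data standing in for the Bengali resources (assumed values).
GLOSSES = {
    "finance": "an institution that accepts deposits of cash and lends money",
    "river": "sloping land beside a body of water",
}
SYNONYMS = {"money": ["cash", "currency"]}
STOPWORDS = frozenset({"a", "an", "the", "of", "at", "he", "that", "and"})

result = disambiguate(
    "bank", "He deposited money at the bank.", GLOSSES, SYNONYMS, STOPWORDS
)
print(result)
```

In this toy run the "finance" gloss shares "money" and "cash" with the expanded context, so it is selected. Exact surface-form matching is a simplification: "deposited" does not match "deposits" here, which hints at why morphological processing matters for an overlap-based method.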
Pages: 5