Assamese Word Sense Disambiguation using Supervised Learning

被引:0
|
作者
Borah, Pranjal Protim [1 ]
Talukdar, Gitimoni [1 ]
Baruah, Arup [1 ]
机构
[1] Assam Don Bosco Univ, Dept Comp Sci & Engn & IT, Gauhati, India
关键词
Lexicon; Wordnet; Local collocations; Polysemic word; Unigram cooccurence;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Word sense disambiguation (WSD) can be defined as a task that focuses on estimating the right sense of a word in its context. It is important as a preprocessing step in information extraction, machine translation, question answering and many other natural language processing tasks. Ambiguity in Word Sense arises when a particular word has more than one possible sense. Finding the correct sense requires thorough knowledge regarding words. This information of words is often derived from the sources such as words appearing in the context of the target word, part of speech information of the words in the neighbour, syntactical relations and local collocations. Our main aim in this paper is to develop an automatic system for WSD in Assamese using a Naive Bayes classifier. This is the first work to the best of our knowledge on developing an automatic WSD system for Assamese language. Assamese, the main language of most of the people in North-Eastern part of India is a morphologically very rich language. In Assamese WSD is a challenging task because a word can behave differently when combined with a suffix or a sequence of suffixes to have an entirely different sense. WSD often makes use of lexical resources such as WordNet, lexicon, annotated or unannotated corpora etc for its process of disambiguation.
引用
收藏
页码:946 / 950
页数:5
相关论文
共 50 条
  • [1] Word Sense Disambiguation for Assamese
    Sarmah, Jumi
    Sarma, Shikhar Kr
    [J]. 2016 IEEE 6TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (IACC), 2016, : 146 - 151
  • [2] Effect of Supervised Sense Disambiguation Model Using Machine Learning Technique and Word Embedding in Word Sense Disambiguation
    Mahajan, Rupesh
    Kokane, Chandrakant
    Pathak, Kishor
    Kodmelwar, Manohar
    Wagh, Kapil
    Bhandari, Mahesh
    [J]. JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (01) : 436 - 443
  • [3] Assamese Word Sense Disambiguation using Cuckoo Search Algorithm
    Gogoi, Arjun
    Baruah, Nomi
    Nath, Lakhya Jyoti
    [J]. AI IN COMPUTATIONAL LINGUISTICS, 2021, 189 : 142 - 147
  • [4] Word sense disambiguation by semi-supervised learning
    Niu, ZY
    Ji, DH
    Tan, CL
    Yang, LP
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2005, 3406 : 238 - 241
  • [5] Applying active learning to supervised word sense disambiguation in MEDLINE
    Chen, Yukun
    Cao, Hongxin
    Mei, Qiaozhu
    Zheng, Kai
    Xu, Hua
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2013, 20 (05) : 1001 - 1006
  • [6] Retraining: The Semi-Supervised Learning of the Word Sense Disambiguation
    Suarez, Armando
    Palomar, Manuel
    Rigau, German
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2005, (34):
  • [7] Implementation of Walker Algorithm In Word Sense Disambiguation for Assamese language
    Kalita, Purabi
    Barman, Anup Kumar
    [J]. 2015 INTERNATIONAL SYMPOSIUM ON ADVANCED COMPUTING AND COMMUNICATION (ISACC), 2015, : 136 - 140
  • [8] Supervised word sense disambiguation using semantic diffusion kernel
    Wang, Tinghua
    Rao, Junyang
    Hu, Qi
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2014, 27 : 167 - 174
  • [9] Word sense disambiguation based on semi-supervised ensemble learning
    Zhang, Chunxiang
    Xiong, Jingzhao
    Gao, Xueyao
    [J]. Harbin Gongcheng Daxue Xuebao/Journal of Harbin Engineering University, 2020, 41 (08): : 1216 - 1222
  • [10] Investigating problems of semi-supervised learning for word sense disambiguation
    Le, Anh-Cuong
    Shimazu, Akira
    Nguyen, Le-Minh
    [J]. COMPUTER PROCESSING OF ORIENTAL LANGUAGES, PROCEEDINGS: BEYOND THE ORIENT: THE RESEARCH CHALLENGES AHEAD, 2006, 4285 : 482 - +