A Word Sense Disambiguation Method Applied to Natural Language Processing for the Portuguese Language

Cited by: 1
Authors
do Nascimento, Clovis Holanda [1]
Garcia, Vinicius Cardoso [1]
Araujo, Ricardo de Andrade [2]
Affiliations
[1] Univ Fed Pernambuco, Informat Ctr, BR-50670901 Recife, Brazil
[2] Fed Inst Sertao Pernambucano, Araripe Computat Intelligence Lab, BR-56200000 Ouricuri, Brazil
Funding
Natural Sciences and Engineering Research Council of Canada;
Keywords
Task analysis; Natural language processing; Libraries; Data models; Computational modeling; Context modeling; Training; Artificial intelligence; language models; natural language processing; word sense disambiguation;
DOI
10.1109/OJCS.2024.3396518
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Natural language processing (NLP) and artificial intelligence (AI) have advanced significantly in recent years, enabling the development of various tasks, such as machine translation, text summarization, sentiment analysis, and speech analysis. However, there are still challenges to overcome, such as natural language ambiguity. One of the problems caused by ambiguity is the difficulty of determining the proper meaning of a word in a specific context. For example, the word "mouse" can mean a computer peripheral or an animal, depending on the context. This limitation can lead to an incorrect semantic interpretation of the processed sentence. In recent years, language models (LMs) have provided a new impetus to NLP and AI, including in the task of word sense disambiguation (WSD). LMs are capable of learning and generating texts as they are trained on large amounts of data. However, in the Portuguese language, there are still few studies on WSD using LMs. Given this scenario, this article presents a method for WSD for the Portuguese language. To do this, it uses the BERTimbau language model, which is specific to Portuguese. The results will be evaluated using the metrics established in the literature.
Pages: 268 - 277
Number of pages: 10
Related papers
50 in total
  • [1] A critical analysis and explication of word sense disambiguation as approached by natural language processing
    Mennes, Julie
    van Gulik, Stephan van der Waart
    [J]. LINGUA, 2020, 243
  • [2] A comprehensive review on Arabic word sense disambiguation for natural language processing applications
    Kaddoura, Sanaa
    Ahmed, Rowanda D.
    Hemanth, Jude D.
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2022, 12 (04)
  • [3] An Associative Concept Dictionary for Natural Language Processing: Text Summarization and Word Sense Disambiguation
    Okamoto, Jun
    Ishizaki, Shun
    [J]. JOURNAL OF COGNITIVE SCIENCE, 2011, 12 (03) : 259 - 276
  • [4] Word Sense Disambiguation in Nepali Language
    Dhungana, Udaya Raj
    Shakya, Subarna
    [J]. 2014 FOURTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION AND COMMUNICATION TECHNOLOGY AND IT'S APPLICATIONS (DICTAP), 2014, : 46 - 50
  • [5] Research on the method of word sense disambiguation based on target language bigram
    Harbin Inst of Technology, Harbin, China
    [J]. Ruan Jian Xue Bao, 10 (21-25):
  • [6] A Literature Survey on Word Sense Disambiguation for the Hindi Language
    Gujjar, Vinto
    Mago, Neeru
    Kumari, Raj
    Patel, Shrikant
    Chintalapudi, Nalini
    Battineni, Gopi
    [J]. INFORMATION, 2023, 14 (09)
  • [7] Word sense disambiguation using heterogeneous language resources
    Shirai, K
    Tamagaki, T
    [J]. NATURAL LANGUAGE PROCESSING - IJCNLP 2004, 2005, 3248 : 377 - 385
  • [8] Word Sense Disambiguation of Polysemy Words in Kannada Language
    Shashank, N. S.
    Kallimani, Jagadish S.
    [J]. 2017 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2017, : 641 - 644
  • [9] Word sense disambiguation of Thai language with unsupervised learning
    Pongpinigpinyo, S
    Rivepiboon, W
    [J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 1, PROCEEDINGS, 2005, 3681 : 1275 - 1283
  • [10] Survey of the Word Sense Disambiguation and Challenges for the Slovak Language
    Hladek, Daniel
    Stas, Jan
    Pleva, Matus
    Ondas, Stanislav
    Kovacs, Laszlo
    [J]. 2016 17TH IEEE INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS (CINTI 2016), 2016, : 225 - 229