Toward Universal Word Sense Disambiguation Using Deep Neural Networks

被引:9
|
作者
Calvo, Hiram [1 ]
Rocha-Ramirez, Arturo P. [1 ]
Moreno-Armendariz, Marco A. [1 ]
Duchanoy, Carlos A. [1 ,2 ]
机构
[1] Inst Politecn Nacl JD Batiz E MO Mendizabal, Ctr Invest Comp, Mexico City 07738, DF, Mexico
[2] Catedra CONACyT, Mexico City 03940, DF, Mexico
关键词
Word sense disambiguation; recurrent neural networks; LSTM; multilayer perceptron; senseval english lexical sample test;
D O I
10.1109/ACCESS.2019.2914921
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Traditionally, approaches based on neural networks to solve the problem of disambiguation of the meaning of words (WSD) use a set of classidiers at the end, which results in a specialization in a single set of words-those for which they were trained. This makes impossible to apply the learned models to words not previously seen in the training corpus. This paper seeks to address a generalization of the problem of WSD in order to solve it through deep neural networks without limiting the method to a fixed set of words, with a performance close to the state-of-the-art, and an acceptable computational cost. We explore different architectures based on multilayer perceptrons, recurrent cells (Long Short-Term Memory-LSTM and Gated Recurrent Units-GRU), and a classifier model. Different sources and dimensions of embeddings were tested as well. The main evaluation was performed on the Senseval 3 English Lexical Sample. To evaluate the application to an unseen set of words, learned models are evaluated in the completely unseen words of a different corpus (Senseval 2 English Lexical Sample), overcoming the random baseline.
引用
收藏
页码:60264 / 60275
页数:12
相关论文
共 50 条
  • [1] Chinese word sense disambiguation based on neural networks
    刘挺
    卢志茂
    郎君
    李生
    Journal of Harbin Institute of Technology(New series), 2005, (04) : 408 - 414
  • [2] A Lexicographic Encoding for Word Sense Disambiguation with Evolutionary Neural Networks
    Azzini, A.
    Pereira, C. da Costa
    Dragoni, M.
    Tettamanzi, A. G. B.
    AI (ASTERISK) IA 2009: EMERGENT PERSPECTIVES IN ARTIFICIAL INTELLIGENCE, 2009, 5883 : 192 - 201
  • [3] A Practical Approach for Representing Context and for Performing Word Sense Disambiguation Using Neural Networks
    Gallant, Stephen I.
    NEURAL COMPUTATION, 1991, 3 (03) : 293 - 309
  • [4] Word Sense Disambiguation with Semantic Networks
    Tsatsaronis, George
    Varlamis, Iraklis
    Vazirgiannis, Michalis
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2008, 5246 : 219 - 226
  • [5] The model of word sense disambiguation combining statistics and BP neural networks
    Xiangfan Radio and TV University, Xiangfan 441021, China
    Wuhan Ligong Daxue Xuebao, 2006, 8 (131-134):
  • [6] deepBioWSD: effective deep neural word sense disambiguation of biomedical text data
    Pesaranghader, Ahmad
    Matwin, Stan
    Sokolova, Marina
    Pesaranghader, Ali
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2019, 26 (05) : 438 - 446
  • [7] Incorporating Glosses into Neural Word Sense Disambiguation
    Luo, Fuli
    Liu, Tianyu
    Xia, Qiaolin
    Chang, Baobao
    Sui, Zhifang
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 2473 - 2482
  • [8] Word Sense Disambiguation of Medical Terms via Recurrent Convolutional Neural Networks
    Festag, Sven
    Spreckelsen, Cord
    HEALTH INFORMATICS MEETS EHEALTH: DIGITAL INSIGHT - INFORMATION-DRIVEN HEALTH & CARE, 2017, 236 : 8 - 15
  • [9] Word Sense Disambiguation Based on Semi-Supervised Convolutional Neural Networks
    Zhang C.
    Tang L.
    Gao X.
    Xinan Jiaotong Daxue Xuebao/Journal of Southwest Jiaotong University, 2022, 57 (01): : 11 - 17and27
  • [10] Word sense disambiguation for Punjabi language using deep learning techniques
    Singh, Varinder Pal
    Kumar, Parteek
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (08): : 2963 - 2973