Toward Universal Word Sense Disambiguation Using Deep Neural Networks

被引:9
|
作者
Calvo, Hiram [1 ]
Rocha-Ramirez, Arturo P. [1 ]
Moreno-Armendariz, Marco A. [1 ]
Duchanoy, Carlos A. [1 ,2 ]
机构
[1] Inst Politecn Nacl JD Batiz E MO Mendizabal, Ctr Invest Comp, Mexico City 07738, DF, Mexico
[2] Catedra CONACyT, Mexico City 03940, DF, Mexico
关键词
Word sense disambiguation; recurrent neural networks; LSTM; multilayer perceptron; senseval english lexical sample test;
D O I
10.1109/ACCESS.2019.2914921
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Traditionally, approaches based on neural networks to solve the problem of disambiguation of the meaning of words (WSD) use a set of classidiers at the end, which results in a specialization in a single set of words-those for which they were trained. This makes impossible to apply the learned models to words not previously seen in the training corpus. This paper seeks to address a generalization of the problem of WSD in order to solve it through deep neural networks without limiting the method to a fixed set of words, with a performance close to the state-of-the-art, and an acceptable computational cost. We explore different architectures based on multilayer perceptrons, recurrent cells (Long Short-Term Memory-LSTM and Gated Recurrent Units-GRU), and a classifier model. Different sources and dimensions of embeddings were tested as well. The main evaluation was performed on the Senseval 3 English Lexical Sample. To evaluate the application to an unseen set of words, learned models are evaluated in the completely unseen words of a different corpus (Senseval 2 English Lexical Sample), overcoming the random baseline.
引用
收藏
页码:60264 / 60275
页数:12
相关论文
共 50 条
  • [41] Sense Space for Word Sense Disambiguation
    Kang, Myung Yun
    Min, Tae Hong
    Lee, Jae Sung
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2018, : 669 - 672
  • [42] Word sense disambiguation based on word sense clustering
    Anaya-Sanchez, Henry
    Pons-Porrata, Aurora
    Berlanga-Llavori, Rafael
    ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA-SBIA 2006, PROCEEDINGS, 2006, 4140 : 472 - 481
  • [43] Image disambiguation with deep neural networks
    DeGuchy, Omar
    Ho, Alex
    Marcia, Roummel F.
    APPLICATIONS OF MACHINE LEARNING, 2019, 11139
  • [44] Neural-network based word sense disambiguation method
    Zhang, Guoqing
    Zhang, Yongkui
    Jisuanji Gongcheng/Computer Engineering, 2001, 27 (12):
  • [45] Sparsity Makes Sense: Word Sense Disambiguation Using Sparse Contextualized Word Representations
    Berend, Gabor
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 8498 - 8508
  • [46] Biomedical Word Sense Disambiguation Based on Graph Attention Networks
    Zhang, Chun-Xiang
    Wang, Ming-Lei
    Gao, Xue-Yao
    IEEE ACCESS, 2022, 10 : 123328 - 123336
  • [47] Word sense disambiguation methods
    Turdakov, D. Yu.
    PROGRAMMING AND COMPUTER SOFTWARE, 2010, 36 (06) : 309 - 326
  • [48] Word sense disambiguation model
    Zhu, Jing-bo
    Yao, Tian-shun
    Dongbei Daxue Xuebao/Journal of Northeastern University, 2000, 21 (05): : 484 - 486
  • [49] Word sense disambiguation methods
    D. Yu. Turdakov
    Programming and Computer Software, 2010, 36 : 309 - 326
  • [50] Probabilistic word sense disambiguation
    Preiss, J
    COMPUTER SPEECH AND LANGUAGE, 2004, 18 (03): : 319 - 337