Toward Universal Word Sense Disambiguation Using Deep Neural Networks

Cited by: 9
Authors
Calvo, Hiram [1]
Rocha-Ramirez, Arturo P. [1]
Moreno-Armendariz, Marco A. [1]
Duchanoy, Carlos A. [1, 2]
Affiliations
[1] Inst Politecn Nacl JD Batiz E MO Mendizabal, Ctr Invest Comp, Mexico City 07738, DF, Mexico
[2] Catedra CONACyT, Mexico City 03940, DF, Mexico
Keywords
Word sense disambiguation; recurrent neural networks; LSTM; multilayer perceptron; Senseval English Lexical Sample test
DOI
10.1109/ACCESS.2019.2914921
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Traditionally, neural network approaches to word sense disambiguation (WSD) place a set of classifiers at the end of the network, which specializes the model to a single set of words, namely those for which it was trained. This makes it impossible to apply the learned models to words not previously seen in the training corpus. This paper seeks to generalize the WSD problem so that it can be solved with deep neural networks without limiting the method to a fixed set of words, with performance close to the state of the art and at an acceptable computational cost. We explore different architectures based on multilayer perceptrons, recurrent cells (Long Short-Term Memory, LSTM, and Gated Recurrent Units, GRU), and a classifier model. Different sources and dimensions of embeddings were tested as well. The main evaluation was performed on the Senseval 3 English Lexical Sample. To evaluate the application to an unseen set of words, the learned models are evaluated on the completely unseen words of a different corpus (Senseval 2 English Lexical Sample), where they outperform the random baseline.
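The random baseline mentioned in the abstract can be illustrated with a minimal sketch. It assumes the baseline is the expected accuracy of choosing uniformly at random among the candidate senses of each target word; this record does not define the baseline, so that reading is an assumption. The function name and the toy sense counts are hypothetical and are not taken from the paper or from Senseval.

```python
from fractions import Fraction


def random_baseline_accuracy(sense_counts):
    """Expected accuracy of guessing a sense uniformly at random.

    sense_counts holds one entry per test instance: the number of
    candidate senses of that instance's target word (hypothetical input).
    """
    if not sense_counts:
        raise ValueError("need at least one test instance")
    if any(k < 1 for k in sense_counts):
        raise ValueError("every target word needs at least one sense")
    # A uniform guess over k senses is right with probability 1/k;
    # the baseline is the mean of those probabilities over all instances.
    return sum(Fraction(1, k) for k in sense_counts) / len(sense_counts)


# Toy data only (assumption): four instances with 2, 4, 4 and 5 senses.
toy_counts = [2, 4, 4, 5]
print(float(random_baseline_accuracy(toy_counts)))  # 0.3
```

Under this reading, a model applied to unseen words beats the baseline when its accuracy on those instances exceeds this value.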
Pages: 60264-60275
Number of pages: 12