A metaheuristic with a neural surrogate function for Word Sense Disambiguation

被引:1
|
作者
Nodehi, Azim Keshavarzian [1 ]
Charkari, Nasrollah Moghadam [1 ]
机构
[1] Tarbiat Modares Univ, Tehran, Iran
来源
关键词
Word Sense Disambiguation; Metaheuristics; Surrogate Functions; Sense Mapping;
D O I
10.1016/j.mlwa.2022.100369
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Word Sense Disambiguation (WSD) is one of the earliest problems in natural language processing which aims to determine the correct sense of words in context. The semantic information provided by WSD systems is highly beneficial to many tasks such as machine translation, information extraction, and semantic parsing. In this work, a new approach for WSD is proposed which uses a neural network as a surrogate fitness function in a metaheuristic algorithm. Also, a new method for simultaneous training of word and sense embeddings is proposed in this work. Accordingly, the node2vec algorithm is employed on the WordNet graph to generate sequences containing both words and senses. These sequences are then used along with paragraphs from Wikipedia in the word2vec algorithm to generate embeddings for words and senses at the same time. In order to address data imbalance in this task, sense probability distribution data extracted from the training corpus is used in the search process of the proposed simulated annealing algorithm. Furthermore, we introduce a new approach for clustering and mapping senses in the WordNet graph, which considerably improves the accuracy of the proposed method. In this approach, nodes in the WordNet graph are clustered on the condition that no two senses of the same word be present in one cluster. Then, repeatedly, all nodes in each cluster are mapped to a randomly selected node from that cluster, meaning that the representative node can take advantage of the training instances of all the other nodes in the cluster. Training the proposed method in this work is done using the SemCor dataset and the SemEval-2015 dataset has been used as the validation set. The final evaluation of the system is performed on SensEval-2, SensEval-3, SemEval-2007, SemEval-2013, SemEval-2015, and the concatenation of all five mentioned datasets. The performance of the system is also evaluated on the four content word categories, namely, nouns, verbs, adjectives, and adverbs. Experimental results show that the proposed method achieves accuracies in the range of 74.8 to 84.6 percent in the ten aforementioned evaluation categories which are close to and in some cases better than the state of the art in this task.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] MultiMirror: Neural Cross-lingual Word Alignment for Multilingual Word Sense Disambiguation
    Sapienza NLP Group, Department of Computer Science, Sapienza University of Rome
    IJCAI Int. Joint Conf. Artif. Intell., 2021, (3915-3921):
  • [32] Biomedical Word Sense Disambiguation with Word Embeddings
    Antunes, Rui
    Matos, Sergio
    11TH INTERNATIONAL CONFERENCE ON PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY & BIOINFORMATICS, 2017, 616 : 273 - 279
  • [33] Word sense disambiguation of english modal verb must by neural network
    Yu, Jianping
    An, Lin
    Fu, Jilin
    ICIC Express Letters, 2010, 4 (01): : 83 - 88
  • [34] Toward Universal Word Sense Disambiguation Using Deep Neural Networks
    Calvo, Hiram
    Rocha-Ramirez, Arturo P.
    Moreno-Armendariz, Marco A.
    Duchanoy, Carlos A.
    IEEE ACCESS, 2019, 7 : 60264 - 60275
  • [35] A Novel Neural Sequence Model with Multiple Attentions for Word Sense Disambiguation
    Ahmed, Mahtab
    Samee, Muhammad Rifayat
    Mercer, Robert E.
    2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, : 687 - 694
  • [36] The model of word sense disambiguation combining statistics and BP neural networks
    Xiangfan Radio and TV University, Xiangfan 441021, China
    Wuhan Ligong Daxue Xuebao, 2006, 8 (131-134):
  • [37] Word Sense Indicators: Effective Feature for Chinese Word Sense Disambiguation
    Quan, Changqin
    Ren, Fuji
    He, Tingting
    INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2009, 12 (05): : 1157 - 1164
  • [38] A Word Sense Disambiguation Technique for Sinhala
    Arukgoda, Janindu
    Bandara, Vidudaya
    Bashani, Samiththa
    Gamage, Vijayindu
    Wimalasuriya, Daya
    PROCEEDINGS 2014 4TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE WITH APPLICATIONS IN ENGINEERING AND TECHNOLOGY ICAIET 2014, 2014, : 207 - 211
  • [39] Genetic Word Sense Disambiguation Algorithm
    Zhang, ChunHui
    Zhou, Yiming
    Martin, Trevor
    2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL I, PROCEEDINGS, 2008, : 123 - +
  • [40] An Improved Word Sense Disambiguation Method
    Yu, Linlin
    Song, Lifang
    Sun, Jianyan
    Li, Lin
    2016 6TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY FOR MANUFACTURING SYSTEMS (ITMS 2016), 2016, : 153 - 155