A metaheuristic with a neural surrogate function for Word Sense Disambiguation

被引:1
|
作者
Nodehi, Azim Keshavarzian [1 ]
Charkari, Nasrollah Moghadam [1 ]
机构
[1] Tarbiat Modares Univ, Tehran, Iran
来源
关键词
Word Sense Disambiguation; Metaheuristics; Surrogate Functions; Sense Mapping;
D O I
10.1016/j.mlwa.2022.100369
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Word Sense Disambiguation (WSD) is one of the earliest problems in natural language processing which aims to determine the correct sense of words in context. The semantic information provided by WSD systems is highly beneficial to many tasks such as machine translation, information extraction, and semantic parsing. In this work, a new approach for WSD is proposed which uses a neural network as a surrogate fitness function in a metaheuristic algorithm. Also, a new method for simultaneous training of word and sense embeddings is proposed in this work. Accordingly, the node2vec algorithm is employed on the WordNet graph to generate sequences containing both words and senses. These sequences are then used along with paragraphs from Wikipedia in the word2vec algorithm to generate embeddings for words and senses at the same time. In order to address data imbalance in this task, sense probability distribution data extracted from the training corpus is used in the search process of the proposed simulated annealing algorithm. Furthermore, we introduce a new approach for clustering and mapping senses in the WordNet graph, which considerably improves the accuracy of the proposed method. In this approach, nodes in the WordNet graph are clustered on the condition that no two senses of the same word be present in one cluster. Then, repeatedly, all nodes in each cluster are mapped to a randomly selected node from that cluster, meaning that the representative node can take advantage of the training instances of all the other nodes in the cluster. Training the proposed method in this work is done using the SemCor dataset and the SemEval-2015 dataset has been used as the validation set. The final evaluation of the system is performed on SensEval-2, SensEval-3, SemEval-2007, SemEval-2013, SemEval-2015, and the concatenation of all five mentioned datasets. The performance of the system is also evaluated on the four content word categories, namely, nouns, verbs, adjectives, and adverbs. Experimental results show that the proposed method achieves accuracies in the range of 74.8 to 84.6 percent in the ten aforementioned evaluation categories which are close to and in some cases better than the state of the art in this task.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Incorporating Glosses into Neural Word Sense Disambiguation
    Luo, Fuli
    Liu, Tianyu
    Xia, Qiaolin
    Chang, Baobao
    Sui, Zhifang
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 2473 - 2482
  • [2] Chinese word sense disambiguation based on neural networks
    刘挺
    卢志茂
    郎君
    李生
    Journal of Harbin Institute of Technology(New series), 2005, (04) : 408 - 414
  • [3] Word Sense Disambiguation Based on Convolution Neural Network
    Zhang C.-X.
    Zhao L.-Y.
    Gao X.-Y.
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2019, 42 (03): : 114 - 119
  • [4] Integrating Personalized PageRank into Neural Word Sense Disambiguation
    EISheikh, Ahmed
    Bevilacqua, Michele
    Navigli, Roberto
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 9092 - 9098
  • [5] Neural Network Models for Word Sense Disambiguation: An Overview
    Popov, Alexander
    CYBERNETICS AND INFORMATION TECHNOLOGIES, 2018, 18 (01) : 139 - 151
  • [6] Attention Neural Network for Biomedical Word Sense Disambiguation
    Zhang, Chun-Xiang
    Pang, Shu-Yang
    Gao, Xue-Yao
    Lu, Jia-Qi
    Yu, Bo
    DISCRETE DYNAMICS IN NATURE AND SOCIETY, 2022, 2022
  • [7] Comparative Study on Weight Function for Word Sense Disambiguation
    Lu, Wenpeng
    2011 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), VOLS 1-4, 2012, : 2623 - 2626
  • [8] Sense Space for Word Sense Disambiguation
    Kang, Myung Yun
    Min, Tae Hong
    Lee, Jae Sung
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2018, : 669 - 672
  • [9] Word sense disambiguation based on word sense clustering
    Anaya-Sanchez, Henry
    Pons-Porrata, Aurora
    Berlanga-Llavori, Rafael
    ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA-SBIA 2006, PROCEEDINGS, 2006, 4140 : 472 - 481
  • [10] Neural-network based word sense disambiguation method
    Zhang, Guoqing
    Zhang, Yongkui
    Jisuanji Gongcheng/Computer Engineering, 2001, 27 (12):