Capsule Network Improved Multi-Head Attention for Word Sense Disambiguation

Cited by: 0
Authors
Cheng, Jinfeng [1]
Tong, Weiqin [1,2]
Yan, Weian [1]
Affiliations
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
[2] Shanghai Univ, Shanghai Inst Adv Commun & Data Sci, Shanghai 200444, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2021 / Volume 11 / Issue 06
Keywords
word sense disambiguation; multi-head attention; capsule network; capsule routing
DOI
10.3390/app11062488
Chinese Library Classification number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
Word sense disambiguation (WSD), one of the core problems in natural language processing (NLP), is the task of mapping an ambiguous word to its correct meaning in a specific context. Recent studies have shown strong interest in incorporating sense definitions (glosses) into neural networks, which has contributed substantially to improving WSD performance. However, disambiguating polysemous words with rare senses remains hard. In this paper, while taking glosses into consideration, we further improve the performance of the WSD system from the perspective of semantic representation. We encode the context and the sense glosses of the target polysemous word independently, using encoders with the same structure. To obtain a better representation in each encoder, we leverage a capsule network to capture the different important information contained in multi-head attention. Finally, we choose the gloss representation closest to the context representation of the target word as its correct sense. We conduct experiments on the English all-words WSD task. Experimental results show that our method achieves good performance, with an especially encouraging effect on disambiguating words with rare senses.
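The final selection step described in the abstract (choosing the gloss representation closest to the context representation) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how closeness is measured, so cosine similarity is an assumption here, and the function names, sense labels, and toy vectors are hypothetical placeholders standing in for real encoder outputs.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length vectors (assumed closeness measure)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # treat a zero vector as dissimilar to everything
    return dot / (norm_u * norm_v)


def pick_sense(context_vec, gloss_vecs):
    """Return the sense label whose gloss vector is most similar to the context vector."""
    return max(
        gloss_vecs,
        key=lambda sense: cosine_similarity(context_vec, gloss_vecs[sense]),
    )


# Toy vectors standing in for encoder outputs (hypothetical values and labels).
context = [0.9, 0.1, 0.0]
glosses = {
    "bank_financial": [1.0, 0.0, 0.1],
    "bank_river": [0.0, 1.0, 0.2],
}

best_sense = pick_sense(context, glosses)
```

In a real system the vectors would come from the context encoder and the gloss encoder; only the comparison step is shown here.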
Pages: 14
Related papers
50 in total
  • [21] ResGAT: an improved graph neural network based on multi-head attention mechanism and residual network for paper classification
    Xuejian Huang
    Zhibin Wu
    Gensheng Wang
    Zhipeng Li
    Yuansheng Luo
    Xiaofang Wu
    Scientometrics, 2024, 129 : 1015 - 1036
  • [22] Improved Multi-Head Self-Attention Classification Network for Multi-View Fetal Echocardiography Recognition
    Zhang, Yingying
    Zhu, Haogang
    Wang, Yan
    Wang, Jingyi
    He, Yihua
    2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2023,
  • [23] ResGAT: an improved graph neural network based on multi-head attention mechanism and residual network for paper classification
    Huang, Xuejian
    Wu, Zhibin
    Wang, Gensheng
    Li, Zhipeng
    Luo, Yuansheng
    Wu, Xiaofang
    SCIENTOMETRICS, 2024, 129 (02) : 1015 - 1036
  • [24] Combining Multi-Head Attention and Sparse Multi-Head Attention Networks for Session-Based Recommendation
    Zhao, Zhiwei
    Wang, Xiaoye
    Xiao, Yingyuan
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [25] WORD SENSE DISAMBIGUATION BASED ON IMPROVED BAYESIAN CLASSIFIERS
    Liu Ting, Lu Zhimao, Li Sheng (Computer Science & Technology School
    Journal of Electronics (China), 2006, (03) : 394 - 398
  • [26] Math Word Problem Solving: Operator and Template Techniques with Multi-Head Attention
    Sarkar, Sandip
    Das, Dipankar
    Pakray, Partha
    Pinto-Avendano, David Eduardo
    COMPUTACION Y SISTEMAS, 2023, 27 (04): : 1075 - 1088
  • [27] WORD SENSE DISAMBIGUATION BASED ON IMPROVED BAYESIAN CLASSIFIERS
    Liu Ting, Lu Zhimao, Li Sheng (Computer Science Technology School, Harbin Institute of Technology, Harbin, China; Computer Science Technology School, Harbin Engineering University, Harbin, China)
    Journal of Electronics, 2006, (03) : 394 - 398
  • [28] Multi-head attention graph convolutional network model: End-to-end entity and relation joint extraction based on multi-head attention graph convolutional network
    Tao, Zhihua
    Ouyang, Chunping
    Liu, Yongbin
    Chung, Tonglee
    Cao, Yixin
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2023, 8 (02) : 468 - 477
  • [29] Biomedical Word Sense Disambiguation Based on Graph Attention Networks
    Zhang, Chun-Xiang
    Wang, Ming-Lei
    Gao, Xue-Yao
    IEEE ACCESS, 2022, 10 : 123328 - 123336
  • [30] Improved Convolutional Neural Network Based on Multi-head Attention Mechanism for Industrial Process Fault Classification
    Cui, Wenzhi
    Deng, Xiaogang
    Zhang, Zheng
    PROCEEDINGS OF 2020 IEEE 9TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS'20), 2020, : 918 - 922