Fine-grained attention mechanism for neural machine translation

Cited by: 122
Authors
Choi, Heeyoul [1]
Cho, Kyunghyun [2]
Bengio, Yoshua [3]
Affiliations
[1] Handong Global Univ, Pohang, South Korea
[2] NYU, Comp Sci & Data Sci, New York, NY USA
[3] Univ Montreal, Montreal, PQ, Canada
Funding
National Research Foundation of Singapore; Natural Sciences and Engineering Research Council of Canada
Keywords
Neural machine translation; Attention mechanism; Fine-grained attention
DOI
10.1016/j.neucom.2018.01.007
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Neural machine translation (NMT) has become a new paradigm in machine translation, and the attention mechanism has become the dominant approach, with state-of-the-art results on many language pairs. While there are variants of the attention mechanism, all of them use only temporal attention, where one scalar value is assigned to one context vector corresponding to a source word. In this paper, we propose a fine-grained (or 2D) attention mechanism where each dimension of a context vector receives a separate attention score. In experiments on En-De and En-Fi translation, the fine-grained attention method improves translation quality in terms of BLEU score. In addition, our alignment analysis reveals how the fine-grained attention mechanism exploits the internal structure of context vectors. © 2018 Elsevier B.V. All rights reserved.
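The contrast the abstract draws between temporal and fine-grained attention can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes the attention scores are already given (the scoring network is omitted) and that the fine-grained scores are normalized over source positions independently for each dimension. The function names `temporal_attention` and `fine_grained_attention` are hypothetical.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def temporal_attention(scores, annotations):
    """One scalar score per source position.

    scores: list of T floats.
    annotations: list of T context vectors, each of dimension D.
    Returns the weighted sum of the annotations (a D-dimensional list).
    """
    weights = softmax(scores)
    dim = len(annotations[0])
    return [sum(w * h[d] for w, h in zip(weights, annotations))
            for d in range(dim)]


def fine_grained_attention(scores, annotations):
    """One score per source position AND per dimension (assumed T x D).

    For each dimension d, the scores are normalized over the T source
    positions, so each dimension can attend to a different source word.
    """
    dim = len(annotations[0])
    context = []
    for d in range(dim):
        weights = softmax([row[d] for row in scores])
        context.append(sum(w * h[d] for w, h in zip(weights, annotations)))
    return context


# Two source positions, two dimensions (illustrative numbers only).
annotations = [[1.0, 10.0], [3.0, 20.0]]

# Temporal attention: both dimensions share the same weights.
shared = temporal_attention([0.0, 0.0], annotations)

# Fine-grained attention: dimension 0 prefers position 0,
# dimension 1 prefers position 1.
split = fine_grained_attention([[100.0, -100.0], [-100.0, 100.0]], annotations)
```

When every dimension is given the same score at each source position, the fine-grained version reduces to the temporal one, which is one way to see it as a generalization.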
Pages: 171-176
Page count: 6