Linguistic Knowledge-Aware Neural Machine Translation

被引:16
|
作者
Li, Qiang [1 ,2 ]
Wong, Derek F. [3 ]
Chao, Lidia S. [3 ]
Zhu, Muhua [2 ]
Xiao, Tong [1 ,4 ]
Zhu, Jingbo [1 ,4 ]
Zhang, Min [5 ]
机构
[1] Northeastern Univ, Sch Comp Sci & Engn, Nat Language Proc Lab, Shenyang 110819, Liaoning, Peoples R China
[2] Alibaba Inc, Hangzhou 311121, Zhejiang, Peoples R China
[3] Univ Macau, Nat Language Proc & Portuguese Chinese Machine Tr, Macau, Peoples R China
[4] Shenyang Yatrans Network Technol Co Ltd, Shenyang 110004, Liaoning, Peoples R China
[5] Soochow Univ, Inst Artificial Intelligence, Sch Comp Sci & Technol, Suzhou 215000, Peoples R China
基金
中国国家自然科学基金;
关键词
Attention gate; knowledge block; knowledge gate; neural machine translation (NMT);
D O I
10.1109/TASLP.2018.2864648
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recently, researchers have shown an increasing interest in incorporating linguistic knowledge into neural machine translation (NMT). To this end, previous works choose either to alter the architecture of NMT encoder to incorporate syntactic information into the translation model, or to generalize the embedding layer of the encoder to encode additional linguistic features. The former approach mainly focuses on injecting the syntactic structure of the source sentence into the encoding process, leading to a complicated model that lacks the flexibility to incorporate other types of knowledge. The latter extends word embeddings by considering additional linguistic knowledge as features to enrich the word representation. It thus does not explicitly balance the contribution from word embeddings and the contribution from additional linguistic knowledge. To address these limitations, this paper proposes a knowledge-aware NMT approach that models additional linguistic features in parallel to the word feature. The core idea is that we propose modeling a series of linguistic features at the word level (knowledge block) using a recurrent neural network (RNN). And in sentence level, those word-corresponding feature blocks are further encoded using a RNN encoder. In decoding, we propose a knowledge gate and an attention gate to dynamically control the proportions of information contributing to the generation of target words from different sources. Extensive experiments show that our approach is capable of better accounting for importance of additional linguistic, and we observe significant improvements from 1.0 to 2.3 BLEU points on Chinese <-> English and English -> German translation tasks.
引用
收藏
页码:2341 / 2354
页数:14
相关论文
共 50 条
  • [41] Multi-view knowledge graph fusion via knowledge-aware attentional graph neural network
    Zhichao Huang
    Xutao Li
    Yunming Ye
    Baoquan Zhang
    Guangning Xu
    Wensheng Gan
    [J]. Applied Intelligence, 2023, 53 : 3652 - 3671
  • [42] Quality-Aware Decoding for Neural Machine Translation
    Fernandes, Patrick
    Farinhas, Antonio
    Rei, Ricardo
    de Souza, Jose G. C.
    Ogayo, Perez
    Neubig, Graham
    Martins, Andre F. T.
    [J]. NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 1396 - 1412
  • [43] Multi-view knowledge graph fusion via knowledge-aware attentional graph neural network
    Huang, Zhichao
    Li, Xutao
    Ye, Yunming
    Zhang, Baoquan
    Xu, Guangning
    Gan, Wensheng
    [J]. APPLIED INTELLIGENCE, 2023, 53 (04) : 3652 - 3671
  • [44] Knowledge-aware progressive clustering for social image
    Mingyuan Li
    Yadong Dong
    Dongqing Liu
    Xiaoqiang Yan
    Caitong Yue
    Xiangyang Ren
    [J]. Complex & Intelligent Systems, 2024, 10 : 2173 - 2185
  • [45] Knowledge-Aware Bayesian Deep Topic Model
    Wang, Dongsheng
    Xu, Yishi
    Li, Miaoge
    Duan, Zhibin
    Wang, Chaojie
    Chen, Bo
    Zhou, Mingyuan
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [46] A survey on knowledge-aware news recommender systems
    Iana, Andreea
    Alam, Mehwish
    Paulheim, Heiko
    [J]. SEMANTIC WEB, 2024, 15 (01) : 21 - 82
  • [47] A hierarchical knowledge-aware neural network for protein-protein interaction article classification
    Wei, Hao
    Zhou, Ai
    Zhang, Yijia
    Chen, Fei
    Qu, Wen
    Lu, Mingyu
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 3002 - 3004
  • [48] Knowledge-aware progressive clustering for social image
    Li, Mingyuan
    Dong, Yadong
    Liu, Dongqing
    Yan, Xiaoqiang
    Yue, Caitong
    Ren, Xiangyang
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (02) : 2173 - 2185
  • [49] KVQA: Knowledge-Aware Visual Question Answering
    Shah, Sanket
    Mishra, Anand
    Yadati, Naganand
    Talukdar, Partha Pratim
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8876 - 8884
  • [50] Knowledge Graphs Enhanced Neural Machine Translation
    Zhao, Yang
    Zhang, Jiajun
    Zhou, Yu
    Zong, Chengqing
    [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 4039 - 4045