Linguistic Knowledge-Aware Neural Machine Translation

被引:16
|
作者
Li, Qiang [1 ,2 ]
Wong, Derek F. [3 ]
Chao, Lidia S. [3 ]
Zhu, Muhua [2 ]
Xiao, Tong [1 ,4 ]
Zhu, Jingbo [1 ,4 ]
Zhang, Min [5 ]
机构
[1] Northeastern Univ, Sch Comp Sci & Engn, Nat Language Proc Lab, Shenyang 110819, Liaoning, Peoples R China
[2] Alibaba Inc, Hangzhou 311121, Zhejiang, Peoples R China
[3] Univ Macau, Nat Language Proc & Portuguese Chinese Machine Tr, Macau, Peoples R China
[4] Shenyang Yatrans Network Technol Co Ltd, Shenyang 110004, Liaoning, Peoples R China
[5] Soochow Univ, Inst Artificial Intelligence, Sch Comp Sci & Technol, Suzhou 215000, Peoples R China
基金
中国国家自然科学基金;
关键词
Attention gate; knowledge block; knowledge gate; neural machine translation (NMT);
D O I
10.1109/TASLP.2018.2864648
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recently, researchers have shown an increasing interest in incorporating linguistic knowledge into neural machine translation (NMT). To this end, previous works choose either to alter the architecture of NMT encoder to incorporate syntactic information into the translation model, or to generalize the embedding layer of the encoder to encode additional linguistic features. The former approach mainly focuses on injecting the syntactic structure of the source sentence into the encoding process, leading to a complicated model that lacks the flexibility to incorporate other types of knowledge. The latter extends word embeddings by considering additional linguistic knowledge as features to enrich the word representation. It thus does not explicitly balance the contribution from word embeddings and the contribution from additional linguistic knowledge. To address these limitations, this paper proposes a knowledge-aware NMT approach that models additional linguistic features in parallel to the word feature. The core idea is that we propose modeling a series of linguistic features at the word level (knowledge block) using a recurrent neural network (RNN). And in sentence level, those word-corresponding feature blocks are further encoded using a RNN encoder. In decoding, we propose a knowledge gate and an attention gate to dynamically control the proportions of information contributing to the generation of target words from different sources. Extensive experiments show that our approach is capable of better accounting for importance of additional linguistic, and we observe significant improvements from 1.0 to 2.3 BLEU points on Chinese <-> English and English -> German translation tasks.
引用
收藏
页码:2341 / 2354
页数:14
相关论文
共 50 条
  • [1] Linguistic knowledge-based vocabularies for Neural Machine Translation
    Casas, Noe
    Costa-jussa, Marta R.
    Fonollosa, Jose A. R.
    Alonso, Juan A.
    Fanlo, Ramon
    [J]. NATURAL LANGUAGE ENGINEERING, 2021, 27 (04) : 485 - 506
  • [2] Future-Aware Knowledge Distillation for Neural Machine Translation
    Zhang, Biao
    Xiong, Deyi
    Su, Jinsong
    Luo, Jiebo
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (12) : 2278 - 2287
  • [3] Knowledge-Aware Hypergraph Neural Network for Recommender Systems
    Liu, Binghao
    Zhao, Pengpeng
    Zhuang, Fuzhen
    Xian, Xuefeng
    Liu, Yanchi
    Sheng, Victor S.
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2021), PT III, 2021, 12683 : 132 - 147
  • [4] Context-Aware Linguistic Steganography Model Based on Neural Machine Translation
    Ding, Changhao
    Fu, Zhangjie
    Yang, Zhongliang
    Yu, Qi
    Li, Daqiu
    Huang, Yongfeng
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 (868-878) : 868 - 878
  • [5] Knowledge-aware Coupled Graph Neural Network for Social Recommendation
    Huang, Chao
    Xu, Huance
    Xu, Yong
    Dai, Peng
    Xia, Lianghao
    Lu, Mengyin
    Bo, Liefeng
    Xing, Hao
    Lai, Xiaoping
    Ye, Yanfang
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 4115 - 4122
  • [6] Knowledge-Aware Neural Networks for Medical Forum Question Classification
    Roy, Soumyadeep
    Chakraborty, Sudip
    Mandal, Aishik
    Balde, Gunjan
    Sharma, Prakhar
    Natarajan, Anandhavelu
    Khosla, Megha
    Sural, Shamik
    Ganguly, Niloy
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 3398 - 3402
  • [7] Leveraging Hyperbolic Dynamic Neural Networks for Knowledge-Aware Recommendation
    Zhang, Yihao
    Li, Kaibei
    Zhu, Junlin
    Yuan, Meng
    Huang, Yonghao
    Li, Xiaokang
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (03): : 4396 - 4411
  • [8] KNCR: Knowledge-Aware Neural Collaborative Ranking for Recommender Systems
    Huang, Chen
    Gan, Zhongyuan
    Ye, Feng
    Wang, Pan
    Zhang, Moxuan
    [J]. 2020 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/CBDCOM/CYBERSCITECH), 2020, : 339 - 344
  • [9] KSRG: Knowledge-Aware Sequential Recommendation with Graph Neural Networks
    Yuan, Yuan
    Tang, Yan
    Yan, Zhiqiang
    Hu, Min
    Du, Luomin
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2408 - 2414
  • [10] Contextualized Knowledge-aware Attentive Neural Network: Enhancing Answer Selection with Knowledge
    Deng, Yang
    Xie, Yuexiang
    Li, Yaliang
    Yang, Min
    Lam, Wai
    Shen, Ying
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2022, 40 (01)