Linguistic Knowledge-Aware Neural Machine Translation

被引：16

作者：

Li, Qiang ^{[1
,2
]}

Wong, Derek F. ^{[3
]}

Chao, Lidia S. ^{[3
]}

Zhu, Muhua ^{[2
]}

Xiao, Tong ^{[1
,4
]}

Zhu, Jingbo ^{[1
,4
]}

Zhang, Min ^{[5
]}

机构：

[1] Northeastern Univ, Sch Comp Sci & Engn, Nat Language Proc Lab, Shenyang 110819, Liaoning, Peoples R China

[2] Alibaba Inc, Hangzhou 311121, Zhejiang, Peoples R China

[3] Univ Macau, Nat Language Proc & Portuguese Chinese Machine Tr, Macau, Peoples R China

[4] Shenyang Yatrans Network Technol Co Ltd, Shenyang 110004, Liaoning, Peoples R China

[5] Soochow Univ, Inst Artificial Intelligence, Sch Comp Sci & Technol, Suzhou 215000, Peoples R China

来源：

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2018年 / 26卷 / 12期

基金：

中国国家自然科学基金;

关键词：

Attention gate; knowledge block; knowledge gate; neural machine translation (NMT);

D O I：

10.1109/TASLP.2018.2864648

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Recently, researchers have shown an increasing interest in incorporating linguistic knowledge into neural machine translation (NMT). To this end, previous works choose either to alter the architecture of NMT encoder to incorporate syntactic information into the translation model, or to generalize the embedding layer of the encoder to encode additional linguistic features. The former approach mainly focuses on injecting the syntactic structure of the source sentence into the encoding process, leading to a complicated model that lacks the flexibility to incorporate other types of knowledge. The latter extends word embeddings by considering additional linguistic knowledge as features to enrich the word representation. It thus does not explicitly balance the contribution from word embeddings and the contribution from additional linguistic knowledge. To address these limitations, this paper proposes a knowledge-aware NMT approach that models additional linguistic features in parallel to the word feature. The core idea is that we propose modeling a series of linguistic features at the word level (knowledge block) using a recurrent neural network (RNN). And in sentence level, those word-corresponding feature blocks are further encoded using a RNN encoder. In decoding, we propose a knowledge gate and an attention gate to dynamically control the proportions of information contributing to the generation of target words from different sources. Extensive experiments show that our approach is capable of better accounting for importance of additional linguistic, and we observe significant improvements from 1.0 to 2.3 BLEU points on Chinese <-> English and English -> German translation tasks.

引用

页码：2341 / 2354

页数：14

共 50 条

[1] Linguistic knowledge-based vocabularies for Neural Machine Translation
Casas, Noe
Costa-jussa, Marta R.
Fonollosa, Jose A. R.
Alonso, Juan A.
Fanlo, Ramon
[J]. NATURAL LANGUAGE ENGINEERING, 2021, 27 (04) : 485 - 506
[2] Future-Aware Knowledge Distillation for Neural Machine Translation
Zhang, Biao
Xiong, Deyi
Su, Jinsong
Luo, Jiebo
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (12) : 2278 - 2287
[3] Knowledge-Aware Hypergraph Neural Network for Recommender Systems
Liu, Binghao
Zhao, Pengpeng
Zhuang, Fuzhen
Xian, Xuefeng
Liu, Yanchi
Sheng, Victor S.
[J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2021), PT III, 2021, 12683 : 132 - 147
[4] Context-Aware Linguistic Steganography Model Based on Neural Machine Translation
Ding, Changhao
Fu, Zhangjie
Yang, Zhongliang
Yu, Qi
Li, Daqiu
Huang, Yongfeng
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 (868-878) : 868 - 878
[5] Knowledge-aware Coupled Graph Neural Network for Social Recommendation
Huang, Chao
Xu, Huance
Xu, Yong
Dai, Peng
Xia, Lianghao
Lu, Mengyin
Bo, Liefeng
Xing, Hao
Lai, Xiaoping
Ye, Yanfang
[J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 4115 - 4122
[6] Knowledge-Aware Neural Networks for Medical Forum Question Classification
Roy, Soumyadeep
Chakraborty, Sudip
Mandal, Aishik
Balde, Gunjan
Sharma, Prakhar
Natarajan, Anandhavelu
Khosla, Megha
Sural, Shamik
Ganguly, Niloy
[J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 3398 - 3402
[7] Leveraging Hyperbolic Dynamic Neural Networks for Knowledge-Aware Recommendation
Zhang, Yihao
Li, Kaibei
Zhu, Junlin
Yuan, Meng
Huang, Yonghao
Li, Xiaokang
[J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (03): : 4396 - 4411
[8] KNCR: Knowledge-Aware Neural Collaborative Ranking for Recommender Systems
Huang, Chen
Gan, Zhongyuan
Ye, Feng
Wang, Pan
Zhang, Moxuan
[J]. 2020 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/CBDCOM/CYBERSCITECH), 2020, : 339 - 344
[9] KSRG: Knowledge-Aware Sequential Recommendation with Graph Neural Networks
Yuan, Yuan
Tang, Yan
Yan, Zhiqiang
Hu, Min
Du, Luomin
[J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2408 - 2414
[10] Contextualized Knowledge-aware Attentive Neural Network: Enhancing Answer Selection with Knowledge
Deng, Yang
Xie, Yuexiang
Li, Yaliang
Yang, Min
Lam, Wai
Shen, Ying
[J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2022, 40 (01)

← 1 2 3 4 5 →