Generative adversarial training for neural machine translation

被引：15

作者：

Yang, Zhen ^{[1
,2
]}

Chen, Wei ^{[2
]}

Wang, Feng ^{[2
]}

Xu, Bo ^{[2
]}

机构：

[1] Univ Chinese Acad Sci, Beijing, Peoples R China

[2] Chinese Acad Sci, Inst Automat, 95 Zhongguancun East Rd, Beijing 100190, Peoples R China

来源：

NEUROCOMPUTING | 2018年 / 321卷

关键词：

Neural machine translation; Multi generative adversarial net; Human-like translation;

D O I：

10.1016/j.neucom.2018.09.006

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Neural machine translation (NMT) is typically optimized to generate sentences which cover n-grams with ground target as much as possible. However, it is widely acknowledged that n-gram precisions, the manually designed approximate loss function, may mislead the model to generate suboptimal translations. To solve this problem, we train the NMT model to generate human-like translations directly by using the generative adversarial net, which has achieved great success in computer vision. In this paper, we build a conditional sequence generative adversarial net (CSGAN-NMT) which comprises of two adversarial sub models, a generative model (generator) which translates the source sentence into the target sentence as the traditional NMT models do and a discriminative model (discriminator) which discriminates the machine-translated target sentence from the human-translated one. The two sub models play a mini max game and achieve a win-win situation when reaching a Nash Equilibrium. As a variant of the single generator-discriminator model, the multi-CSGAN-NMT which contains multiple discriminators and generators, is also proposed. In the multi-CSGAN-NMT model, each generator is viewed as an agent which can interact with others and even transfer messages. Experiments show that the proposed CSGAN-NMT model obtains substantial improvements than the strong baseline and the improvement of the multi-CSGAN-NMT model is more remarkable. (C) 2018 Elsevier B.V. All rights reserved.

引用

页码：146 / 155

页数：10

共 50 条

[1] Adversarial Training for Unknown Word Problems in Neural Machine Translation
Ji, Yatu
Hou, Hongxu
Chen, Junjie
Wu, Nier
[J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2020, 19 (01)
[2] Generative Adversarial Neural Machine Translation for Phonetic Languages via Reinforcement Learning
Kumar, Amit
Pratap, Ajay
Singh, Anil Kumar
[J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2023, 7 (01): : 190 - 199
[3] Generative Neural Machine Translation
Shah, Harshil
Barber, David
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[4] Noise-Based Adversarial Training for Enhancing Agglutinative Neural Machine Translation
Ji, Yatu
Hou, Hongxu
Chen, Junjie
Wu, Nier
[J]. PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2019, 11670 : 392 - 396
[5] Crafting Adversarial Examples for Neural Machine Translation
Zhang, Xinze
Zhang, Junzhe
Chen, Zhenhua
He, Kun
[J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 1967 - 1977
[6] Manifold Adversarial Augmentation for Neural Machine Translation
Chen, Guandan
Fan, Kai
Zhang, Kaibo
Chen, Boxing
Huang, Zhongqiang
[J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 3184 - 3189
[7] Effective Adversarial Regularization for Neural Machine Translation
Sato, Motoki
Suzuki, Jun
Kiyono, Shun
[J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 204 - 210
[8] A4NT: Author Attribute Anonymity by Adversarial Training of Neural Machine Translation
Shetty, Rakshith
Schiele, Bernt
Fritz, Mario
[J]. PROCEEDINGS OF THE 27TH USENIX SECURITY SYMPOSIUM, 2018, : 1633 - 1650
[9] Robust Neural Machine Translation with Doubly Adversarial Inputs
Cheng, Yong
Jiang, Lu
Macherey, Wolfgang
[J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 4324 - 4333
[10] Adversarial Subword Regularization for Robust Neural Machine Translation
Park, Jungsoo
Sung, Mujeen
Lee, Jinhyuk
Kang, Jaewoo
[J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 1945 - 1953

← 1 2 3 4 5 →