Improving Non-autoregressive Neural Machine Translation with Monolingual Data

被引：0

作者：

Zhou, Jiawei ^{[1
]}

Keung, Phillip ^{[2
]}

机构：

[1] Harvard Univ, Cambridge, MA 02138 USA

[2] Amazon Inc, Bellevue, WA USA

来源：

58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020) | 2020年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Non-autoregressive (NAR) neural machine translation is usually done via knowledge distillation from an autoregressive (AR) model. Under this framework, we leverage large monolingual corpora to improve the NAR model's performance, with the goal of transferring the AR model's generalization ability while preventing overfitting. On top of a strong NAR baseline, our experimental results on the WMT14 En-De and WMT16 En-Ro news translation tasks confirm that monolingual data augmentation consistently improves the performance of the NAR model to approach the teacher AR model's performance, yields comparable or better results than the best non-iterative NAR methods in the literature and helps reduce overfitting in the training process.

引用

页码：1893 / 1898

页数：6

共 50 条

[21] Non-Autoregressive Machine Translation with Latent Alignments
Saharia, Chitwan
Chan, William
Saxena, Saurabh
Norouzi, Mohammad
[J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 1098 - 1108
[22] Enriching Non-Autoregressive Transformer with Syntactic and Semantic Structures for Neural Machine Translation
Liu, Ye
Wan, Yao
Zhang, Jian-Guo
Zhao, Wenting
Yu, Philip S.
[J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 1235 - 1244
[23] Task-Level Curriculum Learning for Non-Autoregressive Neural Machine Translation
Liu, Jinglin
Ren, Yi
Tan, Xu
Zhang, Chen
Qin, Tao
Zhao, Zhou
Liu, Tie-Yan
[J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3861 - 3867
[24] Non-Autoregressive Neural Machine Translation with Consistency Regularization Optimized Variational Framework
Zhu, Minghao
Wang, Junli
Yan, Chungang
[J]. NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 607 - 617
[25] Fine-Tuning by Curriculum Learning for Non-Autoregressive Neural Machine Translation
Guo, Junliang
Tan, Xu
Xu, Linli
Qin, Tao
Chen, Enhong
Liu, Tie-Yan
[J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 7839 - 7846
[26] Minimizing the Bag-of-Ngrams Difference for Non-Autoregressive Neural Machine Translation
Shao, Chenze
Zhang, Jinchao
Feng, Yang
Meng, Fandong
Zhou, Jie
[J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 198 - 205
[27] Incorporating history and future into non-autoregressive machine translation
Wang, Shuheng
Huang, Heyan
Shi, Shumin
[J]. COMPUTER SPEECH AND LANGUAGE, 2022, 77
[28] Non-Autoregressive Machine Translation: It's Not as Fast as it Seems
Helel, Jindrich
Haddow, Barry
Birch, Alexandra
[J]. NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 1780 - 1790
[29] Non-autoregressive Machine Translation with Disentangled Context Transformer
Kasai, Jungo
Cross, James
Ghazvininejad, Marjan
Gu, Jiatao
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
[30] Aligned Cross Entropy for Non-Autoregressive Machine Translation
Ghazvininejad, Marjan
Karpukhin, Vladimir
Zettlemoyer, Luke
Levy, Omer
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119

← 1 2 3 4 5 →