Non-Autoregressive Machine Translation with Latent Alignments

Cited by: 0
Authors
Saharia, Chitwan [1 ]
Chan, William [1 ]
Saxena, Saurabh [1 ]
Norouzi, Mohammad [1 ]
Affiliations
[1] Brain Team, Google Research, Mountain View, CA 94043, USA
Keywords
DOI: not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Discipline Codes
081104; 0812; 0835; 1405
Abstract
This paper presents two strong methods, CTC and Imputer, for non-autoregressive machine translation that model latent alignments with dynamic programming. We revisit CTC for machine translation and demonstrate that a simple CTC model can achieve state-of-the-art for single-step non-autoregressive machine translation, contrary to what prior work indicates. In addition, we adapt the Imputer model for non-autoregressive machine translation and demonstrate that Imputer with just 4 generation steps can match the performance of an autoregressive Transformer baseline. Our latent alignment models are simpler than many existing non-autoregressive translation baselines; for example, we do not require target length prediction or re-scoring with an autoregressive model. On the competitive WMT'14 En→De task, our CTC model achieves 25.7 BLEU with a single generation step, while Imputer achieves 27.5 BLEU with 2 generation steps, and 28.0 BLEU with 4 generation steps. This compares favourably to the autoregressive Transformer baseline at 27.8 BLEU.
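The "latent alignments with dynamic programming" that the abstract refers to can be illustrated with the standard CTC forward algorithm, which sums the probability of every monotonic, blank-augmented alignment that collapses to the target sequence. The sketch below is a minimal plain-Python illustration of that dynamic program; the function name `ctc_forward_log_prob` and the input layout are our own choices, not the paper's implementation.

```python
import math

def ctc_forward_log_prob(log_probs, target, blank=0):
    """Compute log P(target | input) by marginalizing over all
    monotonic alignments with the CTC forward algorithm.

    log_probs: T x V nested list of per-step log-probabilities.
    target: list of token ids (no blanks).
    """
    # Interleave blanks: y -> [blank, y1, blank, y2, ..., blank]
    ext = [blank]
    for tok in target:
        ext += [tok, blank]
    S = len(ext)
    NEG_INF = float("-inf")

    def logsumexp(*xs):
        m = max(xs)
        if m == NEG_INF:
            return NEG_INF
        return m + math.log(sum(math.exp(x - m) for x in xs))

    # alpha[s] = log-prob of all alignment prefixes ending at ext[s]
    alpha = [NEG_INF] * S
    alpha[0] = log_probs[0][ext[0]]
    if S > 1:
        alpha[1] = log_probs[0][ext[1]]
    for t in range(1, len(log_probs)):
        new = [NEG_INF] * S
        for s in range(S):
            cands = [alpha[s]]          # stay on the same symbol
            if s > 0:
                cands.append(alpha[s - 1])  # advance by one
            # Skip over a blank when adjacent labels differ
            if s > 1 and ext[s] != blank and ext[s] != ext[s - 2]:
                cands.append(alpha[s - 2])
            new[s] = logsumexp(*cands) + log_probs[t][ext[s]]
        alpha = new
    # Valid alignments end on the last label or the trailing blank
    return logsumexp(alpha[S - 1], alpha[S - 2] if S > 1 else NEG_INF)

# With T=2 frames, a uniform model over {blank, 1}, and target [1],
# three alignments collapse to [1]: (1,1), (1,blank), (blank,1),
# so the marginal probability is 3 * 0.25 = 0.75.
lp = math.log(0.5)
print(math.exp(ctc_forward_log_prob([[lp, lp], [lp, lp]], [1])))  # → 0.75
```

Because this marginalization is exact, a CTC decoder can emit the whole alignment in one parallel step and then collapse repeats and blanks, which is what makes single-step non-autoregressive generation possible.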
Pages: 1098 - 1108
Page count: 11
Related Papers
50 items in total
  • [1] Non-Monotonic Latent Alignments for CTC-Based Non-Autoregressive Machine Translation
    Shao, Chenze
    Feng, Yang
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [2] Integrating Translation Memories into Non-Autoregressive Machine Translation
    Xu, Jitao
    Crego, Josep
    Yvon, Francois
    [J]. 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1326 - 1338
  • [3] Can Latent Alignments Improve Autoregressive Machine Translation?
    Haviv, Adi
    Vassertail, Lior
    Levy, Omer
    [J]. 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 2637 - 2641
  • [4] Enhanced encoder for non-autoregressive machine translation
    Wang, Shuheng
    Shi, Shumin
    Huang, Heyan
    [J]. MACHINE TRANSLATION, 2021, 35 (04) : 595 - 609
  • [5] Acyclic Transformer for Non-Autoregressive Machine Translation
    Huang, Fei
    Zhou, Hao
    Liu, Yang
    Li, Hang
    Huang, Minlie
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [6] Non-Autoregressive Machine Translation with Auxiliary Regularization
    Wang, Yiren
    Tian, Fei
    He, Di
    Qin, Tao
    Zhai, ChengXiang
    Liu, Tie-Yan
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 5377 - 5384
  • [7] A Survey of Non-Autoregressive Neural Machine Translation
    Li, Feng
    Chen, Jingxian
    Zhang, Xuejun
    [J]. ELECTRONICS, 2023, 12 (13)
  • [8] Modeling Coverage for Non-Autoregressive Neural Machine Translation
    Shan, Yong
    Feng, Yang
    Shao, Chenze
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [9] Incorporating history and future into non-autoregressive machine translation
    Wang, Shuheng
    Huang, Heyan
    Shi, Shumin
    [J]. COMPUTER SPEECH AND LANGUAGE, 2022, 77
  • [10] Glancing Transformer for Non-Autoregressive Neural Machine Translation
    Qian, Lihua
    Zhou, Hao
    Bao, Yu
    Wang, Mingxuan
    Qiu, Lin
    Zhang, Weinan
    Yu, Yong
    Li, Lei
    [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 1993 - 2003