Fully Non-autoregressive Neural Machine Translation: Tricks of the Trade

Cited by: 0
Authors
Gu, Jiatao [1 ]
Kong, Xiang [2 ]
Affiliations
[1] Facebook AI Res, Menlo Pk, CA 94025 USA
[2] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA USA
Keywords
DOI
Not available
CLC number
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Fully non-autoregressive neural machine translation (NAT) predicts all output tokens simultaneously in a single forward pass of the neural network, which significantly reduces inference latency at the expense of a quality drop compared to the Transformer baseline. In this work, we aim to close this performance gap while maintaining the latency advantage. We first examine the fundamental issues of fully NAT models and adopt dependency reduction in the learning space of output tokens as the primary guidance. We then revisit methods from four different aspects that have proven effective for improving NAT models, and carefully combine these techniques with necessary modifications. Extensive experiments on three translation benchmarks show that the proposed system achieves state-of-the-art results among fully NAT models and obtains performance comparable to autoregressive and iterative NAT systems. For instance, one of the proposed models achieves 27.49 BLEU on WMT14 En-De with a 16.5x speed-up over a similarly sized autoregressive baseline under the same inference conditions. The implementation of our model is available here(1).
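To make the latency/quality trade-off described in the abstract concrete, the following minimal Python sketch contrasts autoregressive decoding (one forward pass per generated token) with fully non-autoregressive decoding (all positions predicted in a single forward pass). It is an illustrative toy under stated assumptions, not the paper's implementation: toy_logits, VOCAB, and the fixed target length are hypothetical stand-ins for a real Transformer decoder and length predictor.

    import random

    VOCAB = ["<bos>", "<eos>", "das", "haus", "ist", "gross"]

    def toy_logits(src_tokens, positions):
        # Hypothetical stand-in for a Transformer decoder forward pass:
        # returns one score vector over VOCAB per queried target position.
        random.seed(len(src_tokens) + len(positions))
        return [[random.random() for _ in VOCAB] for _ in positions]

    def autoregressive_decode(src_tokens, max_len=6):
        # Baseline behaviour: one forward pass per generated token,
        # so latency grows linearly with output length.
        out = ["<bos>"]
        for _ in range(max_len):
            scores = toy_logits(src_tokens, out)[-1]  # only the next position is scored
            tok = VOCAB[max(range(len(VOCAB)), key=scores.__getitem__)]
            out.append(tok)
            if tok == "<eos>":
                break
        return out[1:]

    def fully_nat_decode(src_tokens, tgt_len=4):
        # Fully NAT behaviour: a target length is fixed up front and every
        # position is predicted independently in one forward pass; this
        # independence assumption is the source of the quality gap.
        scores = toy_logits(src_tokens, list(range(tgt_len)))
        return [VOCAB[max(range(len(VOCAB)), key=s.__getitem__)] for s in scores]

    if __name__ == "__main__":
        src = ["the", "house", "is", "big"]
        print("autoregressive:", autoregressive_decode(src))
        print("fully NAT     :", fully_nat_decode(src))

In this toy, the autoregressive loop issues max_len model calls while the NAT path issues exactly one, which mirrors where the reported 16.5x speed-up comes from; the techniques in the paper target the quality lost to predicting positions independently.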
Pages: 120-133
Number of pages: 14
Related papers
50 records in total
  • [1] A Survey of Non-Autoregressive Neural Machine Translation
    Li, Feng
    Chen, Jingxian
    Zhang, Xuejun
    ELECTRONICS, 2023, 12 (13)
  • [2] Modeling Coverage for Non-Autoregressive Neural Machine Translation
    Shan, Yong
    Feng, Yang
    Shao, Chenze
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [3] Glancing Transformer for Non-Autoregressive Neural Machine Translation
    Qian, Lihua
    Zhou, Hao
    Bao, Yu
    Wang, Mingxuan
    Qiu, Lin
    Zhang, Weinan
    Yu, Yong
    Li, Lei
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 1993 - 2003
  • [4] Imitation Learning for Non-Autoregressive Neural Machine Translation
    Wei, Bingzhen
    Wang, Mingxuan
    Zhou, Hao
    Lin, Junyang
    Sun, Xu
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 1304 - 1312
  • [5] Learning to Rewrite for Non-Autoregressive Neural Machine Translation
    Geng, Xinwei
    Feng, Xiaocheng
    Qin, Bing
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 3297 - 3308
  • [6] Uncertainty-aware non-autoregressive neural machine translation
    Liu, Chuanming
    Yu, Jingqi
    COMPUTER SPEECH AND LANGUAGE, 2023, 78
  • [7] Selective Knowledge Distillation for Non-Autoregressive Neural Machine Translation
    Liu, Min
    Bao, Yu
    Zhao, Chengqi
    Huang, Shujian
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11, 2023, : 13246 - 13254
  • [8] Non-autoregressive neural machine translation with auxiliary representation fusion
    Du, Quan
    Feng, Kai
    Xu, Chen
    Xiao, Tong
    Zhu, Jingbo
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (06) : 7229 - 7239
  • [9] Improving Non-autoregressive Neural Machine Translation with Monolingual Data
    Zhou, Jiawei
    Keung, Phillip
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 1893 - 1898
  • [10] A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond
    Xiao Y.
    Wu L.
    Guo J.
    Li J.
    Zhang M.
    Qin T.
    Liu T.-Y.
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 45 (10) : 11407 - 11427