Glancing Transformer for Non-Autoregressive Neural Machine Translation

Cited by: 0
Authors
Qian, Lihua [1 ,2 ,4 ]
Zhou, Hao [2 ]
Bao, Yu [3 ]
Wang, Mingxuan [2 ]
Qiu, Lin [1 ]
Zhang, Weinan [1 ]
Yu, Yong [1 ]
Li, Lei [2 ]
Affiliations
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] ByteDance AI Lab, Beijing, Peoples R China
[3] Nanjing Univ, Nanjing, Peoples R China
[4] Bytedance, Beijing, Peoples R China
Keywords
DOI
Not available
CLC Number
TP18 [Artificial Intelligence Theory];
Discipline Codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Recent work on non-autoregressive neural machine translation (NAT) aims to improve efficiency through parallel decoding without sacrificing translation quality. However, existing NAT methods are either inferior to the Transformer or require multiple decoding passes, which reduces the speedup. We propose the Glancing Language Model (GLM) for single-pass parallel generation models. Building on GLM, we develop the Glancing Transformer (GLAT) for machine translation. With only single-pass parallel decoding, GLAT generates high-quality translations with an 8x-15x speedup. Notably, GLAT does not modify the network architecture; it is a training method for learning word interdependency. Experiments on multiple WMT language directions show that GLAT outperforms all previous single-pass non-autoregressive methods and is nearly comparable to the Transformer, reducing the gap to 0.25-0.9 BLEU points.
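To make the two-pass glancing training scheme described in the abstract concrete, here is a minimal PyTorch sketch under stated assumptions, not the authors' implementation: the `model.decode(src, dec_in)` single-pass NAT decoder interface, the use of `pad_id` as the decoder's placeholder input token, and the fixed sampling `ratio` are all hypothetical simplifications (GLAT anneals the ratio during training, and its decoder inputs copy source representations rather than pad tokens).

```python
import torch
import torch.nn.functional as F

def glancing_train_step(model, src, tgt, pad_id, ratio=0.5):
    """One glancing (GLM-style) training step for a NAT model (sketch).

    Assumes `model.decode(src, dec_in)` returns per-position vocabulary
    logits in a single parallel pass; `pad_id` stands in for the decoder's
    placeholder input token.
    """
    # Pass 1: predict all target positions in parallel, observing no tokens.
    dec_in = torch.full_like(tgt, pad_id)
    with torch.no_grad():
        pred = model.decode(src, dec_in).argmax(-1)

    # Glancing sampling: reveal N ground-truth tokens per sentence, with N
    # proportional to the Hamming distance between prediction and reference,
    # so harder sentences get more glanced tokens.
    tgt_mask = tgt.ne(pad_id)
    hamming = (pred.ne(tgt) & tgt_mask).sum(-1)
    n_glance = (hamming.float() * ratio).long()

    glance_mask = torch.zeros_like(tgt, dtype=torch.bool)
    for i in range(tgt.size(0)):
        positions = tgt_mask[i].nonzero(as_tuple=True)[0]
        perm = torch.randperm(positions.numel(), device=tgt.device)
        glance_mask[i, positions[perm[: int(n_glance[i])]]] = True

    # Pass 2: feed ground-truth tokens at the glanced positions and train the
    # model to predict only the remaining (unobserved) positions.
    logits = model.decode(src, torch.where(glance_mask, tgt, dec_in))
    loss_positions = tgt_mask & ~glance_mask
    return F.cross_entropy(logits[loss_positions], tgt[loss_positions])
```

At inference time only the first fully parallel pass is executed, which is why GLAT retains single-pass decoding speed while the glancing passes exist only during training.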
Pages: 1993-2003
Number of pages: 11
Related Papers
50 records in total
  • [1] Acyclic Transformer for Non-Autoregressive Machine Translation
    Huang, Fei
    Zhou, Hao
    Liu, Yang
    Li, Hang
    Huang, Minlie
    International Conference on Machine Learning, Vol. 162, 2022.
  • [2] Non-autoregressive Machine Translation with Disentangled Context Transformer
    Kasai, Jungo
    Cross, James
    Ghazvininejad, Marjan
    Gu, Jiatao
    International Conference on Machine Learning, Vol. 119, 2020.
  • [3] Enriching Non-Autoregressive Transformer with Syntactic and Semantic Structures for Neural Machine Translation
    Liu, Ye
    Wan, Yao
    Zhang, Jian-Guo
    Zhao, Wenting
    Yu, Philip S.
    16th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2021), 2021, pp. 1235-1244.
  • [4] A Survey of Non-Autoregressive Neural Machine Translation
    Li, Feng
    Chen, Jingxian
    Zhang, Xuejun
    Electronics, 2023, 12(13).
  • [5] Modeling Coverage for Non-Autoregressive Neural Machine Translation
    Shan, Yong
    Feng, Yang
    Shao, Chenze
    2021 International Joint Conference on Neural Networks (IJCNN), 2021.
  • [6] Learning to Rewrite for Non-Autoregressive Neural Machine Translation
    Geng, Xinwei
    Feng, Xiaocheng
    Qin, Bing
    2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021), 2021, pp. 3297-3308.
  • [7] Imitation Learning for Non-Autoregressive Neural Machine Translation
    Wei, Bingzhen
    Wang, Mingxuan
    Zhou, Hao
    Lin, Junyang
    Sun, Xu
    57th Annual Meeting of the Association for Computational Linguistics (ACL 2019), 2019, pp. 1304-1312.
  • [8] Uncertainty-aware non-autoregressive neural machine translation
    Liu, Chuanming
    Yu, Jingqi
    Computer Speech and Language, 2023, 78.
  • [9] Non-autoregressive neural machine translation with auxiliary representation fusion
    Du, Quan
    Feng, Kai
    Xu, Chen
    Xiao, Tong
    Zhu, Jingbo
    Journal of Intelligent & Fuzzy Systems, 2021, 41(6), pp. 7229-7239.
  • [10] Improving Non-autoregressive Neural Machine Translation with Monolingual Data
    Zhou, Jiawei
    Keung, Phillip
    58th Annual Meeting of the Association for Computational Linguistics (ACL 2020), 2020, pp. 1893-1898.