Modeling Coverage for Non-Autoregressive Neural Machine Translation

Cited by: 1
Authors
Shan, Yong
Feng, Yang [1 ]
Shao, Chenze
Affiliations
[1] Chinese Acad Sci ICT CAS, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing, Peoples R China
DOI
10.1109/IJCNN52387.2021.9533529
Chinese Library Classification (CLC)
TP18 [Artificial Intelligence Theory];
Discipline Classification Codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Non-Autoregressive Neural Machine Translation (NAT) achieves significant inference speedup by generating all tokens simultaneously. Despite its high efficiency, NAT usually suffers from two kinds of translation errors: over-translation (e.g., repeated tokens) and under-translation (e.g., missing translations), which ultimately limit translation quality. In this paper, we argue that these issues of NAT can be addressed through coverage modeling, which has proven useful in autoregressive decoding. We propose a novel Coverage-NAT that models coverage information directly through a token-level coverage iterative refinement mechanism and a sentence-level coverage agreement, which respectively remind the model whether a source token has been translated and improve the semantic consistency between the translation and the source. Experimental results on the WMT14 En<->De and WMT16 En<->Ro translation tasks show that our method alleviates those errors and achieves strong improvements over the baseline system.
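The coverage idea the abstract refers to can be illustrated with a minimal sketch. This is not the paper's formulation (Coverage-NAT's refinement mechanism and agreement loss are more involved); it only shows the classic diagnostic behind coverage modeling, under the assumption that each source token's coverage is the total attention mass it receives across target positions. The function names and thresholds (`flag_coverage_errors`, `under`, `over`) are hypothetical.

```python
import numpy as np

def coverage_vector(attention):
    """attention: (target_len, source_len) matrix whose rows sum to 1.
    Returns per-source-token coverage: total attention mass received."""
    return attention.sum(axis=0)

def flag_coverage_errors(attention, under=0.2, over=1.5):
    """Flag source positions that look under- or over-translated.
    Coverage near 0 suggests a missing translation; coverage well
    above 1 suggests repeated output attending to the same token."""
    cov = coverage_vector(attention)
    return {
        "under_translated": np.where(cov < under)[0].tolist(),
        "over_translated": np.where(cov > over)[0].tolist(),
    }

# Toy example: 4 target tokens attending over 3 source tokens.
# Source token 0 soaks up nearly all attention; token 2 is ignored.
attn = np.array([
    [0.9, 0.1, 0.0],
    [0.8, 0.2, 0.0],
    [0.7, 0.3, 0.0],
    [0.9, 0.1, 0.0],
])
print(flag_coverage_errors(attn))
# {'under_translated': [2], 'over_translated': [0]}
```

A model that tracks such a coverage signal during generation can be penalized (or refined iteratively, as in the paper) so that every source token ends up with coverage close to 1.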
Pages: 8
Related Papers
50 records total
  • [21] Task-Level Curriculum Learning for Non-Autoregressive Neural Machine Translation
    Liu, Jinglin
    Ren, Yi
    Tan, Xu
    Zhang, Chen
    Qin, Tao
    Zhao, Zhou
    Liu, Tie-Yan
    [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3861 - 3867
  • [22] Non-Autoregressive Neural Machine Translation with Consistency Regularization Optimized Variational Framework
    Zhu, Minghao
    Wang, Junli
    Yan, Chungang
    [J]. NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 607 - 617
  • [23] Fine-Tuning by Curriculum Learning for Non-Autoregressive Neural Machine Translation
    Guo, Junliang
    Tan, Xu
    Xu, Linli
    Qin, Tao
    Chen, Enhong
    Liu, Tie-Yan
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 7839 - 7846
  • [24] Minimizing the Bag-of-Ngrams Difference for Non-Autoregressive Neural Machine Translation
    Shao, Chenze
    Zhang, Jinchao
    Feng, Yang
    Meng, Fandong
    Zhou, Jie
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 198 - 205
  • [25] Incorporating history and future into non-autoregressive machine translation
    Wang, Shuheng
    Huang, Heyan
    Shi, Shumin
    [J]. COMPUTER SPEECH AND LANGUAGE, 2022, 77
  • [26] Non-Autoregressive Machine Translation: It's Not as Fast as it Seems
    Helcl, Jindrich
    Haddow, Barry
    Birch, Alexandra
    [J]. NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 1780 - 1790
  • [27] Aligned Cross Entropy for Non-Autoregressive Machine Translation
    Ghazvininejad, Marjan
    Karpukhin, Vladimir
    Zettlemoyer, Luke
    Levy, Omer
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [28] Non-autoregressive Machine Translation with Disentangled Context Transformer
    Kasai, Jungo
    Cross, James
    Ghazvininejad, Marjan
    Gu, Jiatao
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [30] AligNART: Non-autoregressive Neural Machine Translation by Jointly Learning to Estimate Alignment and Translate
    Song, Jongyoon
    Kim, Sungwon
    Yoon, Sungroh
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 1 - 14