Incorporating history and future into non-autoregressive machine translation

Cited: 1
Authors
Wang, Shuheng [1 ]
Huang, Heyan [2 ]
Shi, Shumin [2 ]
Affiliations
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing, Peoples R China
[2] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing, Peoples R China
Source
Computer Speech and Language (inferred from the DOI below)
Funding
National Natural Science Foundation of China
Keywords
Machine translation; Non-autoregressive; Capsule network; History and future information
DOI
10.1016/j.csl.2022.101439
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline Codes
081104; 0812; 0835; 1405
Abstract
In non-autoregressive machine translation, the decoder generates all target tokens in one shot. Although this decoding process significantly reduces decoding latency, non-autoregressive machine translation still sacrifices translation accuracy. We argue that the cause of this decrease is the lack of target-side dependencies, namely history and future information, between target tokens. In this work, we therefore propose a novel method to address this problem. We posit that the hidden representation of a target token from the decoder should consist of three parts: history, present, and future information. We dynamically aggregate this parts-to-whole information with a capsule network in the decoder to improve the performance of non-autoregressive machine translation. In addition, to ensure that the capsules learn the information as we expect, we introduce an autoregressive decoder. Experiments on benchmark tasks demonstrate that explicitly modeling history and future information significantly improves the performance of the NAT model. Extensive analyses show that our model learns history and future information as expected.
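The record itself contains no code. As a minimal, hypothetical sketch of the parts-to-whole aggregation the abstract describes, the PyTorch module below fuses three part capsules (history, present, future) into one whole representation per target position via standard capsule dynamic routing (Sabour et al., 2017); every name in it (CapsuleAggregator, squash, num_iters) is an illustrative assumption, not taken from the paper.

```python
# Hypothetical sketch (not the authors' released code): parts-to-whole
# aggregation of history / present / future representations via capsule
# dynamic routing (Sabour et al., 2017).
import torch
import torch.nn as nn
import torch.nn.functional as F


def squash(s, dim=-1, eps=1e-8):
    # Capsule squashing non-linearity: keeps direction, bounds the norm in [0, 1).
    norm_sq = (s * s).sum(dim=dim, keepdim=True)
    return (norm_sq / (1.0 + norm_sq)) * s / torch.sqrt(norm_sq + eps)


class CapsuleAggregator(nn.Module):
    """Routes three part capsules (history, present, future) into one whole
    capsule per target position. All names here are illustrative."""

    def __init__(self, d_model, num_iters=3):
        super().__init__()
        self.num_iters = num_iters
        # One "prediction" projection per part capsule.
        self.proj = nn.ModuleList(nn.Linear(d_model, d_model) for _ in range(3))

    def forward(self, history, present, future):
        # Each input: (batch, seq_len, d_model).
        parts = (history, present, future)
        u = torch.stack([p(x) for p, x in zip(self.proj, parts)], dim=2)  # (B, T, 3, D)
        b = torch.zeros(u.shape[:3], device=u.device)  # routing logits   (B, T, 3)
        for _ in range(self.num_iters):
            c = F.softmax(b, dim=-1)                    # coupling coefficients
            s = (c.unsqueeze(-1) * u).sum(dim=2)        # weighted sum -> (B, T, D)
            v = squash(s)                               # whole-capsule output
            b = b + (u * v.unsqueeze(2)).sum(dim=-1)    # agreement update
        return v  # fused representation for each target position
```

For example, CapsuleAggregator(512)(h, p, f) on three (batch, length, 512) tensors yields one fused (batch, length, 512) representation; how the paper actually derives the history/present/future inputs and couples the auxiliary autoregressive decoder is not recoverable from this record.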
Pages: 11