Improving Robustness of Neural Machine Translation with Multi-task Learning

Cited by: 0
Authors
Zhou, Shuyan [1]
Zeng, Xiangkai [1]
Zhou, Yingqi [1]
Anastasopoulos, Antonios [1]
Neubig, Graham [1]
Affiliations
[1] Carnegie Mellon University, School of Computer Science, Language Technologies Institute, Pittsburgh, PA 15213, USA
Funding
U.S. National Science Foundation
Keywords
None listed
DOI
None
Chinese Library Classification (CLC)
TP18 [Artificial Intelligence Theory]
Discipline Codes
081104; 0812; 0835; 1405
Abstract
While neural machine translation (NMT) achieves remarkable performance on clean, in-domain text, performance is known to degrade drastically when the input is full of typos, grammatical errors, and other kinds of noise. In this work, we propose a multi-task learning algorithm for transformer-based MT systems that makes them more resilient to such noise. We describe our submission to the WMT 2019 Robustness shared task (Li et al., 2019) based on this method. Our model achieves a BLEU score of 32.8 on the shared task's French-to-English dataset, 7.1 BLEU points higher than a baseline vanilla transformer trained on clean text.
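The abstract describes the method only at a high level. As a rough illustration of what multi-task training for robustness can look like in practice, below is a minimal PyTorch-style sketch in which one shared transformer is trained on translation of clean sources plus a weighted auxiliary pass over synthetically noised copies of the same sources. The model class, the choice of auxiliary task, and the aux_weight hyperparameter are illustrative assumptions, not the authors' exact architecture or task mix (see the paper for those).

```python
# Minimal sketch of multi-task training for robust NMT. The model, the
# auxiliary-task choice, and aux_weight are illustrative assumptions,
# not the exact recipe from the WMT 2019 submission.
import torch
import torch.nn as nn

PAD_ID = 0  # assumed padding token id


class MultiTaskNMT(nn.Module):
    """One shared transformer serves both the clean and the noisy task."""

    def __init__(self, vocab_size: int, d_model: int = 512):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model, padding_idx=PAD_ID)
        self.transformer = nn.Transformer(d_model=d_model, batch_first=True)
        self.proj = nn.Linear(d_model, vocab_size)

    def forward(self, src: torch.Tensor, tgt_in: torch.Tensor) -> torch.Tensor:
        # Causal mask so the decoder cannot attend to future target tokens.
        mask = nn.Transformer.generate_square_subsequent_mask(
            tgt_in.size(1)).to(tgt_in.device)
        hidden = self.transformer(self.embed(src), self.embed(tgt_in),
                                  tgt_mask=mask)
        return self.proj(hidden)  # (batch, tgt_len, vocab_size)


def multitask_loss(model: MultiTaskNMT,
                   clean_src: torch.Tensor,
                   noisy_src: torch.Tensor,
                   tgt: torch.Tensor,
                   aux_weight: float = 0.5) -> torch.Tensor:
    """Translation loss on the clean source plus a weighted translation
    loss on a noised copy of the same source; all parameters are shared."""
    ce = nn.CrossEntropyLoss(ignore_index=PAD_ID)

    def task_loss(src: torch.Tensor) -> torch.Tensor:
        logits = model(src, tgt[:, :-1])  # teacher forcing
        return ce(logits.reshape(-1, logits.size(-1)), tgt[:, 1:].reshape(-1))

    return task_loss(clean_src) + aux_weight * task_loss(noisy_src)
```

In a real training loop, noisy_src would be generated by injecting typos, character swaps, or similar perturbations into clean_src, so the shared parameters are optimized on both the clean and the noisy input distributions at every step.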
Pages: 565-571
Page count: 7
Related Papers
50 in total (first 10 shown)
  • [1] Multi-task Learning for Multilingual Neural Machine Translation. Wang, Yiren; Zhai, ChengXiang; Awadalla, Hany Hassan. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020: 1022-1034.
  • [2] Improving Machine Translation of Arabic Dialects Through Multi-task Learning. Moukafih, Youness; Sbihi, Nada; Ghogho, Mounir; Smaili, Kamel. AIxIA 2021 - Advances in Artificial Intelligence, 2022, 13196: 580-590.
  • [3] Neural Machine Translation Based on Multi-task Learning of Discourse Structure. Kang, Xiao-Mian; Zong, Cheng-Qing. Ruan Jian Xue Bao/Journal of Software, 2022, 33(10): 3806-3818.
  • [4] Adaptive Knowledge Sharing in Multi-Task Learning: Improving Low-Resource Neural Machine Translation. Zaremoodi, Poorya; Buntine, Wray; Haffari, Gholamreza. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, Vol. 2, 2018: 656-661.
  • [5] Training Flexible Depth Model by Multi-Task Learning for Neural Machine Translation. Wang, Qiang; Xiao, Tong; Zhu, Jingbo. Findings of the Association for Computational Linguistics: EMNLP 2020, 2020: 4307-4312.
  • [6] Scheduled Multi-task Learning for Neural Chat Translation. Liang, Yunlong; Meng, Fandong; Xu, Jinan; Chen, Yufeng; Zhou, Jie. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022), Vol. 1 (Long Papers), 2022: 4375-4388.
  • [7] Improving a Neural Network Classifier Ensemble with Multi-task Learning. Ye, Qiang; Munro, Paul W. 2006 IEEE International Joint Conference on Neural Network Proceedings, Vols. 1-10, 2006: 5164-5170.
  • [8] Autocorrect in the Process of Translation: Multi-task Learning Improves Dialogue Machine Translation. Wang, Tao; Zhao, Chengqi; Wang, Mingxuan; Li, Lei; Xiong, Deyi. 2021 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT 2021), 2021: 105-112.
  • [9] Rethinking Data Augmentation for Low-Resource Neural Machine Translation: A Multi-Task Learning Approach. Sanchez-Cartagena, Victor M.; Espla-Gomis, Miquel; Perez-Ortiz, Juan Antonio; Sanchez-Martinez, Felipe. 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021), 2021: 8502-8516.
  • [10] Improving Adversarial Robustness of Bayesian Neural Networks via Multi-task Adversarial Training. Chen, Xu; Liu, Chuancai; Zhao, Yue; Jia, Zhiyang; Jin, Ge. Information Sciences, 2022, 592: 156-173.