Pre-Training on Mixed Data for Low-Resource Neural Machine Translation

Cited by: 6
Authors
Zhang, Wenbo [1 ,2 ,3 ]
Li, Xiao [1 ,2 ,3 ]
Yang, Yating [1 ,2 ,3 ]
Dong, Rui [1 ,2 ,3 ]
Affiliations
[1] Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi 830011, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Xinjiang Lab Minor Speech & Language Informat Pro, Urumqi 830011, Peoples R China
Keywords
neural machine translation; pre-training; low resource; word translation
DOI
10.3390/info12030133
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Subject Classification Code
0812
Abstract
The pre-training and fine-tuning paradigm has been shown to be effective for low-resource neural machine translation. In this paradigm, models pre-trained on monolingual data are used to initialize translation models, transferring knowledge from the monolingual data into them. In recent years, pre-training models have usually taken sentences with randomly masked words as input and been trained to predict the masked words from the unmasked ones. In this paper, we propose a new pre-training method that still predicts masked words, but randomly replaces some of the unmasked words in the input with their translations in another language. The translation words come from bilingual data, so the pre-training data contains both monolingual and bilingual data. We evaluate our method with experiments on a Uyghur-Chinese corpus. The experimental results show that our method gives the pre-trained model better generalization ability and helps the translation model achieve better performance. Through a word translation task, we also demonstrate that our method enables the translation model's embeddings to acquire more alignment knowledge.
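The abstract describes the input-corruption scheme only at a high level. The minimal Python sketch below illustrates the idea under stated assumptions: some words are masked and become prediction targets, while some of the remaining words are swapped for counterparts drawn from a word-level bilingual lexicon, yielding mixed-language inputs. The function name corrupt_sentence, the probabilities mask_prob and translate_prob, and the toy lexicon are hypothetical placeholders; the paper's actual masking ratios and its procedure for extracting translation pairs from bilingual data are not given in this record.

```python
import random

MASK = "[MASK]"

def corrupt_sentence(tokens, lexicon, mask_prob=0.15, translate_prob=0.15):
    """Build one pre-training example from a monolingual sentence.

    tokens: list of source-language tokens.
    lexicon: dict mapping a source word to its translation in the other
        language (assumed to be extracted from bilingual data).
    Returns (corrupted, targets): targets[i] holds the original token at
    masked positions and None elsewhere, as in masked language modeling.
    """
    corrupted, targets = [], []
    for tok in tokens:
        r = random.random()
        if r < mask_prob:
            # Masked position: the model must recover the original word.
            corrupted.append(MASK)
            targets.append(tok)
        elif r < mask_prob + translate_prob and tok in lexicon:
            # Unmasked word replaced by its translation, so a single
            # input sentence mixes both languages.
            corrupted.append(lexicon[tok])
            targets.append(None)
        else:
            corrupted.append(tok)
            targets.append(None)
    return corrupted, targets

# Toy usage with a hypothetical one-entry Uyghur-Chinese lexicon.
lexicon = {"kitab": "书"}  # "book"
print(corrupt_sentence("men bir kitab oqudum".split(), lexicon))
```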
Pages: 10
Related Papers (50 in total)
  • [31] On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation. Liu, Xuebo; Wang, Longyue; Wong, Derek F.; Ding, Liang; Chao, Lidia S.; Shi, Shuming; Tu, Zhaopeng. Findings of the Association for Computational Linguistics: EMNLP 2021, 2021: 2900-2907.
  • [32] A Strategy for Referential Problem in Low-Resource Neural Machine Translation. Ji, Yatu; Shi, Lei; Su, Yila; Ren, Qing-dao-er-ji; Wu, Nier; Wang, Hongbin. Artificial Neural Networks and Machine Learning, ICANN 2021, Pt V, 2021, 12895: 321-332.
  • [33] Machine Translation in Low-Resource Languages by an Adversarial Neural Network. Sun, Mengtao; Wang, Hao; Pasquine, Mark; Hameed, Ibrahim A. Applied Sciences-Basel, 2021, 11 (22).
  • [34] Low-Resource Neural Machine Translation: A Systematic Literature Review. Yazar, Bilge Kagan; Sahin, Durmus Ozkan; Kilic, Erdal. IEEE Access, 2023, 11: 131775-131813.
  • [35] Meta-Learning for Low-Resource Neural Machine Translation. Gu, Jiatao; Wang, Yong; Chen, Yun; Cho, Kyunghyun; Li, Victor O. K. 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP 2018), 2018: 3622-3631.
  • [36] Language Model Prior for Low-Resource Neural Machine Translation. Baziotis, Christos; Haddow, Barry; Birch, Alexandra. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020: 7622-7634.
  • [37] Unsupervised Source Hierarchies for Low-Resource Neural Machine Translation. Currey, Anna; Heafield, Kenneth. Relevance of Linguistic Structure in Neural Architectures for NLP, 2018: 6-12.
  • [38] Revisiting Low-Resource Neural Machine Translation: A Case Study. Sennrich, Rico; Zhang, Biao. 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019), 2019: 211-221.
  • [39] Universal Conditional Masked Language Pre-training for Neural Machine Translation. Li, Pengfei; Li, Liangyou; Zhang, Meng; Wu, Minghao; Liu, Qun. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022), Vol 1: Long Papers, 2022: 6379-6391.
  • [40] Pre-training Multilingual Neural Machine Translation by Leveraging Alignment Information. Lin, Zehui; Pan, Xiao; Wang, Mingxuan; Qiu, Xipeng; Feng, Jiangtao; Zhou, Hao; Li, Lei. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020: 2649-2663.