Improving Low-Resource Neural Machine Translation With Teacher-Free Knowledge Distillation

Cited by: 3
Authors
Zhang, Xinlu [1 ,2 ,3 ]
Li, Xiao [1 ,2 ,3 ]
Yang, Yating [1 ,2 ,3 ]
Dong, Rui [1 ,2 ,3 ]
Affiliations
[1] Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi 830011, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Xinjiang Lab Minor Speech & Language Informat Pro, Urumqi 830011, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Training; Decoding; Vocabulary; Task analysis; Standards; Knowledge engineering; Computational modeling; Neural machine translation; knowledge distillation; prior knowledge;
DOI
10.1109/ACCESS.2020.3037821
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline Classification Code
0812;
Abstract
Knowledge Distillation (KD) aims to distill the knowledge of a cumbersome teacher model into a lightweight student model. Its success is generally attributed to the privileged information about similarities among categories provided by the teacher model, and in this sense only strong teacher models are deployed in practice to teach weaker students. In low-resource neural machine translation, however, a stronger teacher model is usually not available. We therefore propose a novel Teacher-free Knowledge Distillation framework for low-resource neural machine translation, in which the model learns from a manually designed regularization distribution that acts as a virtual teacher. This manually designed prior distribution not only captures similarity information between words but also provides effective regularization for model training. Experimental results show that the proposed method effectively improves translation performance on low-resource languages.
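Since this record does not include the paper's exact loss, one plausible reading of the abstract is a label-smoothing-style prior used as a virtual teacher inside a standard KD objective. The PyTorch sketch below illustrates that reading; the function name teacher_free_kd_loss and the hyperparameters alpha, lam, and tau are illustrative assumptions, not the authors' published formulation.

```python
import torch
import torch.nn.functional as F

def teacher_free_kd_loss(logits, targets, alpha=0.9, lam=0.5, tau=1.0, pad_id=0):
    """Minimal sketch of a teacher-free KD objective for NMT (hypothetical).

    The "virtual teacher" is a manually designed distribution that places
    probability `alpha` on the gold token and spreads the remaining mass
    uniformly over the rest of the vocabulary.
    logits:  (batch, seq_len, vocab) student decoder outputs
    targets: (batch, seq_len) gold token ids
    """
    vocab = logits.size(-1)

    # Standard cross-entropy against the gold tokens (padding ignored).
    ce = F.cross_entropy(
        logits.reshape(-1, vocab), targets.reshape(-1), ignore_index=pad_id
    )

    # Manually designed prior: alpha on the gold token, uniform elsewhere.
    uniform = (1.0 - alpha) / (vocab - 1)
    teacher = torch.full_like(logits, uniform)
    teacher.scatter_(-1, targets.unsqueeze(-1), alpha)

    # Per-token KL between the virtual teacher and the temperature-scaled
    # student, averaged over non-padding positions.
    log_student = F.log_softmax(logits / tau, dim=-1)
    kl = F.kl_div(log_student, teacher, reduction="none").sum(-1)
    mask = targets.ne(pad_id).float()
    kl = (kl * mask).sum() / mask.sum()

    # Interpolate the two terms; tau**2 is the usual KD gradient rescaling.
    return (1.0 - lam) * ce + lam * (tau ** 2) * kl
```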
Pages: 206638 - 206645
Number of pages: 8
Related Papers
50 records in total
  • [31] Semantic Perception-Oriented Low-Resource Neural Machine Translation
    Wu, Nier
    Hou, Hongxu
    Li, Haoran
    Chang, Xin
    Jia, Xiaoning
    MACHINE TRANSLATION, CCMT 2021, 2021, 1464 : 51 - 62
  • [32] A Content Word Augmentation Method for Low-Resource Neural Machine Translation
    Li, Fuxue
    Zhao, Zhongchao
    Chi, Chuncheng
    Yan, Hong
    Zhang, Zhen
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT IV, 2023, 14089 : 720 - 731
  • [33] Incremental Domain Adaptation for Neural Machine Translation in Low-Resource Settings
    Kalimuthu, Marimuthu
    Barz, Michael
    Sonntag, Daniel
    FOURTH ARABIC NATURAL LANGUAGE PROCESSING WORKSHOP (WANLP 2019), 2019, : 1 - 10
  • [34] Benchmarking Neural and Statistical Machine Translation on Low-Resource African Languages
    Duh, Kevin
    McNamee, Paul
    Post, Matt
    Thompson, Brian
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 2667 - 2675
  • [35] An Analysis of Massively Multilingual Neural Machine Translation for Low-Resource Languages
    Mueller, Aaron
    Nicolai, Garrett
    McCarthy, Arya D.
    Lewis, Dylan
    Wu, Winston
    Yarowsky, David
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 3710 - 3718
  • [36] Towards a Low-Resource Neural Machine Translation for Indigenous Languages in Canada
Le, Ngoc Tan
Sadat, Fatiha
TRAITEMENT AUTOMATIQUE DES LANGUES, 2021, 62 (03): 39 - 63
  • [37] Regressing Word and Sentence Embeddings for Low-Resource Neural Machine Translation
Unanue, I. J.
Borzeshi, E. Z.
Piccardi, M.
IEEE TRANSACTIONS ON ARTIFICIAL INTELLIGENCE, 2023, 4 (03): 450 - 463
  • [38] Neural machine translation for low-resource languages without parallel corpora
    Karakanta, Alina
    Dehdari, Jon
    van Genabith, Josef
    MACHINE TRANSLATION, 2018, 32 (1-2) : 167 - 189
  • [39] Efficient Low-Resource Neural Machine Translation with Reread and Feedback Mechanism
    Yu, Zhiqiang
    Yu, Zhengtao
    Guo, Junjun
    Huang, Yuxin
    Wen, Yonghua
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2020, 19 (03)
  • [40] Hierarchical Transfer Learning Architecture for Low-Resource Neural Machine Translation
    Luo, Gongxu
    Yang, Yating
    Yuan, Yang
    Chen, Zhanheng
    Ainiwaer, Aizimaiti
    IEEE ACCESS, 2019, 7 : 154157 - 154166