Improving Low-Resource Neural Machine Translation With Teacher-Free Knowledge Distillation

Cited by: 3
Authors
Zhang, Xinlu [1 ,2 ,3 ]
Li, Xiao [1 ,2 ,3 ]
Yang, Yating [1 ,2 ,3 ]
Dong, Rui [1 ,2 ,3 ]
Affiliations
[1] Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi 830011, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Xinjiang Lab Minor Speech & Language Informat Pro, Urumqi 830011, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Training; Decoding; Vocabulary; Task analysis; Standards; Knowledge engineering; Computational modeling; Neural machine translation; knowledge distillation; prior knowledge;
DOI
10.1109/ACCESS.2020.3037821
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline Classification Code
0812;
Abstract
Knowledge Distillation (KD) aims to distill the knowledge of a cumbersome teacher model into a lightweight student model. Its success is generally attributed to the privileged information about similarities among categories provided by the teacher, so in practice only strong teachers are deployed to teach weaker students. In low-resource neural machine translation, however, a stronger teacher model is not available. We therefore propose a Teacher-free Knowledge Distillation framework for low-resource neural machine translation, in which the model learns from a manually designed regularization distribution that acts as a virtual teacher. This hand-crafted prior distribution not only captures similarity information among words but also provides effective regularization for model training. Experimental results show that the proposed method effectively improves translation performance on low-resource languages.
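As a rough illustration of the idea described in the abstract, the sketch below shows how a student NMT model could be trained against a manually designed distribution acting as a virtual teacher. This is a minimal PyTorch sketch under stated assumptions, not the authors' implementation: the function name teacher_free_kd_loss, the label-smoothing-style prior, and the hyperparameters alpha, temperature, and kd_weight are all illustrative.

```python
import torch
import torch.nn.functional as F


def teacher_free_kd_loss(logits, targets, alpha=0.9, temperature=10.0, kd_weight=0.5):
    """Hypothetical teacher-free KD loss: cross-entropy plus KL toward a hand-crafted prior.

    logits:  (batch * seq_len, vocab_size) student outputs
    targets: (batch * seq_len,) gold token indices
    alpha, temperature, and kd_weight are illustrative hyperparameters.
    """
    vocab_size = logits.size(-1)

    # Standard cross-entropy against the gold tokens.
    ce = F.cross_entropy(logits, targets)

    # Manually designed "virtual teacher": probability `alpha` on the gold token,
    # with the remainder spread uniformly over the rest of the vocabulary
    # (a label-smoothing-style prior).
    with torch.no_grad():
        teacher = torch.full_like(logits, (1.0 - alpha) / (vocab_size - 1))
        teacher.scatter_(1, targets.unsqueeze(1), alpha)

    # KL divergence between the temperature-softened student distribution
    # and the fixed prior, scaled by T^2 as in standard distillation.
    log_student = F.log_softmax(logits / temperature, dim=-1)
    kd = F.kl_div(log_student, teacher, reduction="batchmean") * temperature ** 2

    # Interpolate the two objectives.
    return (1.0 - kd_weight) * ce + kd_weight * kd
```

In such a setup the loss would replace the usual token-level cross-entropy in the NMT training loop; because the prior is fixed, no teacher forward pass is required.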
Pages: 206638 - 206645
Number of pages: 8
Related Papers
50 records in total
  • [21] Low-Resource Neural Machine Translation: A Systematic Literature Review
    Yazar, Bilge Kagan
    Sahin, Durmus Ozkan
    Kilic, Erdal
    IEEE ACCESS, 2023, 11 : 131775 - 131813
  • [22] Meta-Learning for Low-Resource Neural Machine Translation
    Gu, Jiatao
    Wang, Yong
    Chen, Yun
    Cho, Kyunghyun
    Li, Victor O. K.
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3622 - 3631
  • [23] Neural Machine Translation of Low-Resource and Similar Languages with Backtranslation
    Przystupa, Michael
    Abdul-Mageed, Muhammad
    FOURTH CONFERENCE ON MACHINE TRANSLATION (WMT 2019), VOL 3: SHARED TASK PAPERS, DAY 2, 2019, : 224 - 235
  • [24] Extremely low-resource neural machine translation for Asian languages
    Rubino, Raphael
    Marie, Benjamin
    Dabre, Raj
    Fujita, Atsushi
    Utiyama, Masao
    Sumita, Eiichiro
    MACHINE TRANSLATION, 2020, 34 (04) : 347 - 382
  • [25] Revisiting Low-Resource Neural Machine Translation: A Case Study
    Sennrich, Rico
    Zhang, Biao
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 211 - 221
  • [26] Improving neural machine translation with POS-tag features for low-resource language pairs
    Hlaing, Zar Zar
    Thu, Ye Kyaw
    Supnithi, Thepchai
    Netisopakul, Ponrudee
    HELIYON, 2022, 8 (08)
  • [27] Improving neural machine translation by integrating transliteration for low-resource English-Assamese language
    Nath, Basab
    Sarkar, Sunita
    Mukhopadhyay, Somnath
    Roy, Arindam
    NATURAL LANGUAGE PROCESSING, 2025, 31 (02) : 306 - 327
  • [28] Survey of Low-Resource Machine Translation
    Haddow, Barry
    Bawden, Rachel
    Barone, Antonio Valerio Miceli
    Helcl, Jindrich
    Birch, Alexandra
    COMPUTATIONAL LINGUISTICS, 2022, 48 (03) : 673 - 732
  • [29] Rethinking the Exploitation of Monolingual Data for Low-Resource Neural Machine Translation
    Pang, Jianhui
    Yang, Baosong
    Wong, Derek Fai
    Wan, Yu
    Liu, Dayiheng
    Chao, Lidia Sam
    Xie, Jun
    COMPUTATIONAL LINGUISTICS, 2023, 50 (01) : 25 - 47
  • [30] A Diverse Data Augmentation Strategy for Low-Resource Neural Machine Translation
    Li, Yu
    Li, Xiao
    Yang, Yating
    Dong, Rui
    INFORMATION, 2020, 11 (05)