Improving Low-Resource Neural Machine Translation With Teacher-Free Knowledge Distillation

Cited by: 3
Authors
Zhang, Xinlu [1 ,2 ,3 ]
Li, Xiao [1 ,2 ,3 ]
Yang, Yating [1 ,2 ,3 ]
Dong, Rui [1 ,2 ,3 ]
Affiliations
[1] Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi 830011, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Xinjiang Lab Minor Speech & Language Informat Pro, Urumqi 830011, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Training; Decoding; Vocabulary; Task analysis; Standards; Knowledge engineering; Computational modeling; Neural machine translation; knowledge distillation; prior knowledge;
DOI
10.1109/ACCESS.2020.3037821
CLC Number
TP [Automation Technology, Computer Technology]
Discipline Code
0812
Abstract
Knowledge Distillation (KD) aims to distill the knowledge of a cumbersome teacher model into a lightweight student model. Its success is generally attributed to the privileged information on similarities among categories provided by the teacher model, and in this sense, only strong teacher models are deployed in practice to teach weaker students. However, in low-resource neural machine translation, a stronger teacher model is usually not available. We therefore propose a novel Teacher-free Knowledge Distillation framework for low-resource neural machine translation, in which the model learns from a manually designed regularization distribution that acts as a virtual teacher. This hand-crafted prior distribution not only encodes similarity information between words but also provides an effective regularizer for model training. Experimental results show that the proposed method effectively improves translation performance on low-resource languages.
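The abstract's core mechanism, replacing the teacher network with a hand-crafted prior over the vocabulary, can be illustrated with a short training-loss sketch. Below is a minimal PyTorch sketch of the general teacher-free KD idea, assuming a virtual teacher that places `peak` probability mass on the gold token and spreads the remainder uniformly over the rest of the vocabulary; the function name teacher_free_kd_loss and the hyperparameters alpha, peak, and temperature are illustrative assumptions, not the paper's exact formulation.

import torch
import torch.nn.functional as F

def teacher_free_kd_loss(logits, targets, alpha=0.5, peak=0.9, temperature=2.0):
    """Hedged sketch of a teacher-free KD objective for NMT.

    logits:  (batch * seq_len, vocab_size) student scores per target token
    targets: (batch * seq_len,) gold target token ids
    All hyperparameter values here are illustrative, not the paper's.
    """
    vocab_size = logits.size(-1)

    # Standard hard-label cross-entropy on the gold tokens.
    ce = F.cross_entropy(logits, targets)

    # Manually designed "virtual teacher": `peak` mass on the gold token,
    # with the remaining (1 - peak) spread uniformly over all other words.
    uniform = (1.0 - peak) / (vocab_size - 1)
    virtual_teacher = torch.full_like(logits, uniform)
    virtual_teacher.scatter_(1, targets.unsqueeze(1), peak)

    # KL divergence between the temperature-softened student distribution
    # and the fixed virtual teacher (a simplification: only the student
    # side is softened here).
    log_student = F.log_softmax(logits / temperature, dim=1)
    kd = F.kl_div(log_student, virtual_teacher, reduction="batchmean")

    # Interpolate the two terms; temperature**2 rescales the KD gradient,
    # following common KD practice.
    return (1.0 - alpha) * ce + alpha * (temperature ** 2) * kd

In use, this loss would simply replace the plain cross-entropy term in the student NMT model's training loop. Because the virtual teacher is a fixed distribution rather than a network, no teacher forward pass is needed, which is what makes the setup attractive in low-resource settings where no strong teacher can be trained.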
Pages: 206638 - 206645
Page count: 8
Related papers
50 records in total
  • [1] Understanding and Improving Low-Resource Neural Machine Translation with Shallow Features
    Sun, Yanming
    Liu, Xuebo
    Wong, Derek F.
    Lin, Yuchu
    Li, Bei
    Zhan, Runzhe
    Chao, Lidia S.
    Zhang, Min
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT III, NLPCC 2024, 2025, 15361 : 227 - 239
  • [2] A Survey on Low-Resource Neural Machine Translation
    Wang, Rui
    Tan, Xu
    Luo, Renqian
    Qin, Tao
    Liu, Tie-Yan
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 4636 - 4643
  • [3] A Survey on Low-resource Neural Machine Translation
    Li H.-Z.
    Feng C.
    Huang H.-Y.
Science Press, (47): 1217 - 1231
  • [4] Transformers for Low-resource Neural Machine Translation
    Gezmu, Andargachew Mekonnen
    Nuernberger, Andreas
    ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 1, 2022, : 459 - 466
  • [5] Adaptive Knowledge Sharing in Multi-Task Learning: Improving Low-Resource Neural Machine Translation
    Zaremoodi, Poorya
    Buntine, Wray
    Haffari, Gholamreza
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2018, : 656 - 661
  • [6] Multi-granularity Knowledge Sharing in Low-resource Neural Machine Translation
    Mi, Chenggang
    Xie, Shaoliang
    Fan, Yi
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (02)
  • [7] Decoding Strategies for Improving Low-Resource Machine Translation
    Park, Chanjun
    Yang, Yeongwook
    Park, Kinam
    Lim, Heuiseok
    ELECTRONICS, 2020, 9 (10) : 1 - 15
  • [8] Low-Resource Neural Machine Translation with Neural Episodic Control
    Wu, Nier
    Hou, Hongxu
    Sun, Shuo
    Zheng, Wei
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021
  • [9] Low-resource Neural Machine Translation: Methods and Trends
    Shi, Shumin
    Wu, Xing
    Su, Rihai
    Huang, Heyan
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (05)
  • [10] Neural Machine Translation for Low-resource Languages: A Survey
    Ranathunga, Surangika
    Lee, En-Shiun Annie
    Skenduli, Marjana Prifti
    Shekhar, Ravi
    Alam, Mehreen
    Kaur, Rishemjit
    ACM COMPUTING SURVEYS, 2023, 55 (11)