Linguistically Driven Multi-Task Pre-Training for Low-Resource Neural Machine Translation

Cited by: 6
Authors
Mao, Zhuoyuan [1 ]
Chu, Chenhui [1 ]
Kurohashi, Sadao [1 ]
Affiliations
[1] Kyoto Univ, Grad Sch Informat, Kyoto, Japan
Keywords
Low-resource neural machine translation; pre-training; linguistically-driven;
DOI
10.1145/3491065
CLC (Chinese Library Classification) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In the present study, we propose novel sequence-to-sequence pre-training objectives for low-resource neural machine translation (NMT): Japanese-specific sequence to sequence (JASS) for language pairs involving Japanese as the source or target language, and English-specific sequence to sequence (ENSS) for language pairs involving English. JASS focuses on masking and reordering Japanese linguistic units known as bunsetsu, whereas ENSS is based on phrase-structure masking and reordering tasks. Experiments on the ASPEC Japanese-English and Japanese-Chinese, Wikipedia Japanese-Chinese, and News English-Korean corpora demonstrate that JASS and ENSS outperform MASS and other existing language-agnostic pre-training methods by up to +2.9 BLEU points for the Japanese-English tasks, up to +7.0 BLEU points for the Japanese-Chinese tasks, and up to +1.3 BLEU points for the English-Korean tasks. Empirical analysis of the individual components of JASS and ENSS reveals the complementary nature of their subtasks. Adequacy evaluation using LASER, human evaluation, and case studies shows that our proposed methods significantly outperform pre-training methods without injected linguistic knowledge and that they have a larger positive impact on adequacy than on fluency.
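The masking and reordering objectives described in the abstract can be illustrated with a small sketch. The snippet below is not the authors' implementation; it is a minimal Python illustration, assuming a hand-written bunsetsu segmentation (the paper relies on a real Japanese analyzer), a MASS-style contiguous mask span, and an arbitrary `<mask>` token and 50% masking ratio. ENSS would apply the same two operations to English phrase-structure constituents instead of bunsetsu.

```python
import random

MASK = "<mask>"  # placeholder mask token; the actual token is an assumption


def bunsetsu_mask(bunsetsu, mask_ratio=0.5, seed=0):
    """Mask a contiguous span of bunsetsu (MASS-style span masking).

    Returns (noised_source, original_target); a sequence-to-sequence model
    would be trained to reconstruct the original sentence from the masked input.
    """
    rng = random.Random(seed)
    n = len(bunsetsu)
    span = max(1, int(n * mask_ratio))
    start = rng.randint(0, n - span)
    noised = bunsetsu[:start] + [MASK] * span + bunsetsu[start + span:]
    return " ".join(noised), " ".join(bunsetsu)


def bunsetsu_reorder(bunsetsu, seed=0):
    """Shuffle bunsetsu order; the model learns to restore the original order."""
    rng = random.Random(seed)
    shuffled = bunsetsu[:]
    rng.shuffle(shuffled)
    return " ".join(shuffled), " ".join(bunsetsu)


if __name__ == "__main__":
    # Hypothetical bunsetsu segmentation of a Japanese sentence
    # ("He read a book at the library yesterday").
    chunks = ["彼は", "昨日", "図書館で", "本を", "読んだ"]
    print(bunsetsu_mask(chunks))
    print(bunsetsu_reorder(chunks))
```

Each function yields a (noised source, original target) pair; pre-training on such pairs injects chunk-level linguistic structure into the encoder-decoder before fine-tuning on the low-resource parallel data.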
Pages: 29