Multilingual Neural Machine Translation: Can Linguistic Hierarchies Help?

Authors
Saleh, Fahimeh [1]
Buntine, Wray [1]
Haffari, Gholamreza [1]
Du, Lan [1]
Affiliations
[1] Monash Univ, Melbourne, Australia
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Multilingual Neural Machine Translation (MNMT) trains a single NMT model that supports translation between multiple languages, rather than training separate models for different languages. Learning a single model can improve low-resource translation by leveraging data from multiple languages. However, the performance of an MNMT model depends heavily on which languages are used in training, because transferring knowledge from a diverse set of languages degrades translation performance through negative transfer. In this paper, we propose a Hierarchical Knowledge Distillation (HKD) approach for MNMT that capitalises on language groups, generated according to the typological features and phylogeny of languages, to overcome negative transfer. HKD first generates a set of multilingual teacher-assistant models via a selective knowledge distillation mechanism based on the language groups, and then distills the ultimate multilingual model from those assistants in an adaptive way. Experimental results on the TED dataset with 53 languages demonstrate the effectiveness of our approach in avoiding negative transfer in MNMT, leading to improved translation performance (about 1 BLEU point on average) compared to strong baselines.
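The abstract does not say how the selective distillation step decides when a student should learn from a teacher. Purely as an illustration, the sketch below shows one common form of token-level knowledge distillation with a selection gate. Everything in it is an assumption for the sake of the example: the function names, the `alpha` mixing weight, and in particular the gate (use the teacher only when it assigns a lower loss to the reference token than the student does) are hypothetical and are not taken from the paper.

```python
import math


def softmax(logits):
    """Convert raw scores into a probability distribution (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(probs, target_index):
    """Negative log-likelihood of the reference token."""
    return -math.log(probs[target_index])


def kl_divergence(p, q):
    """KL(p || q) for two distributions over the same vocabulary."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def selective_distillation_loss(student_logits, teacher_logits,
                                target_index, alpha=0.5):
    """Loss for one target token.

    ASSUMPTION (not from the paper): the teacher is consulted only when it
    predicts the reference token better than the student; otherwise the
    student trains on the plain negative log-likelihood.
    """
    student = softmax(student_logits)
    teacher = softmax(teacher_logits)
    nll_student = cross_entropy(student, target_index)
    nll_teacher = cross_entropy(teacher, target_index)
    if nll_teacher < nll_student:
        # Teacher is better here: mix the NLL with a pull towards the teacher.
        return (1 - alpha) * nll_student + alpha * kl_divergence(teacher, student)
    # Teacher is no better: ignore it to limit negative transfer.
    return nll_student
```

In a hierarchical setup along the lines the abstract describes, a loss of this kind could be applied twice: once to distill each language group's teacher-assistant, and again to distill the final multilingual model from the assistants. How the paper weights or selects among assistants in its adaptive step is not stated in this record.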
Pages: 1313-1330
Page count: 18