Effective Guidance in Zero-Shot Multilingual Translation via Multiple Language Prototypes

被引:0
|
作者
Zheng, Yafang [1 ,2 ]
Lin, Lei [1 ,2 ]
Yuan, Yuxuan [1 ,2 ]
Shi, Xiaodong [1 ,2 ]
机构
[1] Xiamen Univ, Sch Informat, Dept Artificial Intelligence, Xiamen, Peoples R China
[2] Minist Culture & Tourism, Key Lab Digital Protect & Intelligent Proc Intang, Xiamen, Peoples R China
关键词
Zero-Shot Multilingual Machine Translation; Off-Target Issue; Language Tag Strategy;
D O I
10.1007/978-981-99-8076-5_16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In a multilingual neural machine translation model that fully shares parameters across all languages, a popular approach is to use an artificial language token to guide translation into the desired target language. However, recent studies have shown that language-specific signals in prepended language tokens are not adequate to guide the MNMT models to translate into right directions, especially on zero-shot translation (i.e., off-target translation issue). We argue that the representations of prepended language tokens are overly affected by its context information, resulting in potential information loss of language tokens and insufficient indicative ability. To address this issue, we introduce multiple language prototypes to guide translation into the desired target language. Specifically, we categorize sparse contextualized language representations into a few representative prototypes over training set, and inject their representations into each individual token to guide the models. Experiments on several multilingual datasets show that our method significantly alleviates the off-target translation issue and improves the translation quality on both zero-shot and supervised directions.
引用
收藏
页码:226 / 238
页数:13
相关论文
共 50 条
  • [1] ENABLING ZERO-SHOT MULTILINGUAL SPOKEN LANGUAGE TRANSLATION WITH LANGUAGE-SPECIFIC ENCODERS AND DECODERS
    Escolano, Carlos
    Costa-jussa, Marta R.
    Fonollosa, Jose A. R.
    Segura, Carlos
    2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 694 - 701
  • [2] Improving Massively Multilingual Neural Machine Translation and Zero-Shot Translation
    Zhang, Biao
    Williams, Philip
    Titov, Ivan
    Sennrich, Rico
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 1628 - 1639
  • [3] Transferring Zero-shot Multilingual Chinese-Chinese Translation Model for Chinese Minority Language Translation
    Yan, Ziyue
    Zan, Hongying
    Guo, Yifan
    Xu, Hongfei
    2024 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, IALP 2024, 2024, : 133 - 138
  • [4] Multilingual translation for zero-shot biomedical classification using BioTranslator
    Xu, Hanwen
    Woicik, Addie
    Poon, Hoifung
    Altman, Russ B.
    Wang, Sheng
    NATURE COMMUNICATIONS, 2023, 14 (01)
  • [5] Multilingual translation for zero-shot biomedical classification using BioTranslator
    Hanwen Xu
    Addie Woicik
    Hoifung Poon
    Russ B. Altman
    Sheng Wang
    Nature Communications, 14
  • [6] From Zero to Hero: On the Limitations of Zero-Shot Language Transfer with Multilingual Transformers
    Lauscher, Anne
    Ravishankar, Vinit
    Vulic, Ivan
    Glavas, Goran
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 4483 - 4499
  • [7] Pruning Residual Networks in Multilingual Neural Machine Translation to Improve Zero-Shot Translation
    Lu, Kaiwen
    Yang, Yating
    Dong, Rui
    Ma, Bo
    Wang, Lei
    Zhou, Xi
    Ahmat, Ahtamjan
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT III, NLPCC 2024, 2025, 15361 : 280 - 292
  • [8] Incremental Embedding Learning via Zero-Shot Translation
    Wei, Kun
    Deng, Cheng
    Yang, Xu
    Li, Maosen
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 10254 - 10262
  • [9] Learning Class Prototypes via Structure Alignment for Zero-Shot Recognition
    Jiang, Huajie
    Wang, Ruiping
    Shan, Shiguang
    Chen, Xilin
    COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 121 - 138
  • [10] Preventing Author Profiling through Zero-Shot Multilingual Back-Translation
    Adelani, David Ifeoluwa
    Zhang, Miaoran
    Shen, Xiaoyu
    Davody, Ali
    Kleinbauer, Thomas
    Klakow, Dietrich
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 8687 - 8695