Effective Guidance in Zero-Shot Multilingual Translation via Multiple Language Prototypes

被引:0
|
作者
Zheng, Yafang [1 ,2 ]
Lin, Lei [1 ,2 ]
Yuan, Yuxuan [1 ,2 ]
Shi, Xiaodong [1 ,2 ]
机构
[1] Xiamen Univ, Sch Informat, Dept Artificial Intelligence, Xiamen, Peoples R China
[2] Minist Culture & Tourism, Key Lab Digital Protect & Intelligent Proc Intang, Xiamen, Peoples R China
关键词
Zero-Shot Multilingual Machine Translation; Off-Target Issue; Language Tag Strategy;
D O I
10.1007/978-981-99-8076-5_16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In a multilingual neural machine translation model that fully shares parameters across all languages, a popular approach is to use an artificial language token to guide translation into the desired target language. However, recent studies have shown that language-specific signals in prepended language tokens are not adequate to guide the MNMT models to translate into right directions, especially on zero-shot translation (i.e., off-target translation issue). We argue that the representations of prepended language tokens are overly affected by its context information, resulting in potential information loss of language tokens and insufficient indicative ability. To address this issue, we introduce multiple language prototypes to guide translation into the desired target language. Specifically, we categorize sparse contextualized language representations into a few representative prototypes over training set, and inject their representations into each individual token to guide the models. Experiments on several multilingual datasets show that our method significantly alleviates the off-target translation issue and improves the translation quality on both zero-shot and supervised directions.
引用
收藏
页码:226 / 238
页数:13
相关论文
共 50 条
  • [1] ENABLING ZERO-SHOT MULTILINGUAL SPOKEN LANGUAGE TRANSLATION WITH LANGUAGE-SPECIFIC ENCODERS AND DECODERS
    Escolano, Carlos
    Costa-jussa, Marta R.
    Fonollosa, Jose A. R.
    Segura, Carlos
    [J]. 2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 694 - 701
  • [2] Improving Massively Multilingual Neural Machine Translation and Zero-Shot Translation
    Zhang, Biao
    Williams, Philip
    Titov, Ivan
    Sennrich, Rico
    [J]. 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 1628 - 1639
  • [3] Multilingual translation for zero-shot biomedical classification using BioTranslator
    Xu, Hanwen
    Woicik, Addie
    Poon, Hoifung
    Altman, Russ B.
    Wang, Sheng
    [J]. NATURE COMMUNICATIONS, 2023, 14 (01)
  • [4] Multilingual translation for zero-shot biomedical classification using BioTranslator
    Hanwen Xu
    Addie Woicik
    Hoifung Poon
    Russ B. Altman
    Sheng Wang
    [J]. Nature Communications, 14
  • [5] From Zero to Hero: On the Limitations of Zero-Shot Language Transfer with Multilingual Transformers
    Lauscher, Anne
    Ravishankar, Vinit
    Vulic, Ivan
    Glavas, Goran
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 4483 - 4499
  • [6] Incremental Embedding Learning via Zero-Shot Translation
    Wei, Kun
    Deng, Cheng
    Yang, Xu
    Li, Maosen
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 10254 - 10262
  • [7] Learning Class Prototypes via Structure Alignment for Zero-Shot Recognition
    Jiang, Huajie
    Wang, Ruiping
    Shan, Shiguang
    Chen, Xilin
    [J]. COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 121 - 138
  • [8] Preventing Author Profiling through Zero-Shot Multilingual Back-Translation
    Adelani, David Ifeoluwa
    Zhang, Miaoran
    Shen, Xiaoyu
    Davody, Ali
    Kleinbauer, Thomas
    Klakow, Dietrich
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 8687 - 8695
  • [9] Improving Zero-shot Translation with Language-Independent Constraints
    Pham, Ngoc-Quan
    Niehues, Jan
    Ha, Thanh-Le
    Waibel, Alex
    [J]. FOURTH CONFERENCE ON MACHINE TRANSLATION (WMT 2019), VOL 1: RESEARCH PAPERS, 2019, : 13 - 23
  • [10] Language Tags Matter for Zero-Shot Neural Machine Translation
    Wu, Liwei
    Cheng, Shanbo
    Wang, Mingxuan
    Li, Lei
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 3001 - 3007