Effective Guidance in Zero-Shot Multilingual Translation via Multiple Language Prototypes

被引:0
|
作者
Zheng, Yafang [1 ,2 ]
Lin, Lei [1 ,2 ]
Yuan, Yuxuan [1 ,2 ]
Shi, Xiaodong [1 ,2 ]
机构
[1] Xiamen Univ, Sch Informat, Dept Artificial Intelligence, Xiamen, Peoples R China
[2] Minist Culture & Tourism, Key Lab Digital Protect & Intelligent Proc Intang, Xiamen, Peoples R China
关键词
Zero-Shot Multilingual Machine Translation; Off-Target Issue; Language Tag Strategy;
D O I
10.1007/978-981-99-8076-5_16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In a multilingual neural machine translation model that fully shares parameters across all languages, a popular approach is to use an artificial language token to guide translation into the desired target language. However, recent studies have shown that language-specific signals in prepended language tokens are not adequate to guide the MNMT models to translate into right directions, especially on zero-shot translation (i.e., off-target translation issue). We argue that the representations of prepended language tokens are overly affected by its context information, resulting in potential information loss of language tokens and insufficient indicative ability. To address this issue, we introduce multiple language prototypes to guide translation into the desired target language. Specifically, we categorize sparse contextualized language representations into a few representative prototypes over training set, and inject their representations into each individual token to guide the models. Experiments on several multilingual datasets show that our method significantly alleviates the off-target translation issue and improves the translation quality on both zero-shot and supervised directions.
引用
收藏
页码:226 / 238
页数:13
相关论文
共 50 条
  • [21] Zero-Shot Reward Specification via Grounded Natural Language
    Mahmoudieh, Parsa
    Pathak, Deepak
    Darrell, Trevor
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [22] Zero-shot autonomous robot manipulation via natural language
    Han, Changheon
    Lee, Jiho
    Lee, Hojun
    Sim, Yuseop
    Jeon, Jurim
    Jun, Martin Byung-Guk
    MANUFACTURING LETTERS, 2024, 42 : 16 - 20
  • [23] Translate to Disambiguate: Zero-shot Multilingual Word Sense Disambiguation with Pretrained Language Models
    Kang, Haoqiang
    Blevins, Terra
    Zettlemoyer, Luke
    PROCEEDINGS OF THE 18TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 1562 - 1575
  • [24] ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation
    Yang, Bang
    Liu, Fenglin
    Zou, Yuexian
    Wu, Xian
    Wang, Yaowei
    Clifton, David A.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (08) : 5712 - 5724
  • [25] Zero-Shot Object Counting With Vision-Language Prior Guidance Network
    Zhai, Wenzhe
    Xing, Xianglei
    Gao, Mingliang
    Li, Qilei
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (03) : 2487 - 2498
  • [26] Zero-Shot Learners for Natural Language Understanding via a Unified Multiple-Choice Perspective
    Wang, Junjie
    Yang, Ping
    Gan, Ruyi
    Zhang, Yuxiang
    Zhang, Jiaxing
    Sakai, Tetsuya
    IEEE ACCESS, 2023, 11 : 142829 - 142845
  • [27] Zero-Shot Recommendation as Language Modeling
    Sileo, Damien
    Vossen, Wout
    Raymaekers, Robbe
    ADVANCES IN INFORMATION RETRIEVAL, PT II, 2022, 13186 : 223 - 230
  • [28] Multilingual Document-Level Translation Enables Zero-Shot Transfer From Sentences to Documents
    Zhang, Biao
    Bapna, Ankur
    Johnson, Melvin
    Dabirmoghaddam, Ali
    Arivazhagan, Naveen
    Firat, Orhan
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 4176 - 4192
  • [29] Enhancing Zero-Shot Translation in Multilingual Neural Machine Translation: Focusing on Obtaining Location-Agnostic Representations
    Zhang, Jiarui
    Huang, Heyan
    Hu, Yue
    Guo, Ping
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT VII, 2024, 15022 : 194 - 208
  • [30] Towards Zero-shot Language Modeling
    Ponti, Edoardo M.
    Vulic, Ivan
    Cotterell, Ryan
    Reichart, Roi
    Korhonen, Anna
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 2900 - +