Improving Massively Multilingual Neural Machine Translation and Zero-Shot Translation

被引:0
|
作者
Zhang, Biao [1 ]
Williams, Philip [1 ]
Titov, Ivan [1 ,2 ]
Sennrich, Rico [1 ,3 ]
机构
[1] Univ Edinburgh, Sch Informat, Edinburgh, Midlothian, Scotland
[2] Univ Amsterdam, ILLC, Amsterdam, Netherlands
[3] Univ Zurich, Dept Computat Linguist, Zurich, Switzerland
基金
瑞士国家科学基金会; 欧盟地平线“2020”;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Massively multilingual models for neural machine translation (NMT) are theoretically attractive, but often underperform bilingual models and deliver poor zero-shot translations. In this paper, we explore ways to improve them. We argue that multilingual NMT requires stronger modeling capacity to support language pairs with varying typological characteristics, and overcome this bottleneck via language-specific components and deepening NMT architectures. We identify the off-target translation issue (i.e. translating into a wrong target language) as the major source of the inferior zero-shot performance, and propose random online backtranslation to enforce the translation of unseen training language pairs. Experiments on OPUS-100 (a novel multilingual dataset with 100 languages) show that our approach substantially narrows the performance gap with bilingual models in both one-to-many and many-to-many settings, and improves zero-shot performance by similar to 10 BLEU, approaching conventional pivot-based methods.
引用
收藏
页码:1628 / 1639
页数:12
相关论文
共 50 条
  • [31] ENABLING ZERO-SHOT MULTILINGUAL SPOKEN LANGUAGE TRANSLATION WITH LANGUAGE-SPECIFIC ENCODERS AND DECODERS
    Escolano, Carlos
    Costa-jussa, Marta R.
    Fonollosa, Jose A. R.
    Segura, Carlos
    [J]. 2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 694 - 701
  • [32] Multilingual Document-Level Translation Enables Zero-Shot Transfer From Sentences to Documents
    Zhang, Biao
    Bapna, Ankur
    Johnson, Melvin
    Dabirmoghaddam, Ali
    Arivazhagan, Naveen
    Firat, Orhan
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 4176 - 4192
  • [33] A Survey of Multilingual Neural Machine Translation
    Dabre, Raj
    Chu, Chenhui
    Kunchukuttan, Anoop
    [J]. ACM COMPUTING SURVEYS, 2020, 53 (05)
  • [34] Multilingual Simultaneous Neural Machine Translation
    Arthur, Philip
    Ryu, Dongwon K.
    Haffari, Gholamreza
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 4758 - 4766
  • [35] Incremental Embedding Learning via Zero-Shot Translation
    Wei, Kun
    Deng, Cheng
    Yang, Xu
    Li, Maosen
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 10254 - 10262
  • [36] Rectifying Ill-Formed Interlingual Space: A Framework for Zero-Shot Translation on Modularized Multilingual NMT
    Liao, Junwei
    Shi, Yu
    [J]. MATHEMATICS, 2022, 10 (22)
  • [37] Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond
    Artetxe, Mikel
    Schwenk, Holger
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2019, 7 : 597 - 610
  • [38] Neural Machine Translation Based on Back-Translation for Multilingual Translation Evaluation Task
    School of Computer and Information Technology, Beijing Jiaotong University, Beijing, China
    [J]. Commun. Comput. Info. Sci, (132-141):
  • [39] Efficient Inference for Multilingual Neural Machine Translation
    Berard, Alexandre
    Lee, Dain
    Clinchant, Stephane
    Jung, Kweonwoo
    Nikoulina, Vassilina
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 8563 - 8583
  • [40] Multilingual Neural Machine Translation with Language Clustering
    Tan, Xu
    Chen, Jiale
    He, Di
    Xia, Yingce
    Qin, Tao
    Liu, Tie-Yan
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 963 - 973