Improving Massively Multilingual Neural Machine Translation and Zero-Shot Translation

被引:0
|
作者
Zhang, Biao [1 ]
Williams, Philip [1 ]
Titov, Ivan [1 ,2 ]
Sennrich, Rico [1 ,3 ]
机构
[1] Univ Edinburgh, Sch Informat, Edinburgh, Midlothian, Scotland
[2] Univ Amsterdam, ILLC, Amsterdam, Netherlands
[3] Univ Zurich, Dept Computat Linguist, Zurich, Switzerland
基金
欧盟地平线“2020”; 瑞士国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Massively multilingual models for neural machine translation (NMT) are theoretically attractive, but often underperform bilingual models and deliver poor zero-shot translations. In this paper, we explore ways to improve them. We argue that multilingual NMT requires stronger modeling capacity to support language pairs with varying typological characteristics, and overcome this bottleneck via language-specific components and deepening NMT architectures. We identify the off-target translation issue (i.e. translating into a wrong target language) as the major source of the inferior zero-shot performance, and propose random online backtranslation to enforce the translation of unseen training language pairs. Experiments on OPUS-100 (a novel multilingual dataset with 100 languages) show that our approach substantially narrows the performance gap with bilingual models in both one-to-many and many-to-many settings, and improves zero-shot performance by similar to 10 BLEU, approaching conventional pivot-based methods.
引用
收藏
页码:1628 / 1639
页数:12
相关论文
共 50 条
  • [1] An Empirical Investigation of Word Alignment Supervision for Zero-Shot Multilingual Neural Machine Translation
    Raganato, Alessandro
    Vazquez, Raul
    Creutz, Mathias
    Tiedemann, Jorg
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 8449 - 8456
  • [2] Consistency by Agreement in Zero-shot Neural Machine Translation
    Al-Shedivat, Maruan
    Parikh, Ankur P.
    [J]. 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 1184 - 1197
  • [3] Monolingual Adapters for Zero-Shot Neural Machine Translation
    Philip, Jerin
    Berard, Alexandre
    Galle, Matthias
    Besacier, Laurent
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 4465 - 4470
  • [4] Massively Multilingual Neural Machine Translation
    Aharoni, Roee
    Johnson, Melvin
    Firat, Orhan
    [J]. 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 3874 - 3884
  • [5] TACKLING DATA SCARCITY IN SPEECH TRANSLATION USING ZERO-SHOT MULTILINGUAL MACHINE TRANSLATION TECHNIQUES
    Tu Anh Dinh
    Liu, Danni
    Niehues, Jan
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6222 - 6226
  • [6] Zero-Shot Cross-Lingual Transfer of Neural Machine Translation with Multilingual Pretrained Encoders
    Chen, Guanhua
    Ma, Shuming
    Chen, Yun
    Dong, Li
    Zhang, Dongdong
    Pan, Jia
    Wang, Wenping
    Wei, Furu
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 15 - 26
  • [7] Language Tags Matter for Zero-Shot Neural Machine Translation
    Wu, Liwei
    Cheng, Shanbo
    Wang, Mingxuan
    Li, Lei
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 3001 - 3007
  • [8] Improving Zero-shot Neural Machine Translation on Language-specific Encoders-Decoders
    Liao, Junwei
    Shi, Yu
    Gong, Ming
    Shou, Linjun
    Qu, Hong
    Zeng, Michael
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [9] Exploring the Impact of Layer Normalization for Zero-shot Neural Machine Translation
    Mao, Zhuoyuan
    Dabre, Raj
    Liu, Qianying
    Song, Haiyue
    Chu, Chenhui
    Kurohashi, Sadao
    [J]. 61ST CONFERENCE OF THE THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 2, 2023, : 1300 - 1316
  • [10] Multilingual translation for zero-shot biomedical classification using BioTranslator
    Xu, Hanwen
    Woicik, Addie
    Poon, Hoifung
    Altman, Russ B.
    Wang, Sheng
    [J]. NATURE COMMUNICATIONS, 2023, 14 (01)