Evaluating the Cross-Lingual Effectiveness of Massively Multilingual Neural Machine Translation

被引:0
|
作者
Siddhant, Aditya [1 ]
Johnson, Melvin [1 ]
Tsai, Henry [1 ]
Ari, Naveen [1 ]
Riesa, Jason [1 ]
Bapna, Ankur [1 ]
Firat, Orhan [1 ]
Raman, Karthik [1 ]
机构
[1] Google Res, Mountain View, CA 94043 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recently proposed massively multilingual neural machine translation (NMT) system has been shown to be capable of translating over 100 languages to and from English within a single model (Aharoni, Johnson, and Firat 2019). Its improved translation performance on low resource languages hints at potential cross-lingual transfer capability for downstream tasks. In this paper, we evaluate the cross-lingual effectiveness of representations from the encoder of a massively multilingual NMT model on 5 downstream classification and sequence labeling tasks covering a diverse set of over 50 languages. We compare against a strong baseline, multilingual BERT (mBERT) (Devlin et al. 2018), in different cross-lingual transfer learning scenarios and show gains in zero-shot transfer in 4 out of these 5 tasks.
引用
收藏
页码:8854 / 8861
页数:8
相关论文
共 50 条
  • [1] Unsupervised multilingual machine translation with pretrained cross-lingual encoders
    Shen, Yingli
    Bao, Wei
    Gao, Ge
    Zhou, Maoke
    Zhao, Xiaobing
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 284
  • [2] Zero-Shot Cross-Lingual Transfer of Neural Machine Translation with Multilingual Pretrained Encoders
    Chen, Guanhua
    Ma, Shuming
    Chen, Yun
    Dong, Li
    Zhang, Dongdong
    Pan, Jia
    Wang, Wenping
    Wei, Furu
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 15 - 26
  • [3] XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization
    Hu, Junjie
    Ruder, Sebastian
    Siddhant, Aditya
    Neubig, Graham
    Firat, Orhan
    Johnson, Melvin
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [4] Massively Multilingual Neural Machine Translation
    Aharoni, Roee
    Johnson, Melvin
    Firat, Orhan
    [J]. 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 3874 - 3884
  • [5] Cross-lingual Supervision Improves Unsupervised Neural Machine Translation
    Wang, Mingxuan
    Bai, Hongxiao
    Zhao, Hai
    Li, Lei
    [J]. 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, NAACL-HLT 2021, 2021, : 89 - 96
  • [6] CLIReval: Evaluating Machine Translation as a Cross-Lingual Information Retrieval Task
    Sun, Shuo
    Sia, Suzanna
    Duh, Kevin
    [J]. 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020): SYSTEM DEMONSTRATIONS, 2020, : 134 - 141
  • [7] Evaluating Effects of Machine Translation Accuracy on Cross-Lingual Patent Retrieval
    Fujii, Atsushi
    Utiyama, Masao
    Yamamoto, Mikio
    Utsuro, Takehito
    [J]. PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 674 - 675
  • [9] Unsupervised Neural Machine Translation With Cross-Lingual Language Representation Agreement
    Sun, Haipeng
    Wang, Rui
    Chen, Kehai
    Utiyama, Masao
    Sumita, Eiichiro
    Zhao, Tiejun
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 (28) : 1170 - 1182
  • [10] Cross-Lingual Preposition Disambiguation for Machine Translation
    Kumar, M. Anand
    Rajendran, S.
    Soman, K. P.
    [J]. ELEVENTH INTERNATIONAL CONFERENCE ON COMMUNICATION NETWORKS, ICCN 2015/INDIA ELEVENTH INTERNATIONAL CONFERENCE ON DATA MINING AND WAREHOUSING, ICDMW 2015/NDIA ELEVENTH INTERNATIONAL CONFERENCE ON IMAGE AND SIGNAL PROCESSING, ICISP 2015, 2015, 54 : 291 - 300