Multilingual Document-Level Translation Enables Zero-Shot Transfer From Sentences to Documents

Cited by: 0
Authors
Zhang, Biao [1]
Bapna, Ankur [2]
Johnson, Melvin [2]
Dabirmoghaddam, Ali [2]
Arivazhagan, Naveen [2]
Firat, Orhan [2]
Affiliations
[1] Univ Edinburgh, Sch Informat, Edinburgh, Midlothian, Scotland
[2] Google Res, Mountain View, CA USA
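Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract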
Document-level neural machine translation (DocNMT) achieves coherent translations by incorporating cross-sentence context. However, parallel documents are scarce for most language pairs, although parallel sentences are readily available. In this paper, we study whether and how contextual modeling in DocNMT is transferable via multilingual modeling. We focus on the scenario of zero-shot transfer from teacher languages with document-level data to student languages with no documents but sentence-level data, and for the first time treat document-level translation as a transfer learning problem. Using simple concatenation-based DocNMT, we explore the effect of three factors on the transfer: the number of teacher languages with document-level data, the balance between document-level and sentence-level data at training, and the data condition of parallel documents (genuine vs. back-translated). Our experiments on Europarl-7 and IWSLT-10 show the feasibility of multilingual transfer for DocNMT, particularly on document-specific metrics. We observe that more teacher languages and adequate data balance both contribute to better transfer quality. Surprisingly, the transfer is less sensitive to the data condition, where multilingual DocNMT delivers decent performance with either back-translated or genuine document pairs.
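As a rough illustration of what "concatenation-based" could mean in practice, the sketch below joins consecutive aligned sentences into longer source/target training pairs. It is a minimal sketch under stated assumptions, not the authors' implementation: the separator token `<sep>`, the function name `concat_document`, and the `window` parameter are all hypothetical, since the abstract does not specify them.

```python
from typing import List, Tuple

# Hypothetical sentence-boundary token; the abstract does not name one.
SEP = "<sep>"


def concat_document(
    src_sents: List[str],
    tgt_sents: List[str],
    window: int = 3,
) -> List[Tuple[str, str]]:
    """Group aligned sentences into non-overlapping chunks of up to `window` sentences.

    Each chunk becomes one (source, target) pair whose sentences are joined
    by SEP, so the model sees cross-sentence context on both sides.
    """
    if len(src_sents) != len(tgt_sents):
        raise ValueError("source and target must be sentence-aligned")
    if window < 1:
        raise ValueError("window must be at least 1")

    pairs = []
    for start in range(0, len(src_sents), window):
        src = f" {SEP} ".join(src_sents[start:start + window])
        tgt = f" {SEP} ".join(tgt_sents[start:start + window])
        pairs.append((src, tgt))
    return pairs
```

With `window=1` the function returns plain sentence pairs, which corresponds to the sentence-level data available for student languages; larger windows correspond to the document-level data available for teacher languages.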
Pages: 4176-4192
Page count: 17