Languages You Know Influence Those You Learn: Impact of Language Characteristics on Multi-Lingual Text-to-Text Transfer

Cited by: 0
Authors
Muller, Benjamin [1]
Gupta, Deepanshu [2]
Patwardhan, Siddharth [2]
Fauconnier, Jean-Philippe [2]
Vandyke, David [2]
Agarwal, Sachin [2]
Affiliations
[1] INRIA, Paris, France
[2] Apple, Cupertino, CA USA
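Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract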
In this work, we analyze a pre-trained mT5 to discover the attributes of cross-lingual connections learned by this model. Through a statistical interpretation framework over 90 language pairs across three tasks, we show that transfer performance can be modeled by a few linguistic and data-derived features. These observations enable us to interpret the cross-lingual understanding of the mT5 model. Through these observations, one can choose the best source language for a task and anticipate its training data demands. A key finding of this work is that similarities in syntax, morphology, and phonology are good predictors of cross-lingual transfer, significantly better than the lexical similarity of languages alone. For a given language, we are able to predict zero-shot performance, which increases on a logarithmic scale with the number of few-shot target-language data points.
Pages: 88-102
Page count: 15
Related papers
7 items in total
  • [1] Multi-lingual scene text detection and language identification
    Saha, Shaswata
    Chakraborty, Neelotpal
    Kundu, Soumyadeep
    Paul, Sayantan
    Mollah, Ayatullah Faruk
    Basu, Subhadip
    Sarkar, Ram
    PATTERN RECOGNITION LETTERS, 2020, 138 : 16 - 22
  • [3] OUTLINEGEN: Multi-lingual Outline Generation for Encyclopedic Text in Low Resource Languages
    Subramanian, Shivansh
    Taunk, Dhaval
    Gupta, Manish
    Varma, Vasudeva
    SOCIAL NETWORKS ANALYSIS AND MINING, ASONAM 2024, PT II, 2025, 15212 : 149 - 159
  • [4] The Impact of Translating Resource-Rich Datasets to Low-Resource Languages Through Multi-Lingual Text Processing
    Ghafoor, Abdul
    Imran, Ali Shariq
    Daudpota, Sher Muhammad
    Kastrati, Zenun
    Abdullah
    Batra, Rakhi
    Wani, Mudasir Ahmad
    IEEE ACCESS, 2021, 9 : 124478 - 124490
  • [5] Language identification from multi-lingual scene text images: a CNN based classifier ensemble approach
    Neelotpal Chakraborty
    Soumyadeep Kundu
    Sayantan Paul
    Ayatullah Faruk Mollah
    Subhadip Basu
    Ram Sarkar
    Journal of Ambient Intelligence and Humanized Computing, 2021, 12 : 7997 - 8008
  • [6] Language identification from multi-lingual scene text images: a CNN based classifier ensemble approach
    Chakraborty, Neelotpal
    Kundu, Soumyadeep
    Paul, Sayantan
    Mollah, Ayatullah Faruk
    Basu, Subhadip
    Sarkar, Ram
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (07) : 7997 - 8008
  • [7] Transfer Learning for Low-Resource, Multi-Lingual, and Zero-Shot Multi-Speaker Text-to-Speech
    Jeong, Myeonghun
    Kim, Minchan
    Choi, Byoung Jin
    Yoon, Jaesam
    Jang, Won
    Kim, Nam Soo
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1519 - 1530