Cross-Lingual Transfer Learning for Multilingual Task Oriented Dialog

Authors
Schuster, Sebastian [1, 4]
Gupta, Sonal [2]
Shah, Rushin [2]
Lewis, Mike [3]
Affiliations
[1] Stanford Linguist, Stanford, CA 94305 USA
[2] Facebook Conversat AI, New York, NY USA
[3] Facebook AI Res, New York, NY USA
[4] Facebook, New York, NY USA
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
One of the first steps in the utterance interpretation pipeline of many task-oriented conversational AI systems is to identify user intents and the corresponding slots. Since data collection for machine learning models for this task is time-consuming, it is desirable to make use of existing data in a high-resource language to train models in low-resource languages. However, development of such models has largely been hindered by the lack of multilingual training data. In this paper, we present a new data set of 57k annotated utterances in English (43k), Spanish (8.6k) and Thai (5k) across the domains weather, alarm, and reminder. We use this data set to evaluate three different cross-lingual transfer methods: (1) translating the training data, (2) using cross-lingual pre-trained embeddings, and (3) a novel method of using a multilingual machine translation encoder as contextual word representations. We find that given several hundred training examples in the target language, the latter two methods outperform translating the training data. Further, in very low-resource settings, multilingual contextual word representations give better results than using cross-lingual static embeddings. We also compare the cross-lingual methods to using monolingual resources in the form of contextual ELMo representations and find that given just small amounts of target language data, this method outperforms all cross-lingual methods, which highlights the need for more sophisticated cross-lingual methods.
Pages: 3795-3805
Number of pages: 11