Towards Zero-Shot Multilingual Transfer for Code-Switched Responses

Authors
Wu, Ting-Wei [1 ,2 ]
Zhao, Changsheng [2 ]
Chang, Ernie [2 ]
Shi, Yangyang [2 ]
Chuang, Pierce [2 ]
Chandra, Vikas [2 ]
Juang, Biing [1 ]
Affiliations
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
[2] Meta Real Labs, Menlo Pk, CA 94010 USA
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Recent task-oriented dialog systems have achieved great success in building personal assistants for high-resource languages such as English, but extending these systems to a global audience is challenging because it requires annotated data or machine translation systems in the target language. An alternative approach is to leverage existing data in a high-resource language to enable cross-lingual transfer in low-resource language models. However, this type of transfer has not been widely explored in natural language response generation. In this research, we investigate the use of state-of-the-art multilingual models such as mBART and T5 to facilitate zero-shot and few-shot transfer of code-switched responses. We propose a new adapter-based framework that enables efficient transfer by jointly learning the task-specific, source-language, and target-language representations. Our framework successfully transfers language knowledge even when the target-language corpus is limited. We present both quantitative and qualitative analyses to evaluate the effectiveness and limitations of our approach.
Pages: 7551-7563
Number of pages: 13