Cross-Lingual Transfer Learning for Arabic Task-Oriented Dialogue Systems Using Multilingual Transformer Model mT5

被引:5
|
作者
Fuad, Ahlam [1 ]
Al-Yahya, Maha [1 ]
机构
[1] King Saud Univ, Coll Comp & Informat Sci, Dept Informat Technol, POB 145111, Riyadh 4545, Saudi Arabia
关键词
cross-lingual transfer learning; task-oriented dialogue systems; Arabic language; mixed-language pre-training; multilingual transformer model; mT5; natural language processing;
D O I
10.3390/math10050746
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Due to the promising performance of pre-trained language models for task-oriented dialogue systems (DS) in English, some efforts to provide multilingual models for task-oriented DS in low-resource languages have emerged. These efforts still face a long-standing challenge due to the lack of high-quality data for these languages, especially Arabic. To circumvent the cost and time-intensive data collection and annotation, cross-lingual transfer learning can be used when few training data are available in the low-resource target language. Therefore, this study aims to explore the effectiveness of cross-lingual transfer learning in building an end-to-end Arabic task-oriented DS using the mT5 transformer model. We use the Arabic task-oriented dialogue dataset (Arabic-TOD) in the training and testing of the model. We present the cross-lingual transfer learning deployed with three different approaches: mSeq2Seq, Cross-lingual Pre-training (CPT), and Mixed-Language Pre-training (MLT). We obtain good results for our model compared to the literature for Chinese language using the same settings. Furthermore, cross-lingual transfer learning deployed with the MLT approach outperform the other two approaches. Finally, we show that our results can be improved by increasing the training dataset size.
引用
收藏
页数:9
相关论文
共 10 条
  • [1] AraConv: Developing an Arabic Task-Oriented Dialogue System Using Multi-Lingual Transformer Model mT5
    Fuad, Ahlam
    Al-Yahya, Maha
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (04):
  • [2] Robust Cross-lingual Task-oriented Dialogue
    Xiang, Lu
    Zhu, Junnan
    Zhao, Yang
    Zhou, Yu
    Zong, Chengqing
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (06)
  • [3] Cross-Lingual Transfer Learning for Multilingual Task Oriented Dialog
    Schuster, Sebastian
    Gupta, Sonal
    Shah, Rushin
    Lewis, Mike
    [J]. 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 3795 - 3805
  • [4] Cross-Lingual Transfer Learning for Affective Spoken Dialogue Systems
    Gjoreski, Kristijan
    Gjoreski, Aleksandar
    Kraljevski, Ivan
    Hirschfeld, Diane
    [J]. INTERSPEECH 2019, 2019, : 1916 - 1920
  • [5] MinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems
    Lin, Zhaojiang
    Madotto, Andrea
    Winata, Genta Indra
    Fung, Pascale
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 3391 - 3405
  • [6] Attention-Informed Mixed-Language Training for Zero-Shot Cross-Lingual Task-Oriented Dialogue Systems
    Liu, Zihan
    Winata, Genta Indra
    Lin, Zhaojiang
    Xu, Peng
    Fung, Pascale
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 8433 - 8440
  • [7] Cross-Lingual Transfer Learning for Phrase Break Prediction with Multilingual Language Model
    Lee, Hoyeon
    Yoon, Hyun-Wook
    Kim, Jong-Hwan
    Kim, Jae-Min
    [J]. INTERSPEECH 2023, 2023, : 611 - 615
  • [8] CrossAligner & Co: Zero-Shot Transfer Methods for Task-Oriented Cross-lingual Natural Language Understanding
    Gritta, Milan
    Hu, Ruoyu
    Iacobacci, Ignacio
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 4048 - 4061
  • [9] Using Reinforcement Learning for Dialogue Act Classification in Task-oriented Conversation Systems
    Xia, Qingyang
    [J]. 2018 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (CSSE 2018), 2018, : 187 - 196
  • [10] Transfer Learning based Task-oriented Dialogue Policy for Multiple Domains using Hierarchical Reinforcement Learning
    Saha, Tulika
    Saha, Sriparna
    Bhattacharyya, Pushpak
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,