Improving Machine Translation of Arabic Dialects Through Multi-task Learning

被引:0
|
作者
Moukafih, Youness [1 ,2 ]
Sbihi, Nada [1 ]
Ghogho, Mounir [1 ]
Smaili, Kamel [2 ]
机构
[1] Univ Int Rabat, Coll Engn & Architecture, TICLab, Rabat, Morocco
[2] LORIA INRIA Lorraine, 615 Rue Jardin Bot,BP 101, F-54600 Villers Les Nancy, France
关键词
Neural network; Machine translation; Multitask learning; Low-resource languages; Arabic dialects;
D O I
10.1007/978-3-031-08421-8_40
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural Machine Translation (NMT) systems have been shown to perform impressively on many language pairs compared to Statistical Machine Translation (SMT). However, these systems are data-intensive, which is problematic for the majority of language pairs, and especially for low-resource languages. In this work, we address this issue in the case of certain Arabic dialects, those variants of Modern Standard Arabic (MSA) that are spelling non-standard, morphologically rich, and yet resource-poor variants. Here, we have experimented with several multitasking learning strategies to take advantage of the relationships between these dialects. Despite the simplicity of this idea, empirical results show that several multitasking learning strategies are capable of achieving remarkable performance compared to statistical machine translation. For instance, we obtained the BLUE scores for the Algerian. Modern-Standard-Arabic and the Moroccan. Palestinian of 35.06 and 27.55, respectively, while the scores obtained with a statistical method are 15.1 and 18.91 respectively. We show that on 42 machine translation experiments, and despite the use of a small corpus, multitasking learning achieves better performance than statistical machine translation in 88% of cases.
引用
收藏
页码:580 / 590
页数:11
相关论文
共 50 条
  • [21] RESEARCH OF MULTI-TASK LEARNING BASED ON EXTREME LEARNING MACHINE
    Mao, Wentao
    Xu, Jiucheng
    Zhao, Shengjie
    Tian, Mei
    [J]. INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2013, 21 : 75 - 85
  • [22] Multi-Passage Machine Reading Comprehension Through Multi-Task Learning and Dual Verification
    Li, Xingyi
    Cheng, Xiang
    Xia, Min
    Ren, Qiyu
    He, Zhaofeng
    Su, Sen
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (10) : 5280 - 5293
  • [23] Multi-task gradient descent for multi-task learning
    Lu Bai
    Yew-Soon Ong
    Tiantian He
    Abhishek Gupta
    [J]. Memetic Computing, 2020, 12 : 355 - 369
  • [24] Multi-task gradient descent for multi-task learning
    Bai, Lu
    Ong, Yew-Soon
    He, Tiantian
    Gupta, Abhishek
    [J]. MEMETIC COMPUTING, 2020, 12 (04) : 355 - 369
  • [25] Rethinking Data Augmentation for Low-Resource Neural Machine Translation: A Multi-Task Learning Approach
    Sanchez-Cartagena, Victor M.
    Espla-Gomis, Miquel
    Antonio Perez-Ortiz, Juan
    Sanchez-Martinez, Felipe
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 8502 - 8516
  • [26] VOGUE: Answer Verbalization Through Multi-Task Learning
    Kacupaj, Endri
    Premnadh, Shyamnath
    Singh, Kuldeep
    Lehmann, Jens
    Maleshkova, Maria
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT III, 2021, 12977 : 563 - 579
  • [27] Multi-task face analyses through adversarial learning
    Wang, Shangfei
    Yin, Shi
    Hao, Longfei
    Liang, Guang
    [J]. PATTERN RECOGNITION, 2021, 114
  • [28] Unified Voice Embedding through Multi-task Learning
    Rajenthiran, Jenarthanan
    Sithamaparanathan, Lakshikka
    Uthayakumar, Saranya
    Thayasivam, Uthayasanker
    [J]. 2022 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2022), 2022, : 178 - 183
  • [29] Improving Entity Recommendation with Search Log and Multi-Task Learning
    Huang, Jizhou
    Zhang, Wei
    Sun, Yaming
    Wang, Haifeng
    Liu, Ting
    [J]. PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 4107 - 4114
  • [30] Improving radial lens distortion correction with multi-task learning
    Janos, Igor
    Benesova, Wanda
    [J]. PATTERN RECOGNITION LETTERS, 2024, 183 : 147 - 154