Improving Machine Translation of Arabic Dialects Through Multi-task Learning

被引:0
|
作者
Moukafih, Youness [1 ,2 ]
Sbihi, Nada [1 ]
Ghogho, Mounir [1 ]
Smaili, Kamel [2 ]
机构
[1] Univ Int Rabat, Coll Engn & Architecture, TICLab, Rabat, Morocco
[2] LORIA INRIA Lorraine, 615 Rue Jardin Bot,BP 101, F-54600 Villers Les Nancy, France
关键词
Neural network; Machine translation; Multitask learning; Low-resource languages; Arabic dialects;
D O I
10.1007/978-3-031-08421-8_40
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural Machine Translation (NMT) systems have been shown to perform impressively on many language pairs compared to Statistical Machine Translation (SMT). However, these systems are data-intensive, which is problematic for the majority of language pairs, and especially for low-resource languages. In this work, we address this issue in the case of certain Arabic dialects, those variants of Modern Standard Arabic (MSA) that are spelling non-standard, morphologically rich, and yet resource-poor variants. Here, we have experimented with several multitasking learning strategies to take advantage of the relationships between these dialects. Despite the simplicity of this idea, empirical results show that several multitasking learning strategies are capable of achieving remarkable performance compared to statistical machine translation. For instance, we obtained the BLUE scores for the Algerian. Modern-Standard-Arabic and the Moroccan. Palestinian of 35.06 and 27.55, respectively, while the scores obtained with a statistical method are 15.1 and 18.91 respectively. We show that on 42 machine translation experiments, and despite the use of a small corpus, multitasking learning achieves better performance than statistical machine translation in 88% of cases.
引用
收藏
页码:580 / 590
页数:11
相关论文
共 50 条
  • [1] Improving Robustness of Neural Machine Translation with Multi-task Learning
    Zhou, Shuyan
    Zeng, Xiangkai
    Zhou, Yingqi
    Anastasopoulos, Antonios
    Neubig, Graham
    [J]. FOURTH CONFERENCE ON MACHINE TRANSLATION (WMT 2019), 2019, : 565 - 571
  • [2] Multi-task Learning for Multilingual Neural Machine Translation
    Wang, Yiren
    Zhai, ChengXiang
    Awadalla, Hany Hassan
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 1022 - 1034
  • [3] Adaptive Knowledge Sharing in Multi-Task Learning: Improving Low-Resource Neural Machine Translation
    Zaremoodi, Poorya
    Buntine, Wray
    Haffari, Gholamreza
    [J]. PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2018, : 656 - 661
  • [4] Neural Machine Translation Based on Multi-task Learning of Discourse Structure
    Kang, Xiao-Mian
    Zong, Cheng-Qing
    [J]. Ruan Jian Xue Bao/Journal of Software, 2022, 33 (10): : 3806 - 3818
  • [5] Machine translation for Arabic dialects (survey)
    Harrat, Salima
    Meftouh, Karima
    Smaili, Kamel
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2019, 56 (02) : 262 - 273
  • [6] Autocorrect in the Process of Translation- Multi-task Learning Improves Dialogue Machine Translation
    Wang, Tao
    Zhao, Chengqi
    Wang, Mingxuan
    Li, Lei
    Xiong, Deyi
    [J]. 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, NAACL-HLT 2021, 2021, : 105 - 112
  • [7] Multi-Task Learning with Shared Encoder for Non-Autoregressive Machine Translation
    Hao, Yongchang
    He, Shilin
    Jiao, Wenxiang
    Tu, Zhaopeng
    Lyu, Michael R.
    Wang, Xing
    [J]. 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 3989 - 3996
  • [8] Training Flexible Depth Model by Multi-Task Learning for Neural Machine Translation
    Wang, Qiang
    Xiao, Tong
    Zhu, Jingbo
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 4307 - 4312
  • [9] Multi-Task Learning for Multiple Language Translation
    Dong, Daxiang
    Wu, Hua
    He, Wei
    Yu, Dianhai
    Wang, Haifeng
    [J]. PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, 2015, : 1723 - 1732
  • [10] Improving Machine Reading Comprehension with Multi-Task Learning and Self-Training
    Ouyang, Jianquan
    Fu, Mengen
    [J]. MATHEMATICS, 2022, 10 (03)