The Impact of Word Segmentation Techniques on Neural and Statistical Machine Translation: English-Arabic Case

Cited by: 0
Authors
Berrichi, Safae [1]
Mazroui, Azzeddine [1]
Affiliations
[1] Mohammed First Univ, Fac Sci, Dept Comp Sci, Oujda, Morocco
Keywords
Machine translation; Morphological segmentation; Sub-word segmentation; Statistical approach; Neural approach
DOI
10.1007/978-3-030-90633-7_38
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper addresses machine translation between English and Arabic. The task is challenging because of the morphological richness of Arabic and the lack of large parallel corpora. To address these issues, we examine the impact of word segmentation (sub-word and morphological segmentation) on machine translation performance. We test both the statistical approach and the neural approach, which has been widely adopted in recent years owing to its promising results. In our experiments, carried out in the English-to-Arabic direction on the United Nations parallel corpus, applying morphological segmentation to the target language proved very beneficial, whereas sub-word segmentation had no significant impact on either the neural or the statistical models.
Pages: 454-462
Page count: 9