English - Oromo Machine Translation: An Experiment Using a Statistical Approach

Authors
Adugna, Sisay [1]
Eisele, Andreas
Affiliations
[1] Haramaya Univ, East Harerge, Ethiopia
Keywords
DOI
Not available
Chinese Library Classification
H [Language, Writing];
Discipline classification code
05;
Abstract
This paper addresses the translation of English documents into Oromo using statistical methods. Whereas English is the lingua franca of online information, Oromo, despite its relatively wide distribution within Ethiopia and neighbouring countries such as Kenya and Somalia, is one of the most resource-scarce languages. The paper has two main goals. The first is to test how far one can go with the limited parallel corpus available for the English-Oromo language pair, and to assess the applicability of existing Statistical Machine Translation (SMT) systems to this pair. The second is to analyze the system's output in order to identify the challenges that need to be tackled. Because the language is resource-scarce, the number of parallel documents available for the experiment was limited. Nevertheless, using a corpus of 20,000 bilingual sentences and 62,300 monolingual sentences, a BLEU score of 17.74% was achieved.
Pages: 2196-2199
Page count: 4