English - Oromo Machine Translation: An Experiment Using a Statistical Approach

被引:0
|
作者
Adugna, Sisay [1 ]
Eisele, Andreas
机构
[1] Haramaya Univ, East Harerge, Ethiopia
关键词
D O I
暂无
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
This paper deals with translation of English documents to Oromo using statistical methods. Whereas English is the lingua franca of online information, Oromo, despite its relative wide distribution within Ethiopia and neighbouring countries like Kenya and Somalia, is one of the most resource scarce languages. The paper has two main goals: one is to test how far we can go with the available limited parallel corpus for the English - Oromo language pair and the applicability of existing Statistical Machine Translation (SMT) systems on this language pair. The second goal is to analyze the output of the system with the objective of identifying the challenges that need to be tackled. Since the language is resource scarce as mentioned above, we cannot get as many parallel documents as we want for the experiment. However, using a limited corpus of 20,000 bilingual sentences and 62, 300 monolingual sentences, translation accuracy in terms of BLEU Score of 17.74% was achieved.
引用
收藏
页码:2196 / 2199
页数:4
相关论文
共 50 条
  • [31] Factored Statistical Machine Translation System for English to Tamil Language
    Anand, Kumar M.
    Dhanalakshmi
    Soman, K. P.
    Rajendran, S.
    PERTANIKA JOURNAL OF SOCIAL SCIENCE AND HUMANITIES, 2014, 22 (04): : 1045 - 1061
  • [32] English-Arabic Statistical Machine Translation: State of the Art
    Ebrahim, Sara
    Hegazy, Doaa
    Mostafa, Mostafa G. M.
    El-Beltagy, Samhaa R.
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2015), PT I, 2015, 9041 : 520 - 533
  • [33] English Language Statistical Machine Translation Oriented Classification Algorithm
    Yan, Jia
    Chao, Wang
    2015 INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION, BIG DATA AND SMART CITY (ICITBS), 2016, : 376 - 379
  • [34] Evaluation of English-Slovak Neural and Statistical Machine Translation
    Benkova, Lucia
    Munkova, Dasa
    Benko, Lubomir
    Munk, Michal
    APPLIED SCIENCES-BASEL, 2021, 11 (07):
  • [35] English to Bodo Phrase-Based Statistical Machine Translation
    Islam, Md Saiful
    Purkayastha, Bipul Syam
    ADVANCED COMPUTING AND COMMUNICATION TECHNOLOGIES, 2018, 562 : 207 - 217
  • [36] Classical Arabic English machine translation using rule-based approach
    Hebresha, Huda Alhusain
    Aziz, Mohd Juzaiddin Ab
    Journal of Applied Sciences, 2013, 13 (01) : 79 - 86
  • [37] Refined lexicon models for statistical machine translation using a maximum entropy approach
    Varea, IG
    Och, FJ
    Ney, H
    Casacuberta, F
    39TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2001, : 204 - 211
  • [38] A Semi-supervised Approach to Bengali-English Phrase-Based Statistical Machine Translation
    Roy, Maxim
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, 5549 : 291 - +
  • [39] A Hybrid Approach For Hindi-English Machine Translation
    Dhariya, Omkar
    Malviya, Shrikant
    Tiwary, Uma Shanker
    2017 31ST INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN), 2017, : 389 - 394
  • [40] A Hybrid Approach for Amazigh-English Machine Translation
    Taghbalout, Imane
    Allah, Fadoua Ataa
    PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND NEW TECHNOLOGIES (ICSENT '18), 2018,