Ebaluatoia: crowd evaluation for English–Basque machine translation

Cited by: 0
Authors
Nora Aranberri
Gorka Labaka
Arantza Díaz de Ilarraza
Kepa Sarasola
Affiliation
[1] University of the Basque Country (UPV/EHU), IXA Group, Faculty of Computer Science
Source
Language Resources and Evaluation, 2017, 51 (04) : 1053 - 1084
Keywords
Machine translation; Crowd evaluation; Pair-wise comparison; English; Basque
DOI
Not available
Abstract
This work explores the feasibility of a crowd-based pair-wise comparison evaluation to get feedback on machine translation progress for under-resourced languages. Specifically, we propose a task based on simple work units to compare the outputs of five English-to-Basque systems, which we implement in a web application. In our design, we put forward two key aspects that we believe community collaboration initiatives should consider in order to attract and maintain participants, that is, providing both a community challenge and a personal challenge. We describe how these aspects can comply with a strict methodology to ensure research validity. In particular, we consider the evaluation set size and the characteristics of the test sentences, the number of evaluators per comparison pair, and a mechanism to identify dishonest participation (or participants with insufficient linguistic knowledge). We also describe our dissemination effort, which targeted both general users and interest groups. Over 500 people participated actively in the Ebaluatoia campaign and we were able to collect over 35,000 evaluations in a short period of 10 days. From the results, we complete the ranking of the systems under evaluation and establish whether the difference in quality between the systems is significant.
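As a rough illustration of how pair-wise judgments of this kind can be turned into a ranking with a significance check, here is a minimal sketch. It is not the authors' procedure, which the abstract does not specify: the win-rate ranking, the exact sign test, the function names `rank_systems` and `sign_test_p`, and the system labels are all assumptions made for the example.

```python
from collections import defaultdict
from math import comb


def sign_test_p(wins_a: int, wins_b: int) -> float:
    """Two-sided exact sign test p-value for a head-to-head pair (ties excluded)."""
    n = wins_a + wins_b
    if n == 0:
        return 1.0
    k = min(wins_a, wins_b)
    # Probability of a split at least this lopsided under a fair coin.
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)


def rank_systems(judgments):
    """Rank systems by win rate over all non-tied pair-wise judgments.

    judgments: iterable of (system_a, system_b, winner), where winner is
    system_a, system_b, or None for a tie (hypothetical input format).
    Returns (ranking, pair_wins), with pair_wins[(winner, loser)] = count.
    """
    wins = defaultdict(int)
    comparisons = defaultdict(int)
    pair_wins = defaultdict(int)
    for a, b, winner in judgments:
        if winner is None:  # ties are ignored in this sketch
            continue
        loser = b if winner == a else a
        wins[winner] += 1
        comparisons[a] += 1
        comparisons[b] += 1
        pair_wins[(winner, loser)] += 1
    ranking = sorted(comparisons,
                     key=lambda s: wins[s] / comparisons[s],
                     reverse=True)
    return ranking, pair_wins


# Toy data with hypothetical system labels, not results from the paper.
judgments = (
    [("A", "B", "A")] * 8 + [("A", "B", "B")] * 2
    + [("B", "C", "B")] * 7 + [("B", "C", "C")] * 3
    + [("A", "C", "A")] * 9 + [("A", "C", "C")] * 1
    + [("A", "B", None)]
)
ranking, pair_wins = rank_systems(judgments)
p_ab = sign_test_p(pair_wins[("A", "B")], pair_wins[("B", "A")])
```

With several evaluators per comparison pair, a real campaign would typically first aggregate the votes for each sentence pair and filter out unreliable participants before a step like this; the sketch skips both.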
Pages: 1053 - 1084
Number of pages: 32
Related papers
  • [1] Ebaluatoia: crowd evaluation for English-Basque machine translation
    Aranberri, Nora
    Labaka, Gorka
    Diaz de Ilarraza, Arantza
    Sarasola, Kepa
    LANGUAGE RESOURCES AND EVALUATION, 2017, 51 (04) : 1053 - 1084
  • [2] English-Basque Statistical and Neural Machine Translation
    Unanue, Inigo Jauregi
    Garmendia Arratibel, Lierni
    Borzeshi, Ehsan Zare
    Piccardi, Massimo
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 880 - 885
  • [3] Tectogrammar-based machine translation for English-Spanish and English-Basque
    Aranberri, Nora
    Labaka, Gorka
    Jauregi, Oneka
    Diaz de Ilarraza, Arantza
    Alegria, Inaki
    Agirre, Eneko
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2016, (56) : 73 - 80
  • [4] Hybrid Machine Translation For English to Marathi: A Research Evaluation In Machine Translation
    Salunkhe, Pramod
    Kadam, Aniket D.
    Joshi, Shashank
    Patil, Shuhas
    Thakore, Devendrasingh
    Jadhav, Shrikant
    2016 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, AND OPTIMIZATION TECHNIQUES (ICEEOT), 2016, : 924 - 931
  • [5] Evaluation of Arabic to English Machine Translation Systems
    Zakraoui, Jezia
    Saleh, Moutaz
    Al-Maadeed, Somaya
    AlJa'am, Jihad Mohamad
    2020 11TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2020, : 185 - 190
  • [6] Evaluation of Machine Translation Approaches to Translate English to Bengali
    Nahar, Shamsun
    Huda, Mohammad Nurul
    Nur-E-Arefin, Md.
    Rahman, Mohammad Mahbubur
    2017 20TH INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2017
  • [7] Subjective and Objective Evaluation of English to Urdu Machine Translation
    Gupta, Vaishali
    Joshi, Nisheeth
    Mathur, Iti
    2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 1520 - 1525
  • [8] A human evaluation of English-Slovak machine translation
    Munkova, Dasa
    Panisova, Ludmila
    Welnitzova, Katarina
    PERSPECTIVES-STUDIES IN TRANSLATION THEORY AND PRACTICE, 2023, 31 (06) : 1142 - 1161
  • [9] Evaluation of Machine Translation Errors in English and Iraqi Arabic
    Condon, Sherri
    Parvaz, Dan
    Aberdeen, John
    Doran, Christy
    Freeman, Andrew
    Awad, Marwan
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010
  • [10] A Test Suite for the Evaluation of Portuguese-English Machine Translation
    Avelino, Mariana
    Macketanz, Vivien
    Avramidis, Eleftherios
    Moller, Sebastian
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2022, 2022, 13208 : 15 - 25