Ebaluatoia: crowd evaluation for English–Basque machine translation

Cited by: 0

Authors:

Nora Aranberri

Gorka Labaka

Arantza Díaz de Ilarraza

Kepa Sarasola

Affiliation:

[1] University of the Basque Country (UPV/EHU), IXA Group, Faculty of Computer Science

Source:

Language Resources and Evaluation | 2017 / Vol. 51

Keywords:

Machine translation; Crowd evaluation; Pair-wise comparison; English; Basque

DOI:

Not available

Abstract:

This work explores the feasibility of a crowd-based pair-wise comparison evaluation to get feedback on machine translation progress for under-resourced languages. Specifically, we propose a task based on simple work units to compare the outputs of five English-to-Basque systems, which we implement in a web application. In our design, we put forward two key aspects that we believe community collaboration initiatives should consider in order to attract and maintain participants, that is, providing both a community challenge and a personal challenge. We describe how these aspects can comply with a strict methodology to ensure research validity. In particular, we consider the evaluation set size and the characteristics of the test sentences, the number of evaluators per comparison pair, and a mechanism to identify dishonest participation (or participants with insufficient linguistic knowledge). We also describe our dissemination effort, which targeted both general users and interest groups. Over 500 people participated actively in the Ebaluatoia campaign and we were able to collect over 35,000 evaluations in a short period of 10 days. From the results, we complete the ranking of the systems under evaluation and establish whether the difference in quality between the systems is significant.

Pages: 1053-1084

Number of pages: 31

Aranberri, Nora; Labaka, Gorka; Diaz de Ilarraza, Arantza; Sarasola, Kepa. Ebaluatoia: crowd evaluation for English-Basque machine translation. Language Resources and Evaluation, 2017, 51 (04): 1053-1084.