VERTa: a linguistic approach to automatic machine translation evaluation

被引:7
|
作者
Comelles, Elisabet [1 ]
Atserias, Jordi [2 ]
机构
[1] Univ Barcelona, Gran Via Corts Catalanes 585, E-08007 Barcelona, Spain
[2] Univ Basque Country, Paseo Manuel de Lardizabal 1, Donosti 20018, Spain
关键词
Machine translation; Machine translation evaluation; MT metric; Linguistic features; Qualitative approach;
D O I
10.1007/s10579-018-9430-2
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Machine translation (MT) is directly linked to its evaluation in order to both compare different MT system outputs and analyse system errors so that they can be addressed and corrected. As a consequence, MT evaluation has become increasingly important and popular in the last decade, leading to the development of MT evaluation metrics aiming at automatically assessing MT output. Most of these metrics use reference translations in order to compare system output, and the most well-known and widely spread work at lexical level. In this study we describe and present a linguistically-motivated metric, VERTa, which aims at using and combining a wide variety of linguistic features at lexical, morphological, syntactic and semantic level. Before designing and developing VERTa a qualitative linguistic analysis of data was performed so as to identify the linguistic phenomena that an MT metric must consider (Comelles et al. 2017). In the present study we introduce VERTa's design and architecture and we report the experiments performed in order to develop the metric and to check the suitability and interaction of the linguistic information used. The experiments carried out go beyond traditional correlation scores and step towards a more qualitative approach based on linguistic analysis. Finally, in order to check the validity of the metric, an evaluation has been conducted comparing the metric's performance to that of other well-known state-of-the-art MT metrics.
引用
收藏
页码:57 / 86
页数:30
相关论文
共 50 条
  • [1] VERTa: a linguistic approach to automatic machine translation evaluation
    Elisabet Comelles
    Jordi Atserias
    [J]. Language Resources and Evaluation, 2019, 53 : 57 - 86
  • [2] Linguistic measures for automatic machine translation evaluation
    Giménez, Jesús
    Màrquez, Lluís
    [J]. Machine Translation, 2010, 24 (3-4) : 209 - 240
  • [3] A machine learning approach to the automatic evaluation of machine translation
    Corston-Oliver, S
    Gamon, M
    Brockett, C
    [J]. 39TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2001, : 140 - 147
  • [4] VERTa: Linguistic Features in MT Evaluation
    Comelles, Elisabet
    Atserias, Jordi
    Arranz, Victoria
    Castellon, Irene
    [J]. LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 3944 - 3950
  • [5] Optimizing Automatic Evaluation of Machine Translation with the ListMLE Approach
    Li, Maoxi
    Wang, Mingwen
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2019, 18 (01)
  • [6] Towards human linguistic machine translation evaluation
    Costa-jussa, Marta R.
    Farrus, Mireia
    [J]. DIGITAL SCHOLARSHIP IN THE HUMANITIES, 2015, 30 (02) : 157 - 166
  • [7] Incorporation of Linguistic Features in Machine Translation Evaluation of Arabic
    El Marouani, Mohamed
    Boudaa, Tarik
    Enneya, Nourddine
    [J]. BIG DATA, CLOUD AND APPLICATIONS, BDCA 2018, 2018, 872 : 500 - 511
  • [8] Neutralizing the Effect of Translation Shifts on Automatic Machine Translation Evaluation
    Fomicheva, Marina
    Bel, Nuria
    da Cunha, Iria
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2015), PT I, 2015, 9041 : 596 - 607
  • [9] The METEOR metric for automatic evaluation of machine translation
    Lavie, Alon
    Denkowski, Michael J.
    [J]. MACHINE TRANSLATION, 2009, 23 (2-3) : 105 - 115
  • [10] An automatic evaluation of machine translation and Slavic languages
    Munkova, Dasa
    Munk, Michal
    [J]. 2014 IEEE 8TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT), 2014, : 447 - 451