A set of recommendations for assessing human-machine parity in language translation

被引:0
|
作者
Läubli S. [1 ]
Castilho S. [2 ]
Neubig G. [3 ]
Sennrich R. [1 ]
Shen Q. [3 ]
Toral A. [4 ]
机构
[1] Institute of Computational Linguistics, University of Zurich
[2] ADAPT Centre, Dublin City University
[3] Language Technologies Institute, Carnegie Mellon University
[4] Center for Language and Cognition, University of Groningen
关键词
D O I
10.1613/JAIR.1.11371
中图分类号
学科分类号
摘要
The quality of machine translation has increased remarkably over the past years, to the degree that it was found to be indistinguishable from professional human translation in a number of empirical investigations. We reassess Hassan et al.'s 2018 investigation into Chinese to English news translation, showing that the finding of human-machine parity was owed to weaknesses in the evaluation design-which is currently considered best practice in the field. We show that the professional human translations contained significantly fewer errors, and that perceived quality in human evaluation depends on the choice of raters, the availability of linguistic context, and the creation of reference translations. Our results call for revisiting current best practices to assess strong machine translation systems in general and human-machine parity in particular, for which we offer a set of recommendations based on our empirical findings. © 2020 AI Access Foundation. All rights reserved.
引用
收藏
页码:653 / 672
页数:19
相关论文
共 50 条
  • [1] A Set of Recommendations for Assessing Human-Machine Parity in Language Translation
    Laeubli, Samuel
    Castilho, Sheila
    Neubig, Graham
    Sennrich, Rico
    Shen, Qinlan
    Toral, Antonio
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2020, 67 : 653 - 672
  • [2] Assessing Human-Parity in Machine Translation on the Segment Level
    Graham, Yvette
    Federmann, Christian
    Eskevich, Maria
    Haddow, Barry
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 4199 - 4207
  • [3] A Human-Machine Language Dictionary
    Fei Liu
    Shirin Akther Khanam
    Yi-Ping Phoebe Chen
    International Journal of Computational Intelligence Systems, 2020, 13 : 904 - 913
  • [4] A Human-Machine Language Dictionary
    Liu, Fei
    Khanam, Shirin Akther
    Chen, Yi-Ping Phoebe
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2020, 13 (01) : 904 - 913
  • [5] hmCodeTrans: Human-Machine Interactive Code Translation
    Liu, Jiaqi
    Zhang, Fengming
    Zhang, Xin
    Yu, Zhiwen
    Wang, Liang
    Zhang, Yao
    Guo, Bin
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2024, 50 (05) : 1163 - 1181
  • [6] Decolonizing Language Resources in The Human-Machine Era
    Gammelgaard, Anna Osterskov
    Pedersen, Casper Gjodvad
    Kaspersen, Emilie Strudahl
    Thomsen, Marius Risbaek
    Samson, Jonathan Kok
    Fabricius, Anne H.
    INTERVENTIONS-INTERNATIONAL JOURNAL OF POSTCOLONIAL STUDIES, 2024, 26 (08): : 1189 - 1210
  • [7] Human-machine Translation Model Evaluation Based on Artificial Intelligence Translation
    Li, Ruicha
    Nawi, Abdullah Mohd
    Kang, Myoung Sook
    EMITTER-INTERNATIONAL JOURNAL OF ENGINEERING TECHNOLOGY, 2023, 11 (02)
  • [8] HMPT: a human-machine cooperative program translation method
    Zhang, Xin
    Yu, Zhiwen
    Liu, Jiaqi
    Wang, Hui
    Wang, Liang
    Guo, Bin
    AUTOMATED SOFTWARE ENGINEERING, 2023, 30 (02)
  • [9] A Human-machine Cooperation Protocol for Machine Translation Output Edit Annotation
    Costa, Felipe de Almeida
    Pagano, Adriana S.
    Ferreira, Thiago Castro
    Meira, Wagner, Jr.
    TRADUMATICA-TRADUCCIO I TECNOLOGIES DE LA INFORMACIO I LA COMUNICACIO, 2021, (19): : 148 - 170
  • [10] On "Human Parity" and "Super Human Performance" in Machine Translation Evaluation
    Poibeau, Thierry
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 6018 - 6023