A set of recommendations for assessing human-machine parity in language translation

被引:0
|
作者
Läubli S. [1 ]
Castilho S. [2 ]
Neubig G. [3 ]
Sennrich R. [1 ]
Shen Q. [3 ]
Toral A. [4 ]
机构
[1] Institute of Computational Linguistics, University of Zurich
[2] ADAPT Centre, Dublin City University
[3] Language Technologies Institute, Carnegie Mellon University
[4] Center for Language and Cognition, University of Groningen
关键词
D O I
10.1613/JAIR.1.11371
中图分类号
学科分类号
摘要
The quality of machine translation has increased remarkably over the past years, to the degree that it was found to be indistinguishable from professional human translation in a number of empirical investigations. We reassess Hassan et al.'s 2018 investigation into Chinese to English news translation, showing that the finding of human-machine parity was owed to weaknesses in the evaluation design-which is currently considered best practice in the field. We show that the professional human translations contained significantly fewer errors, and that perceived quality in human evaluation depends on the choice of raters, the availability of linguistic context, and the creation of reference translations. Our results call for revisiting current best practices to assess strong machine translation systems in general and human-machine parity in particular, for which we offer a set of recommendations based on our empirical findings. © 2020 AI Access Foundation. All rights reserved.
引用
收藏
页码:653 / 672
页数:19
相关论文
共 50 条
  • [31] Human-Machine Intelligence
    Zhang C.
    Kim J.
    Jeon J.
    Xing J.
    Ahn C.R.
    Tang P.
    Cai H.
    Civil Engineering Magazine Archive, 2023, 93 (05): : 74 - 79
  • [32] Human-machine communication
    Farbrot, JE
    Nihlwing, C
    Svengren, H
    ATW-INTERNATIONAL JOURNAL FOR NUCLEAR POWER, 2005, 50 (02): : 96 - +
  • [33] ON HUMAN-MACHINE INTERFACE
    BUHR, P
    COMMUNICATIONS OF THE ACM, 1983, 26 (07) : 463 - 464
  • [34] On human-machine relations
    Degani, Asaf
    Goldman, Claudia V.
    Deutsch, Omer
    Tsimhoni, Omer
    COGNITION TECHNOLOGY & WORK, 2017, 19 (2-3) : 211 - 231
  • [35] Natural language human-machine interface using artificial neural networks
    Majewski, Maciej
    Kacalak, Wojciech
    ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 3, PROCEEDINGS, 2006, 3973 : 1161 - 1166
  • [36] Language analysis in a specific-domain human-machine dialog system
    Guo, Rong
    Yu, Tao
    Lu, Ruzhan
    2003, Shanghai Computer Society (29):
  • [37] Speech production in human-machine dialogue: A natural language generation perspective
    Grote, B
    Hagen, E
    Stein, A
    Teich, E
    DIALOGUE PROCESSING IN SPOKEN LANGUAGE SYSTEMS, 1997, 1236 : 70 - 85
  • [38] Improving Language Models in Speech-Based Human-Machine Interaction
    Justo, Raquel
    Saz, Oscar
    Miguel, Antonio
    Ines Torres, M.
    Lleida, Eduardo
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2013, 10
  • [39] Human-Machine Interface for Mobile Robot Based on Natural Language Processing
    Masek, P.
    Ruzicka, M.
    MECHATRONICS 2013: RECENT TECHNOLOGICAL AND SCIENTIFIC ADVANCES, 2014, : 583 - 590
  • [40] Human-machine symbiosis: A multivariate perspective for physically coupled human-machine systems
    Inga, Jairo
    Ruess, Miriam
    Robens, Jan Heinrich
    Nelius, Thomas
    Rothfuss, Simon
    Kille, Sean
    Dahlinger, Philipp
    Lindenmann, Andreas
    Thomaschke, Roland
    Neumann, Gerhard
    Matthiesen, Sven
    Hohmann, Soren
    Kiesel, Andrea
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 2023, 170