Assessing Human-Parity in Machine Translation on the Segment Level

被引:0
|
作者
Graham, Yvette [1 ]
Federmann, Christian [2 ]
Eskevich, Maria [3 ]
Haddow, Barry [4 ]
机构
[1] Trinity Coll Dublin, ADAPT, Dublin, Ireland
[2] Microsoft Res, Redmond, WA USA
[3] CLARIN ERIC, Utrecht, Netherlands
[4] Univ Edinburgh, Sch Informat, Edinburgh, Midlothian, Scotland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent machine translation shared tasks have shown top-performing systems to tie or in some cases even outperform human translation. Such conclusions about system and human performance are, however, based on estimates aggregated from scores collected over large test sets of translations and so leave some remaining questions unanswered. For instance, simply because a system significantly outperforms the human translator on average may not necessarily mean that it has done so for every translation in the test set. Furthermore, are there remaining source segments present in evaluation test sets that cause significant challenges for top-performing systems and can such challenging segments go unnoticed due to the opacity of current human evaluation procedures ? To provide insight into these issues we carefully inspect the outputs of top-performing systems in the recent WMT19 news translation shared task for all language pairs in which a system either tied or outperformed human translation. Our analysis provides a new method of identifying the remaining segments for which either machine or human perform poorly. For example, in our close inspection of WMT19 English to German and German to English we discover the segments that disjointly proved a challenge for human and machine. For English to Russian, there were no segments included in our sample of translations that caused a significant challenge for the human translator, while we again identify the set of segments that caused issues for the top-performing system.
引用
收藏
页码:4199 / 4207
页数:9
相关论文
共 50 条
  • [21] Assessing the Discourse Factors that Influence the Quality of Machine Translation
    Li, Junyi Jessy
    Carpuat, Marine
    Nenkova, Ani
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2014, : 283 - 288
  • [22] Assessing Translation Quality of Hypotactic Structure for Chinese-to-English Machine Translation
    Feng, Wenhe
    Chen, Xiao'en
    Li, Nan
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT III, NLPCC 2024, 2025, 15361 : 213 - 226
  • [23] Evaluation of the Validity of Back-Translation as a Method of Assessing the Accuracy of Machine Translation
    Miyabe, Mai
    Yoshino, Takashi
    2015 INTERNATIONAL CONFERENCE ON CULTURE AND COMPUTING (CULTURE COMPUTING), 2015, : 145 - 150
  • [24] Document-Level Machine Translation as a Re-translation Process
    Martinez Garcia, Eva
    Espana-Bonet, Cristina
    Marquez, Lluis
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2014, (53): : 103 - 110
  • [25] Areas of undecidability in the machine and human translation
    Kadiu, Silvia
    META, 2016, 61 (01) : 204 - 220
  • [26] Dialogism and monologism in human and machine translation
    Greenall, AJK
    TRANSLATION STUDIES IN THE NEW MILLENNIUM, PROCEEDINGS, 2003, : 229 - 240
  • [27] Comparabilty of Corpora in Human and Machine Translation
    Lapshinova-Koltunski, Ekaterina
    Pal, Santanu
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014,
  • [28] Neural machine translation and human translation A political and ideological perspective
    Sheng, Anfeng
    Kong, Yankun
    BABEL-REVUE INTERNATIONALE DE LA TRADUCTION-INTERNATIONAL JOURNAL OF TRANSLATION, 2023, 69 (04): : 483 - 498
  • [29] Encouraging Lexical Translation Consistency for Document-Level Neural Machine Translation
    Lyu, Xinglin
    Li, Junhui
    Gong, Zhengxian
    Zhang, Min
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 3265 - 3277
  • [30] Enhancing Lexical Translation Consistency for Document-Level Neural Machine Translation
    Kang, Xiaomian
    Zhao, Yang
    Zhang, Jiajun
    Zong, Chengqing
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (03)