A Comprehensive Survey on Various Fully Automatic Machine Translation Evaluation Metrics

被引：13

作者：

Chauhan, Shweta ^{[1
]}

Daniel, Philemon ^{[1
]}

机构：

[1] Natl Inst Technol, Dept Elect & Commun, Hamirpur 177005, Himachal Prades, India

来源：

NEURAL PROCESSING LETTERS | 2023年 / 55卷 / 09期

关键词：

Machine translation evaluation; Machine translation; Automated metrics; Metrics; SYSTEMS;

D O I：

10.1007/s11063-022-10835-4

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The fast advancement in machine translation models necessitates the development of accurate evaluation metrics that would allow researchers to track the progress in text languages. The evaluation of machine translation models is crucial since its results are exploited for improvements of translation models. However fully automatically evaluating the machine translation models in itself is a huge challenge for the researchers as human evaluation is very expensive, time-consuming, unreproducible. This paper presents a detailed classification and comprehensive survey on various fully automated evaluation metrics, which are used to assess the performance or quality of machine translated output. Various fully automatic evaluation metrics are classified into five categories that are lexical, character, semantic, syntactic, and semantic & syntactic evaluation metrics for better understanding purpose. Taking account of the challenges posed in the field of machine translation evaluation by Statistical Machine Translation and Neural Machine Translation, along with a discussion on the advantages, disadvantages, and gaps for each fully automatic machine translation evaluation metric has been provided. The presented study will help machine translation researchers in quickly identifying automatic machine translation evaluation metrics that are most appropriate for the improvement or development of their machine translation model, as well as researchers in gaining a general understanding of how automatic machine translation evaluation research evolved.

引用

页码：12663 / 12717

页数：55

共 50 条

[1] A Comprehensive Survey on Various Fully Automatic Machine Translation Evaluation Metrics
Shweta Chauhan
Philemon Daniel
Neural Processing Letters, 2023, 55 : 12663 - 12717
[2] A Survey on Evaluation Metrics for Machine Translation
Lee, Seungjun
Lee, Jungseob
Moon, Hyeonseok
Park, Chanjun
Seo, Jaehyung
Eo, Sugyeong
Koo, Seonmin
Lim, Heuiseok
MATHEMATICS, 2023, 11 (04)
[3] Automatic Metrics for Machine Translation Evaluation and Minority Languages
Munkova, Dasa
Munk, Michal
PROCEEDINGS OF THE MEDITERRANEAN CONFERENCE ON INFORMATION & COMMUNICATION TECHNOLOGIES 2015 (MEDCT 2015), VOL 2, 2016, 381 : 631 - 636
[4] Significance tests of automatic machine translation evaluation metrics
Zhang, Ying
Vogel, Stephan
MACHINE TRANSLATION, 2010, 24 (01) : 51 - 65
[5] A comprehensive understanding of popular machine translation evaluation metrics
Islam, Md Adnanul
Mukta, Md Saddam Hossain
INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2022, 25 (05) : 467 - 478
[6] UScore: An Effective Approach to Fully Unsupervised Evaluation Metrics for Machine Translation
Belouadi, Jonas
Eger, Steffen
17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 358 - 374
[7] Detecting errors in machine translation using residuals and metrics of automatic evaluation
Munk, Michal
Munkova, Dasa
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 34 (05) : 3211 - 3223
[8] Automatic Meta-evaluation of Low-Resource Machine Translation Evaluation Metrics
Yu, Junting
Liu, Wuying
He, Hongye
Wang, Lin
PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2019, : 136 - 141
[9] Extrinsic Evaluation of Machine Translation Metrics
Moghe, Nikita
Sherborne, Tom
Steedman, Mark
Birch, Alexandra
PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 13060 - 13078
[10] Towards Explainable Evaluation Metrics for Machine Translation
Leiter, Christoph
Lertvittayakumjorn, Piyawat
Fomicheva, Marina
Zhao, Wei
Gao, Yang
Eger, Steffen
JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25

← 1 2 3 4 5 →