A Comprehensive Survey on Various Fully Automatic Machine Translation Evaluation Metrics

被引:13
|
作者
Chauhan, Shweta [1 ]
Daniel, Philemon [1 ]
机构
[1] Natl Inst Technol, Dept Elect & Commun, Hamirpur 177005, Himachal Prades, India
关键词
Machine translation evaluation; Machine translation; Automated metrics; Metrics; SYSTEMS;
D O I
10.1007/s11063-022-10835-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The fast advancement in machine translation models necessitates the development of accurate evaluation metrics that would allow researchers to track the progress in text languages. The evaluation of machine translation models is crucial since its results are exploited for improvements of translation models. However fully automatically evaluating the machine translation models in itself is a huge challenge for the researchers as human evaluation is very expensive, time-consuming, unreproducible. This paper presents a detailed classification and comprehensive survey on various fully automated evaluation metrics, which are used to assess the performance or quality of machine translated output. Various fully automatic evaluation metrics are classified into five categories that are lexical, character, semantic, syntactic, and semantic & syntactic evaluation metrics for better understanding purpose. Taking account of the challenges posed in the field of machine translation evaluation by Statistical Machine Translation and Neural Machine Translation, along with a discussion on the advantages, disadvantages, and gaps for each fully automatic machine translation evaluation metric has been provided. The presented study will help machine translation researchers in quickly identifying automatic machine translation evaluation metrics that are most appropriate for the improvement or development of their machine translation model, as well as researchers in gaining a general understanding of how automatic machine translation evaluation research evolved.
引用
收藏
页码:12663 / 12717
页数:55
相关论文
共 50 条
  • [1] A Comprehensive Survey on Various Fully Automatic Machine Translation Evaluation Metrics
    Shweta Chauhan
    Philemon Daniel
    Neural Processing Letters, 2023, 55 : 12663 - 12717
  • [2] A Survey on Evaluation Metrics for Machine Translation
    Lee, Seungjun
    Lee, Jungseob
    Moon, Hyeonseok
    Park, Chanjun
    Seo, Jaehyung
    Eo, Sugyeong
    Koo, Seonmin
    Lim, Heuiseok
    MATHEMATICS, 2023, 11 (04)
  • [3] Automatic Metrics for Machine Translation Evaluation and Minority Languages
    Munkova, Dasa
    Munk, Michal
    PROCEEDINGS OF THE MEDITERRANEAN CONFERENCE ON INFORMATION & COMMUNICATION TECHNOLOGIES 2015 (MEDCT 2015), VOL 2, 2016, 381 : 631 - 636
  • [4] Significance tests of automatic machine translation evaluation metrics
    Zhang, Ying
    Vogel, Stephan
    MACHINE TRANSLATION, 2010, 24 (01) : 51 - 65
  • [5] A comprehensive understanding of popular machine translation evaluation metrics
    Islam, Md Adnanul
    Mukta, Md Saddam Hossain
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2022, 25 (05) : 467 - 478
  • [6] UScore: An Effective Approach to Fully Unsupervised Evaluation Metrics for Machine Translation
    Belouadi, Jonas
    Eger, Steffen
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 358 - 374
  • [7] Detecting errors in machine translation using residuals and metrics of automatic evaluation
    Munk, Michal
    Munkova, Dasa
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 34 (05) : 3211 - 3223
  • [8] Automatic Meta-evaluation of Low-Resource Machine Translation Evaluation Metrics
    Yu, Junting
    Liu, Wuying
    He, Hongye
    Wang, Lin
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2019, : 136 - 141
  • [9] Extrinsic Evaluation of Machine Translation Metrics
    Moghe, Nikita
    Sherborne, Tom
    Steedman, Mark
    Birch, Alexandra
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 13060 - 13078
  • [10] Towards Explainable Evaluation Metrics for Machine Translation
    Leiter, Christoph
    Lertvittayakumjorn, Piyawat
    Fomicheva, Marina
    Zhao, Wei
    Gao, Yang
    Eger, Steffen
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25