Enhancing Machine Translation Quality Estimation via Fine-Grained Error Analysis and Large Language Model

被引:0
|
作者
Jung, Dahyun [1 ]
Park, Chanjun [2 ]
Eo, Sugyeong [1 ]
Lim, Heuiseok [1 ]
机构
[1] Korea Univ, Dept Comp Sci & Engn, Seoul 02841, South Korea
[2] Upstage, Yongin 16942, South Korea
基金
新加坡国家研究基金会;
关键词
natural language processing; quality estimation; fine-grained error span detection;
D O I
10.3390/math11194169
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Fine-grained error span detection is a sub-task within quality estimation that aims to identify and assess the spans and severity of errors present in translated sentences. In prior quality estimation, the focus has predominantly been on evaluating translations at the sentence and word levels. However, such an approach fails to recognize the severity of specific segments within translated sentences. To the best of our knowledge, this is the first study that concentrates on enhancing models for this fine-grained error span detection task in machine translation. This study introduces a framework that sequentially performs sentence-level error detection, word-level error span extraction, and severity assessment. We present a detailed analysis for each of the methodologies we propose, substantiating the effectiveness of our system, focusing on two language pairs: English-to-German and Chinese-to-English. Our results suggest that task granularity enhances performance and that a prompt-based fine-tuning approach can offer optimal performance in the classification tasks. Furthermore, we demonstrate that employing a large language model to edit the fine-tuned model's output constitutes a top strategy for achieving robust quality estimation performance.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] xcomet: Transparent Machine Translation Evaluation through Fine-grained Error Detection
    Guerreiro, Nuno M.
    Rei, Ricardo
    van Stigt, Daan
    Coheur, Luisa
    Colombo, Pierre
    Martins, Andre F. T.
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2024, 12 : 979 - 995
  • [2] Enhancing Gaze Estimation through Fine-Grained Analysis of Eye Region
    Sugiyama, Hideharu
    Watanabe, Hiroshi
    [J]. INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY, IWAIT 2024, 2024, 13164
  • [3] Fine-grained attention mechanism for neural machine translation
    Choi, Heeyoul
    Cho, Kyunghyun
    Bengio, Yoshua
    [J]. NEUROCOMPUTING, 2018, 284 : 171 - 176
  • [4] Automatic Reference-Free Fine-Grained Machine Translation Error Detection via Named Entity Recognition and Back-Translation
    Yan, Yiting
    Song, Jiaxin
    Fu, Biao
    Ye, Na
    Shi, Xiaodong
    [J]. ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT IV, ICIC 2024, 2024, 14878 : 306 - 317
  • [5] The Lazy Encoder: A Fine-Grained Analysis of the Role of Morphology in Neural Machine Translation
    Bisazza, Arianna
    Tump, Clara
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 2871 - 2876
  • [6] Fine-grained analysis of language varieties and demographics
    Rangel, Francisco
    Rosso, Paolo
    Zaghouani, Wajdi
    Charfi, Anis
    [J]. NATURAL LANGUAGE ENGINEERING, 2020, 26 (06) : 641 - 661
  • [7] Fine-grained Language Identification with Multilingual CapsNet Model
    Verma, Mudit
    Buduru, Arun Balaji
    [J]. 2020 IEEE SIXTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM 2020), 2020, : 94 - 102
  • [8] FGraDA: A Dataset and Benchmark for Fine-Grained Domain Adaptation in Machine Translation
    Zhu, Wenhao
    Huang, Shujian
    Pu, Tong
    Huang, Pingxuan
    Zhang, Xu
    Yu, Jian
    Chen, Wei
    Wang, Yanfeng
    Chen, Jiajun
    [J]. LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 6719 - 6727
  • [9] Fine-grained Image Classification via Combining Vision and Language
    He, Xiangteng
    Peng, Yuxin
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 7332 - 7340
  • [10] Fine-grained detoxification framework via instance-level prefixes for large language models
    Yi, Xin
    Wang, Linlin
    Wang, Xiaoling
    He, Liang
    [J]. Neurocomputing, 2025, 611