Test-time Adaptation for Machine Translation Evaluation by Uncertainty Minimization

被引:0
|
作者
Zhan, Runzhe [1 ]
Liu, Xuebo [2 ]
Wong, Derek F. [2 ]
Zhang, Cuilian [1 ]
Chao, Lidia S. [1 ]
Zhang, Min
机构
[1] Univ Macau, Dept Comp & Informat Sci, NLP2CT Lab, Taipa, Macau, Peoples R China
[2] Harbin Inst Technol, Inst Comp & Intelligence, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The neural metrics recently received considerable attention from the research community in the automatic evaluation of machine translation. Unlike text-based metrics that have interpretable and consistent evaluation mechanisms for various data sources, the reliability of neural metrics in assessing out-of-distribution data remains a concern due to the disparity between training data and real-world data. This paper aims to address the inference bias of neural metrics through uncertainty minimization during test time, without requiring additional data. Our proposed method comprises three steps: uncertainty estimation, test-time adaptation, and inference. Specifically, the model employs the prediction uncertainty of the current data as a signal to update a small fraction of parameters during test time and subsequently refine the prediction through optimization. To validate our approach, we apply the proposed method to three representative models and conduct experiments on the WMT21 benchmarks. The results obtained from both in-domain and out-of-distribution evaluations consistently demonstrate improvements in correlation performance across different models. Furthermore, we provide evidence that the proposed method effectively reduces model uncertainty. The code is publicly available at https://github.com/NLP2CT/TaU.
引用
收藏
页码:807 / 820
页数:14
相关论文
共 50 条
  • [1] Test-Time Poisoning Attacks Against Test-Time Adaptation Models
    Cong, Tianshuo
    He, Xinlei
    Shen, Yun
    Zhang, Yang
    45TH IEEE SYMPOSIUM ON SECURITY AND PRIVACY, SP 2024, 2024, : 1306 - 1324
  • [2] Contrastive Test-Time Adaptation
    Chen, Dian
    Wang, Dequan
    Darrell, Trevor
    Ibrahimi, Sayna
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 295 - 305
  • [3] Online Subloop Search via Uncertainty Quantization for Efficient Test-Time Adaptation
    Lee, Jae-Hong
    Lee, Sang-Eon
    Kim, Dong-Hyun
    Kim, DoHee
    Chang, Joon-Hyuk
    INTERSPEECH 2024, 2024, : 2880 - 2884
  • [4] MULTI-STEP TEST-TIME ADAPTATION WITH ENTROPY MINIMIZATION AND PSEUDO-LABELING
    Kingetsu, Hiroaki
    Kobayashi, Kenichi
    Okawa, Yoshihiro
    Yokota, Yasuto
    Nakazawa, Katsuhito
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 4153 - 4157
  • [5] Train/Test-Time Adaptation with Retrieval
    Zancato, Luca
    Achille, Alessandro
    Liu, Tian Yu
    Trager, Matthew
    Perera, Pramuditha
    Soatto, Stefano
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 15911 - 15921
  • [6] TEA: Test-time Energy Adaptation
    Yuan, Yige
    Xu, Bingbing
    Hou, Liang
    Sun, Fei
    Shen, Huawei
    Cheng, Xueqi
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 23901 - 23911
  • [7] Continual Test-Time Domain Adaptation
    Wang, Qin
    Fink, Olga
    Van Gool, Luc
    Dai, Dengxin
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 7191 - 7201
  • [8] Calibrated Diverse Ensemble Entropy Minimization for Robust Test-Time Adaptation in Prostate Cancer Detection
    Gilany, Mandi
    Harmanani, Mohamed
    Wilson, Paul
    To, Minh Nguyen Nhat
    Jamzad, Amoon
    Fooladgar, Fahimeh
    Wodlinger, Brian
    Abolmaesumi, Purang
    Mousavi, Parvin
    MACHINE LEARNING IN MEDICAL IMAGING, PT I, MLMI 2024, 2025, 15241 : 361 - 371
  • [9] Towards Open-Set Test-Time Adaptation Utilizing theWisdom of Crowds in Entropy Minimization
    Lee, Jungsoo
    Das, Debasmit
    Choo, Jaegul
    Choi, Sungha
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 16334 - 16343
  • [10] Robust gradient aware and reliable entropy minimization for stable test-time adaptation in dynamic scenarios
    Xiong, Haoyu
    Xiang, Yu
    VISUAL COMPUTER, 2025, 41 (01): : 315 - 330