Optimal Transport for Unsupervised Hallucination Detection in Neural Machine Translation

被引:0
|
作者
Guerreiro, Nuno M. [1 ,2 ,3 ]
Colombo, Pierre [5 ]
Piantanida, Pablo [6 ]
Martins, Andre F. T. [1 ,2 ,3 ,4 ]
机构
[1] Inst Telecomun, Lisbon, Portugal
[2] Univ Lisbon, Inst Super Tecn, Lisbon, Portugal
[3] Univ Lisbon, LUMLIS, Lisbon ELLIS Unit, Lisbon, Portugal
[4] Unbabel, Lisbon, Portugal
[5] Univ Paris Saclay, CentraleSupelec, MICS, Paris, France
[6] CNRS, CentraleSupelec, ILLS, Paris, France
基金
欧洲研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural machine translation (NMT) has become the de-facto standard in real-world machine translation applications. However, NMT models can unpredictably produce severely pathological translations, known as hallucinations, that seriously undermine user trust. It becomes thus crucial to implement effective preventive strategies to guarantee their proper functioning. In this paper, we address the problem of hallucination detection in NMT by following a simple intuition: as hallucinations are detached from the source content, they exhibit cross-attention patterns that are statistically different from those of good quality translations. We frame this problem with an optimal transport formulation and propose a fully unsupervised, plug-in detector that can be used with any attention-based NMT model. Experimental results show that our detector not only outperforms all previous model-based detectors, but is also competitive with detectors that employ external models trained on millions of samples for related tasks such as quality estimation and cross-lingual sentence similarity.
引用
收藏
页码:13766 / 13784
页数:19
相关论文
共 50 条
  • [1] Unsupervised dialectal neural machine translation
    Farhan, Wael
    Talafha, Bashar
    Abuammar, Analle
    Jaikat, Ruba
    Al-Ayyoub, Mahmoud
    Tarakji, Ahmad Bisher
    Toma, Anas
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (03)
  • [2] On Exposure Bias, Hallucination and Domain Shift in Neural Machine Translation
    Wang, Chaojun
    Sennrich, Rico
    [J]. 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 3544 - 3552
  • [3] Unsupervised Domain Adaptation for Neural Machine Translation
    Yang, Zhen
    Chen, Wei
    Wang, Feng
    Xu, Bo
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 338 - 343
  • [4] Unsupervised Neural Machine Translation with Universal Grammar
    Li, Zuchao
    Utiyama, Masao
    Sumita, Eiichiro
    Zhao, Hai
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 3249 - 3264
  • [5] Unsupervised Quality Estimation for Neural Machine Translation
    Fomicheva, Marina
    Sun, Shuo
    Yankovskaya, Lisa
    Blain, Frederic
    Guzman, Francisco
    Fishel, Mark
    Aletras, Nikolaos
    Chaudhary, Vishrav
    Specia, Lucia
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2020, 8 : 539 - 555
  • [6] Deep Learning for Unsupervised Neural Machine Translation
    Yu, Kuai
    [J]. 2021 2ND INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2021), 2021, : 614 - 617
  • [7] Unsupervised Neural Machine Translation with Weight Sharing
    Yang, Zhen
    Chen, Wei
    Wang, Feng
    Xu, Bo
    [J]. PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 46 - 55
  • [8] Unsupervised Bilingual Word Embedding Agreement for Unsupervised Neural Machine Translation
    Sun, Haipeng
    Wang, Rui
    Chen, Kehai
    Utiyama, Masao
    Sumita, Eiichiro
    Zhao, Tiejun
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 1235 - 1245
  • [9] Knowledge Distillation for Multilingual Unsupervised Neural Machine Translation
    Sun, Haipeng
    Wang, Rui
    Chen, Kehai
    Utiyama, Masao
    Sumita, Eiichiro
    Zhao, Tiejun
    [J]. 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 3525 - 3535
  • [10] Unsupervised Extraction of Partial Translations for Neural Machine Translation
    Marie, Benjamin
    Fujita, Atsushi
    [J]. 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 3834 - 3844