Understanding the Properties of Minimum Bayes Risk Decoding in Neural Machine Translation

Cited by: 0

Authors:

Mueller, Mathias [1]

Sennrich, Rico [1,2]

Affiliations:

[1] Univ Zurich, Dept Computat Linguist, Zurich, Switzerland

[2] Univ Edinburgh, Sch Informat, Edinburgh, Midlothian, Scotland

Source:

59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021) | 2021

Funding:

Swiss National Science Foundation;

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Neural Machine Translation (NMT) currently exhibits biases such as producing translations that are too short and overgenerating frequent words, and shows poor robustness to copy noise in training data or domain shift. Recent work has tied these shortcomings to beam search (the de facto standard inference algorithm in NMT), and Eikema and Aziz (2020) propose to use Minimum Bayes Risk (MBR) decoding on unbiased samples instead. In this paper, we empirically investigate the properties of MBR decoding on a number of previously reported biases and failure cases of beam search. We find that MBR still exhibits a length and token frequency bias, owing to the MT metrics used as utility functions, but that MBR also increases robustness against copy noise in the training data and domain shift.
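
As an illustration of the idea named in the abstract, the following is a minimal sketch of sampling-based MBR decoding. It is not the authors' implementation. It assumes the samples have already been drawn from a translation model, and it uses a toy unigram F1 score (the hypothetical `unigram_f1` function) as a stand-in for the MT metrics that the abstract says serve as utility functions. The name `mbr_decode` is also an assumption made for this example.

```python
from collections import Counter
from typing import Callable, List


def unigram_f1(hypothesis: str, reference: str) -> float:
    """Toy utility (assumption): unigram F1 between whitespace-tokenized strings."""
    hyp = Counter(hypothesis.split())
    ref = Counter(reference.split())
    overlap = sum((hyp & ref).values())  # clipped count of shared tokens
    if overlap == 0:
        return 0.0
    precision = overlap / sum(hyp.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


def mbr_decode(
    samples: List[str],
    utility: Callable[[str, str], float] = unigram_f1,
) -> str:
    """Return the sample with the highest average utility against all samples.

    The same pool serves as candidate set and as pseudo-references, so the
    average approximates the expected utility under the model distribution.
    """
    if not samples:
        raise ValueError("samples must not be empty")

    def expected_utility(candidate: str) -> float:
        return sum(utility(candidate, other) for other in samples) / len(samples)

    return max(samples, key=expected_utility)


if __name__ == "__main__":
    pool = ["the cat sat", "the cat sat down", "a dog barked", "the cat sat"]
    print(mbr_decode(pool))
```

Because the selected output depends entirely on the utility function, any preference built into that function (for example, toward certain lengths or frequent tokens) carries over to the chosen translation, which is consistent with the abstract's finding that the remaining biases stem from the metrics used as utilities.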


Pages: 259-272

Page count: 14

