Prediction Difference Regularization against Perturbation for Neural Machine Translation

被引:0
|
作者
Guo, Dengji [1 ,2 ]
Ma, Zhengrui [1 ,2 ]
Zhang, Min [3 ]
Feng, Yang [1 ,2 ]
机构
[1] Chinese Acad Sci ICT CAS, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] Harbin Inst Technol, Shenzhen, Peoples R China
来源
PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS) | 2022年
基金
国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Regularization methods applying input perturbation have drawn considerable attention and have been frequently explored for NMT tasks in recent years. Despite their simplicity and effectiveness, we argue that these methods are limited by the under-fitting of training data. In this paper, we utilize prediction difference for ground-truth tokens to analyze the fitting of token-level samples and find that under-fitting is almost as common as over-fitting. We introduce prediction difference regularization (PD-R), a simple and effective method that can reduce over-fitting and under-fitting at the same time. For all token-level samples, PD-R minimizes the prediction difference between the original pass and the input-perturbed pass, making the model less sensitive to small input changes, thus more robust to both perturbations and under-fitted training data. Experiments on three widely used WMT translation tasks show that our approach can significantly improve over existing perturbation regularization methods. On WMT16 En-De task, our model achieves 1.80 SacreBLEU improvement over vanilla transformer.
引用
收藏
页码:7665 / 7675
页数:11
相关论文
共 50 条
  • [21] "Found in Translation": A deeper analysis of neural machine translation models for chemical reaction prediction
    Schwaller, Philippe
    Gaudin, Theophile
    Lanyi, David
    Bekas, Costas
    Laino, Teodoro
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 256
  • [22] Neural Machine Translation
    Jooste, Wandri
    Haque, Rejwanul
    Way, Andy
    MACHINE TRANSLATION, 2021, 35 (02) : 289 - 299
  • [23] Neural Machine Translation
    Birch, Alexandra
    NATURAL LANGUAGE ENGINEERING, 2021, 27 (03) : 377 - 378
  • [24] Neural Machine Translation Advised by Statistical Machine Translation
    Wang, Xing
    Lu, Zhengdong
    Tu, Zhaopeng
    Li, Hang
    Xiong, Deyi
    Zhang, Min
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3330 - 3336
  • [25] Neural Machine Translation as a Novel Approach to Machine Translation
    Benkova, Lucia
    Benko, Lubomir
    DIVAI 2020: 13TH INTERNATIONAL SCIENTIFIC CONFERENCE ON DISTANCE LEARNING IN APPLIED INFORMATICS, 2020, : 499 - 508
  • [26] Neural Name Translation Improves Neural Machine Translation
    Li, Xiaoqing
    Yan, Jinghui
    Zhang, Jiajun
    Zong, Chengqing
    MACHINE TRANSLATION, CWMT 2018, 2019, 954 : 93 - 100
  • [27] DaLC: Domain Adaptation Learning Curve Prediction for Neural Machine Translation
    Park, Cheonbok
    Kim, Hantae
    Calapodescu, Ioan
    Cho, Hyunchang
    Nikoulina, Vassilina
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 1789 - 1807
  • [28] The Event/Machine of Neural Machine Translation?
    Regnauld, Arnaud
    JOURNAL OF AESTHETICS AND PHENOMENOLOGY, 2022, 9 (02) : 141 - 154
  • [29] ProLanGO: Protein Function Prediction Using Neural Machine Translation Based on a Recurrent Neural Network
    Cao, Renzhi
    Freitas, Colton
    Chan, Leong
    Sun, Miao
    Jiang, Haiqing
    Chen, Zhangxin
    MOLECULES, 2017, 22 (10):
  • [30] Minimizing the Bag-of-Ngrams Difference for Non-Autoregressive Neural Machine Translation
    Shao, Chenze
    Zhang, Jinchao
    Feng, Yang
    Meng, Fandong
    Zhou, Jie
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 198 - 205