ReWE: Regressing Word Embeddings for Regularization of Neural Machine Translation Systems

Citations: 0
Authors
Unanue, Inigo Jauregi [1, 2]
Borzeshi, Ehsan Zare [2]
Esmaili, Nazanin [2]
Piccardi, Massimo [1]
Affiliations
[1] Univ Technol Sydney, Sydney, NSW, Australia
[2] Capital Markets Cooperat Res Ctr, Sydney, NSW, Australia
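Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract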
Regularization of neural machine translation is still a significant problem, especially in low-resource settings. To mitigate this problem, we propose regressing word embeddings (ReWE) as a new regularization technique in a system that is jointly trained to predict the next word in the translation (categorical value) and its word embedding (continuous value). Such joint training allows the proposed system to learn the distributional properties represented by the word embeddings, empirically improving the generalization to unseen sentences. Experiments over three translation datasets have shown a consistent improvement over a strong baseline, ranging between 0.91 and 2.54 BLEU points, and also a marked improvement over a state-of-the-art system.
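The joint objective described in the abstract can be illustrated with a minimal sketch. The abstract does not state which regression loss is used or how the two terms are weighted, so the cosine distance, the `weight` parameter, and all function names below are assumptions made for illustration only, not the authors' implementation.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of scores."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(logits, target_index):
    """Categorical term: negative log-likelihood of the reference next word."""
    return -math.log(softmax(logits)[target_index])

def cosine_distance(u, v):
    """Continuous term (assumed form): 1 - cosine similarity of two vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)

def joint_loss(logits, predicted_embedding, target_index, embedding_table, weight=1.0):
    """Word-prediction loss plus a weighted embedding-regression loss.

    `weight` is a hypothetical trade-off hyperparameter, and
    `embedding_table` stands in for pre-trained word embeddings.
    """
    nll = cross_entropy(logits, target_index)
    reg = cosine_distance(predicted_embedding, embedding_table[target_index])
    return nll + weight * reg

# Toy example: a 3-word vocabulary with 2-dimensional embeddings.
embedding_table = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
logits = [2.0, 0.5, 0.1]
loss_close = joint_loss(logits, [0.9, 0.1], 0, embedding_table)
loss_far = joint_loss(logits, [0.0, 1.0], 0, embedding_table)
```

With identical logits, the regression term is the only part that differs: a predicted vector far from the reference word's embedding (`loss_far`) is penalized more than one close to it (`loss_close`), which is the sense in which the second term acts as a regularizer in this sketch.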
Pages: 430-436
Page count: 7