Uncertainty-Aware Semantic Augmentation for Neural Machine Translation

Cited by: 0
Authors
Wei, Xiangpeng [1, 2]
Yu, Heng [3]
Hu, Yue [1, 2]
Weng, Rongxiang [3]
Xing, Luxi [1, 2]
Luo, Weihua [3]
Affiliations
[1] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China
[3] Alibaba Grp, Machine Intelligence Technol Lab, Hangzhou, Peoples R China
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
As a sequence-to-sequence generation task, neural machine translation (NMT) naturally contains intrinsic uncertainty, where a single sentence in one language has multiple valid counterparts in the other. However, the dominant methods for NMT only observe one of them from the parallel corpora for the model training but have to deal with adequate variations under the same meaning at inference. This leads to a discrepancy of the data distribution between the training and the inference phases. To address this problem, we propose uncertainty-aware semantic augmentation, which explicitly captures the universal semantic information among multiple semantically-equivalent source sentences and enhances the hidden representations with this information for better translations. Extensive experiments on various translation tasks reveal that our approach significantly outperforms the strong baselines and the existing methods.
Pages: 2724-2735
Page count: 12
Related papers
50 in total
  • [1] Uncertainty-aware non-autoregressive neural machine translation
    Liu, Chuanming
    Yu, Jingqi
    COMPUTER SPEECH AND LANGUAGE, 2023, 78
  • [2] Uncertainty-Aware Machine Translation Evaluation
    Glushkova, Taisiya
    Zerva, Chrysoula
    Rei, Ricardo
    Martins, Andre F. T.
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 3920 - 3938
  • [3] Uncertainty-Aware Balancing for Multilingual and Multi-Domain Neural Machine Translation Training
    Wu, Minghao
    Li, Yitong
    Zhang, Meng
    Li, Liangyou
    Haffari, Gholamreza
    Liu, Qun
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 7291 - 7305
  • [4] Uncertainty-Aware Data Augmentation for Food Recognition
    Aguilar, Eduardo
    Nagarajan, Bhalaji
    Khantun, Rupali
    Bolanos, Marc
    Radeva, Petia
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 4017 - 4024
  • [5] Uncertainty-aware Binary Neural Networks
    Zhao, Junhe
    Yang, Linlin
    Zhang, Baochang
    Guo, Guodong
    Doermann, David
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 3441 - 3447
  • [6] Syntax-Aware Data Augmentation for Neural Machine Translation
    Duan, Sufeng
    Zhao, Hai
    Zhang, Dongdong
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 2988 - 2999
  • [7] Uncertainty-Aware Data Augmentation for Offline Reinforcement Learning
    Su, Yunjie
    Kong, Yilun
    Wang, Xueqian
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [8] Robust Tracking via Uncertainty-Aware Semantic Consistency
    Ma, Jie
    Lan, Xiangyuan
    Zhong, Bineng
    Li, Guorong
    Tang, Zhenjun
    Li, Xianxian
    Ji, Rongrong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (04) : 1740 - 1751
  • [9] Uncertainty-aware automated machine learning toolbox
    Dorst, Tanja
    Schneider, Tizian
    Eichstaedt, Sascha
    Schuetze, Andreas
    TM-TECHNISCHES MESSEN, 2023, 90 (03) : 141 - 153
  • [10] Uncertainty-Aware Semantic Guidance and Estimation for Image Inpainting
    Liao, Liang
    Xiao, Jing
    Wang, Zheng
    Lin, Chia-Wen
    Satoh, Shin'ichi
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2021, 15 (02) : 310 - 323