Unsupervised Neural Machine Translation with SMT as Posterior Regularization

Authors
Ren, Shuo [1 ]
Zhang, Zhirui [2 ]
Liu, Shujie [3 ]
Zhou, Ming [3 ]
Ma, Shuai [1 ]
Affiliations
[1] Beihang Univ, Beijing Adv Innovat Ctr Big Data & Brain Comp, SKLSDE Lab, Beijing, Peoples R China
[2] Univ Sci & Technol China, Hefei, Anhui, Peoples R China
[3] Microsoft Res Asia, Beijing, Peoples R China
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Without a real bilingual corpus available, unsupervised Neural Machine Translation (NMT) typically requires pseudo-parallel data generated with the back-translation method for model training. However, due to weak supervision, the pseudo data inevitably contain noise and errors that are accumulated and reinforced in the subsequent training process, leading to poor translation performance. To address this issue, we introduce phrase-based Statistical Machine Translation (SMT) models, which are robust to noisy data, as posterior regularization to guide the training of unsupervised NMT models in the iterative back-translation process. Our method starts from SMT models built with pre-trained language models and word-level translation tables inferred from cross-lingual embeddings. Then SMT and NMT models are optimized jointly and boost each other incrementally in a unified EM framework. In this way, (1) the negative effect caused by errors in the iterative back-translation process can be alleviated in a timely manner by SMT filtering noise from its phrase tables; meanwhile, (2) NMT can compensate for the deficiency of fluency inherent in SMT. Experiments conducted on en-fr and en-de translation tasks show that our method outperforms the strong baseline and achieves new state-of-the-art unsupervised machine translation performance.
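As a rough illustration of the initialization step mentioned in the abstract (word-level translation tables inferred from cross-lingual embeddings), the sketch below turns cosine similarities between source and target word vectors into translation probabilities with a softmax. The abstract does not specify the exact procedure, so the softmax form, the `temperature` value, the function names, and the toy two-dimensional vectors are all assumptions made for illustration only.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def translation_table(src_emb, tgt_emb, temperature=0.1):
    """Build {src_word: {tgt_word: p(tgt | src)}}.

    Assumption: probabilities are a softmax over cosine similarity in a
    shared embedding space; `temperature` is a hypothetical hyperparameter.
    """
    table = {}
    for src_word, u in src_emb.items():
        scores = {
            tgt_word: math.exp(cosine(u, v) / temperature)
            for tgt_word, v in tgt_emb.items()
        }
        total = sum(scores.values())
        table[src_word] = {t: s / total for t, s in scores.items()}
    return table


# Toy cross-lingual embeddings (made-up values, already in a shared space).
src_emb = {"cat": [1.0, 0.1], "dog": [0.1, 1.0]}
tgt_emb = {"chat": [0.9, 0.2], "chien": [0.2, 0.9]}

table = translation_table(src_emb, tgt_emb)

# Most probable target word for each source word.
best = {s: max(probs, key=probs.get) for s, probs in table.items()}
```

In a pipeline like the one the abstract describes, a table of this kind would only seed the initial SMT models; the later joint SMT/NMT refinement is not modeled here.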
Pages: 241-248
Page count: 8