Unsupervised Domain Adaptation for Neural Machine Translation

被引:0
|
作者
Yang, Zhen [1 ,2 ]
Chen, Wei [1 ]
Wang, Feng [1 ]
Xu, Bo [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, 95 Zhongguancun East Rd, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Impressive neural machine translation (NMT) results are achieved in domains with large-scale, high quality bilingual training corpora. However, transferring to a target domain with significant domain shifts but no bilingual training corpora remains largely unexplored. To address the aforementioned setting of unsupervised domain adaptation, we propose a novel adversarial training procedure for NMT to leverage the widespread monolingual data in target domain. Two discriminative networks, namely the domain discriminator and pair discriminator, are introduced to guide the translation model. The domain discriminator evaluates whether the sentences generated by the translation model are indistinguishable from the ones in target domain. The pair discriminator assesses whether the generated sentences are paired with the source-side sentences. The translation model acts as an adversary to the two discriminators, which aims to generate sentences uneasily discriminated by the discriminators. We tested our approach on Chinese-English and English-German translation tasks. Experimental results show that our approaches achieve great success in unsupervised domain adaptation for NMT.
引用
收藏
页码:338 / 343
页数:6
相关论文
共 50 条
  • [21] DaLC: Domain Adaptation Learning Curve Prediction for Neural Machine Translation
    Park, Cheonbok
    Kim, Hantae
    Calapodescu, Ioan
    Cho, Hyunchang
    Nikoulina, Vassilina
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 1789 - 1807
  • [22] Improving Document-Level Neural Machine Translation with Domain Adaptation
    Ul Haq, Sami
    Rauf, Sadaf Abdul
    Shoukat, Arslan
    Noor-e-Hira
    [J]. NEURAL GENERATION AND TRANSLATION, 2020, : 225 - 231
  • [23] Overcoming Catastrophic Forgetting During Domain Adaptation of Neural Machine Translation
    Thompson, Brian
    Gwinnup, Jeremy
    Khayrallah, Huda
    Duh, Kevin
    Koehn, Philipp
    [J]. 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 2062 - 2068
  • [24] Domain Adaptation for Statistical Machine Translation
    Wang, Xiaoxue
    Zhu, Conghui
    Li, Sheng
    Zhao, Tiejun
    Zheng, Dequan
    [J]. 2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 1652 - 1658
  • [25] Efficient Machine Translation Domain Adaptation
    Martins, Pedro Henrique
    Marinhe, Zita
    Martins, Andre F. T.
    [J]. PROCEEDINGS OF THE 1ST WORKSHOP ON SEMIPARAMETRIC METHODS IN NLP: DECOUPLING LOGIC FROM KNOWLEDGE (SPA-NLP 2022), 2022, : 23 - 29
  • [26] Unsupervised Neural Machine Translation with Universal Grammar
    Li, Zuchao
    Utiyama, Masao
    Sumita, Eiichiro
    Zhao, Hai
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 3249 - 3264
  • [27] Unsupervised Quality Estimation for Neural Machine Translation
    Fomicheva, Marina
    Sun, Shuo
    Yankovskaya, Lisa
    Blain, Frederic
    Guzman, Francisco
    Fishel, Mark
    Aletras, Nikolaos
    Chaudhary, Vishrav
    Specia, Lucia
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2020, 8 : 539 - 555
  • [28] Deep Learning for Unsupervised Neural Machine Translation
    Yu, Kuai
    [J]. 2021 2ND INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2021), 2021, : 614 - 617
  • [29] Unsupervised Neural Machine Translation with Weight Sharing
    Yang, Zhen
    Chen, Wei
    Wang, Feng
    Xu, Bo
    [J]. PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 46 - 55
  • [30] Regularized Training Objective for Continued Training for Domain Adaptation in Neural Machine Translation
    Khayrallah, Huda
    Thompson, Brian
    Duh, Kevin
    Koehn, Philipp
    [J]. NEURAL MACHINE TRANSLATION AND GENERATION, 2018, : 36 - 44