Unsupervised Domain Adaptation for Neural Machine Translation

Authors
Yang, Zhen [1 ,2 ]
Chen, Wei [1 ]
Wang, Feng [1 ]
Xu, Bo [1 ]
Affiliations
[1] Chinese Acad Sci, Inst Automat, 95 Zhongguancun East Rd, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Impressive neural machine translation (NMT) results are achieved in domains with large-scale, high-quality bilingual training corpora. However, transferring to a target domain with significant domain shift but no bilingual training corpora remains largely unexplored. To address this unsupervised domain adaptation setting, we propose a novel adversarial training procedure for NMT that leverages the widely available monolingual data in the target domain. Two discriminative networks, the domain discriminator and the pair discriminator, are introduced to guide the translation model. The domain discriminator evaluates whether the sentences generated by the translation model are indistinguishable from those in the target domain. The pair discriminator assesses whether the generated sentences are paired with the source-side sentences. The translation model acts as an adversary to the two discriminators, aiming to generate sentences that the discriminators cannot easily tell apart from real ones. We tested our approach on Chinese-English and English-German translation tasks. Experimental results show that our approach is effective for unsupervised domain adaptation in NMT.
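The abstract does not give the loss functions, so the following is only a minimal sketch of how two discriminator signals might be combined, assuming a standard binary cross-entropy GAN objective. The function names, the weights `domain_weight` and `pair_weight`, and the use of scalar probabilities in place of real network outputs are all illustrative assumptions, not the authors' implementation.

```python
import math


def bce(prob, label):
    """Binary cross-entropy for one predicted probability and a 0/1 label."""
    eps = 1e-12
    prob = min(max(prob, eps), 1.0 - eps)  # clamp to avoid log(0)
    return -(label * math.log(prob) + (1 - label) * math.log(1.0 - prob))


def discriminator_loss(p_real, p_generated):
    """Loss for either discriminator (hypothetical form).

    p_real:      probability assigned to a real example
                 (a target-domain sentence, or a true sentence pair).
    p_generated: probability assigned to a translator output.
    The discriminator wants real -> 1 and generated -> 0.
    """
    return bce(p_real, 1) + bce(p_generated, 0)


def translator_adversarial_loss(p_domain, p_pair,
                                domain_weight=1.0, pair_weight=1.0):
    """Adversarial loss for the translation model (hypothetical form).

    p_domain: domain discriminator's probability that the generated
              sentence comes from the target domain.
    p_pair:   pair discriminator's probability that the generated
              sentence is paired with the source sentence.
    The translator wants both discriminators to output 1.
    """
    return domain_weight * bce(p_domain, 1) + pair_weight * bce(p_pair, 1)


# A translation that fools both discriminators gets a lower loss
# than one that is flagged as out-of-domain.
fooling = translator_adversarial_loss(p_domain=0.9, p_pair=0.9)
flagged = translator_adversarial_loss(p_domain=0.1, p_pair=0.9)
```

In a real system the two probabilities would come from trained networks, and because sampled translations are discrete, the translator update would need a gradient estimator that this sketch does not cover.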
Pages: 338 - 343
Page count: 6