Unsupervised Domain Adaptation for Neural Machine Translation

Cited by: 0
Authors
Yang, Zhen [1 ,2 ]
Chen, Wei [1 ]
Wang, Feng [1 ]
Xu, Bo [1 ]
Affiliations
[1] Chinese Acad Sci, Inst Automat, 95 Zhongguancun East Rd, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Neural machine translation (NMT) achieves impressive results in domains with large-scale, high-quality bilingual training corpora. However, transferring to a target domain with significant domain shift but no bilingual training corpora remains largely unexplored. To address this setting of unsupervised domain adaptation, we propose a novel adversarial training procedure for NMT that leverages the widely available monolingual data in the target domain. Two discriminative networks, the domain discriminator and the pair discriminator, are introduced to guide the translation model. The domain discriminator evaluates whether the sentences generated by the translation model are indistinguishable from those in the target domain. The pair discriminator assesses whether the generated sentences are paired with the source-side sentences. The translation model acts as an adversary to the two discriminators, aiming to generate sentences that are hard for the discriminators to discriminate. We tested our approach on Chinese-English and English-German translation tasks. Experimental results show that our approach is effective for unsupervised domain adaptation in NMT.
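As a rough illustration of the adversarial setup described in the abstract, the Python sketch below computes toy scalar objectives for one generated sentence. The binary cross-entropy form, the function names, and the weights w_domain and w_pair are assumptions made for illustration; the abstract does not state the actual loss functions or how gradients are passed through the discrete translation output.

```python
import math


def bce(prob, label):
    """Binary cross-entropy for a single probability and a 0/1 label."""
    eps = 1e-12
    prob = min(max(prob, eps), 1.0 - eps)  # clamp to avoid log(0)
    return -(label * math.log(prob) + (1 - label) * math.log(1.0 - prob))


def discriminator_loss(p_real, p_generated):
    """A discriminator is trained to score real samples as 1
    and samples from the translation model as 0."""
    return bce(p_real, 1) + bce(p_generated, 0)


def translator_loss(p_domain, p_pair, w_domain=1.0, w_pair=1.0):
    """The translation model is trained against both discriminators:
    it wants each of them to score its output as 1.

    p_domain: domain discriminator's probability that the generated
              sentence comes from the target domain (hypothetical input).
    p_pair:   pair discriminator's probability that the generated
              sentence is paired with the source sentence (hypothetical input).
    w_domain, w_pair: assumed weighting of the two terms.
    """
    return w_domain * bce(p_domain, 1) + w_pair * bce(p_pair, 1)


if __name__ == "__main__":
    # A translation that fools both discriminators gets a lower loss
    # than one that the domain discriminator easily rejects.
    print(translator_loss(0.9, 0.9))
    print(translator_loss(0.1, 0.9))
```

In this sketch the two discriminators pull in complementary directions: the domain term rewards output that looks like target-domain text, while the pair term penalizes output that drifts away from the meaning of the source sentence.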
Pages: 338 - 343
Page count: 6
Related papers
50 entries in total
  • [41] Phrase-Based & Neural Unsupervised Machine Translation
    Lample, Guillaume
    Ott, Myle
    Conneau, Alexis
    Denoyer, Ludovic
    Ranzato, Marc'Aurelio
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 5039 - 5049
  • [42] Unsupervised neural domain adaptation for document image binarization
    Castellanos, Francisco J.
    Gallego, Antonio-Javier
    Calvo-Zaragoza, Jorge
    PATTERN RECOGNITION, 2021, 119
  • [43] Domain Bridge for Unpaired Image-to-Image Translation and Unsupervised Domain Adaptation
    Pizzati, Fabio
    de Charette, Raoul
    Zaccaria, Michela
    Cerri, Pietro
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 2979 - 2987
  • [44] Simple, Scalable Adaptation for Neural Machine Translation
    Bapna, Ankur
    Firat, Orhan
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 1538 - 1548
  • [45] Extreme Adaptation for Personalized Neural Machine Translation
    Michel, Paul
    Neubig, Graham
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2018, : 312 - 318
  • [46] A General Framework for Adaptation of Neural Machine Translation to Simultaneous Translation
    Chen, Yun
    Li, Liangyou
    Jiang, Xin
    Chen, Xiao
    Liu, Qun
    1ST CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 10TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (AACL-IJCNLP 2020), 2020, : 191 - 200
  • [47] Effective domain awareness and adaptation approach via mask substructure for multi-domain neural machine translation
    Huang, Shuanghong
    Guo, Junjun
    Yu, Zhengtao
    Wen, Yonghua
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (19): : 14047 - 14060
  • [48] Effective domain awareness and adaptation approach via mask substructure for multi-domain neural machine translation
    Shuanghong Huang
    Junjun Guo
    Zhengtao Yu
    Yonghua Wen
    Neural Computing and Applications, 2023, 35 : 14047 - 14060
  • [49] Extract and Edit: An Alternative to Back-Translation for Unsupervised Neural Machine Translation
    Wu, Jiawei
    Wang, Xin
    Wang, William Yang
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 1173 - 1183
  • [50] Importance Weighted Import Vector Machine for Unsupervised Domain Adaptation
    Khalighi, Sirvan
    Ribeiro, Bernardete
    Nunes, Urbano J.
    IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (10) : 3280 - 3292