Unsupervised Domain Adaptation for Neural Machine Translation with Domain-Aware Feature Embeddings

被引:0
|
作者
Dou, Zi-Yi [1 ]
Hu, Junjie [1 ]
Anastasopoulos, Antonios [1 ]
Neubig, Graham [1 ]
机构
[1] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recent success of neural machine translation models relies on the availability of high quality, in-domain data. Domain adaptation is required when domain-specific data is scarce or nonexistent. Previous unsupervised domain adaptation strategies include training the model with in-domain copied monolingual or back-translated data. However, these methods use generic representations for text regardless of domain shift, which makes it infeasible for translation models to control outputs conditional on a specific domain. In this work, we propose an approach that adapts models with domain-aware feature embeddings, which are learned via an auxiliary language modeling task. Our approach allows the model to assign domain-specific representations to words and output sentences in the desired domain. Our empirical results demonstrate the effectiveness of the proposed strategy, achieving consistent improvements in multiple experimental settings. In addition, we show that combining our method with back translation can further improve the performance of the model.(1)
引用
收藏
页码:1417 / 1422
页数:6
相关论文
共 50 条
  • [1] Unsupervised Domain Adaptation for Neural Machine Translation
    Yang, Zhen
    Chen, Wei
    Wang, Feng
    Xu, Bo
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 338 - 343
  • [2] Domain-Aware Self-Attention for Multi-Domain Neural Machine Translation
    Zhang, Shiqi
    Liu, Yan
    Xiong, Deyi
    Zhang, Pei
    Chen, Boxing
    [J]. INTERSPEECH 2021, 2021, : 2047 - 2051
  • [3] MetaCorrection: Domain-aware Meta Loss Correction for Unsupervised Domain Adaptation in Semantic Segmentation
    Guo, Xiaoqing
    Yang, Chen
    Li, Baopu
    Yuan, Yixuan
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 3926 - 3935
  • [4] Neural Network Machine Translation Method Based on Unsupervised Domain Adaptation
    Wang, Rui
    [J]. COMPLEXITY, 2020, 2020
  • [5] Non-Parametric Unsupervised Domain Adaptation for Neural Machine Translation
    Zheng, Xin
    Zhang, Zhirui
    Huang, Shujian
    Chen, Boxing
    Xie, Jun
    Luo, Weihua
    Chen, Jiajun
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 4234 - 4241
  • [6] Multi-source domain adaptation for dependency parsing via domain-aware feature generation
    Li, Ying
    Zhang, Zhenguo
    Xian, Yantuan
    Yu, Zhengtao
    Gao, Shengxiang
    Mao, Cunli
    Huang, Yuxin
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, : 6093 - 6106
  • [7] Feature Stylization and Domain-aware Contrastive Learning for Domain Generalization
    Jeon, Seogkyu
    Hong, Kibeom
    Lee, Pilhyeon
    Lee, Jewook
    Byun, Hyeran
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 22 - 31
  • [8] A Study of Methods for the Generation of Domain-Aware Word Embeddings
    Seyler, Dominic
    Zhai, ChengXiang
    [J]. PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 1609 - 1612
  • [9] Geometry-aware Domain Adaptation for Unsupervised Alignment of Word Embeddings
    Jawanpuria, Pratik
    Meghwanshi, Mayank
    Mishra, Bamdev
    [J]. 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 3052 - 3058
  • [10] Vocabulary Adaptation for Domain Adaptation in Neural Machine Translation
    Sato, Shoetsu
    Sakuma, Jin
    Yoshinaga, Naoki
    Toyoda, Masashi
    Kitsuregawa, Masaru
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 4269 - 4279