Domain-Aware Self-Attention for Multi-Domain Neural Machine Translation

Cited by: 2
Authors
Zhang, Shiqi [1]
Liu, Yan [2]
Xiong, Deyi [2]
Zhang, Pei [1]
Chen, Boxing [1]
Affiliations
[1] Alibaba Grp Inc, Hong Kong, Peoples R China
[2] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
Source
Keywords
Transformer; machine translation; domain adaptation; unsupervised learning
DOI
10.21437/Interspeech.2021-1477
Chinese Library Classification
R36 [Pathology]; R76 [Otorhinolaryngology]
Discipline classification codes
100104; 100213
Abstract
In this paper, we investigate multi-domain neural machine translation (NMT), which translates sentences from different domains with a single model. To this end, we propose a domain-aware self-attention mechanism that jointly learns domain representations with the single NMT model. The learned domain representations are integrated into both the encoder and decoder. We further propose two different domain representation learning approaches: 1) word-level unsupervised learning via a domain attention network and 2) guided learning with an auxiliary loss. The two learning approaches allow our multi-domain NMT to work whether or not domain information is available. Experiments on both Chinese-English and English-French translation demonstrate that our multi-domain model outperforms a strong baseline built on the Transformer as well as previous multi-domain NMT approaches. Further analyses show that our model is able to learn domain clusters even without prior knowledge about the domain structure.
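The abstract gives no equations, so the following is only an illustrative sketch of how the described pieces could fit together; it is not the authors' implementation. It assumes a small set of learnable domain embeddings, a dot-product domain attention that mixes them per word, a single attention head whose inputs are word vectors plus their domain vectors, and a cross-entropy auxiliary loss for the guided setting. All function names and the toy vectors are hypothetical.

```python
import math

def softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def domain_representation(word_vec, domain_embeddings):
    # Assumed word-level domain attention: weight each domain embedding
    # by its similarity to the word, then mix the embeddings.
    weights = softmax([dot(word_vec, e) for e in domain_embeddings])
    mixed = [sum(w * e[i] for w, e in zip(weights, domain_embeddings))
             for i in range(len(word_vec))]
    return mixed, weights

def domain_aware_self_attention(words, domain_embeddings):
    # Assumed integration: add each word's domain vector to the word
    # vector, then run plain single-head scaled dot-product attention.
    dim = len(words[0])
    enriched = []
    for w in words:
        d, _ = domain_representation(w, domain_embeddings)
        enriched.append([wi + di for wi, di in zip(w, d)])
    outputs = []
    for q in enriched:
        attn = softmax([dot(q, k) / math.sqrt(dim) for k in enriched])
        outputs.append([sum(a * v[i] for a, v in zip(attn, enriched))
                        for i in range(dim)])
    return outputs

def auxiliary_domain_loss(weights, gold_domain):
    # Assumed guided setting: cross-entropy between the domain attention
    # weights and a known domain label.
    return -math.log(weights[gold_domain])

# Toy inputs (hypothetical): three 2-d word vectors, two domain embeddings.
words = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
domains = [[0.5, -0.5], [-0.5, 0.5]]
outputs = domain_aware_self_attention(words, domains)
_, first_word_weights = domain_representation(words[0], domains)
loss = auxiliary_domain_loss(first_word_weights, 0)
```

In the unsupervised setting only the attention weights would be learned from the translation objective; in the guided setting a loss such as `auxiliary_domain_loss` would be added when domain labels are available. How the paper actually combines these terms is not stated in the abstract.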
Pages: 2047-2051
Page count: 5
Related papers
50 in total
  • [21] Effective domain awareness and adaptation approach via mask substructure for multi-domain neural machine translation
    Shuanghong Huang
    Junjun Guo
    Zhengtao Yu
    Yonghua Wen
    [J]. Neural Computing and Applications, 2023, 35 : 14047 - 14060
  • [22] Building a Multi-Domain Neural Machine Translation Model Using Knowledge Distillation
    Mghabbar, Idriss
    Ratnamogan, Pirashanth
    [J]. ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 2116 - 2123
  • [23] WDSRL: Multi-Domain Neural Machine Translation With Word-Level Domain-Sensitive Representation Learning
    Man, Zhibo
    Huang, Zengcheng
    Zhang, Yujie
    Li, Yu
    Chen, Yuanmeng
    Chen, Yufeng
    Xu, Jinan
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 577 - 590
  • [24] Multi-Domain Neural Machine Translation with Word-Level Adaptive Layer-wise Domain Mixing
    Jiang, Haoming
    Liang, Chen
    Wang, Chong
    Zhao, Tuo
    [J]. 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 1823 - 1834
  • [25] Learning Domain Specific Sub-layer Latent Variable for Multi-domain Adaptation Neural Machine Translation
    Huang, Shuanghong
    Feng, Chong
    Shi, Ge
    Li, Zhengjun
    Zhao, Xuan
    Li, Xinyan
    Wang, Xiaomei
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (06)
  • [26] Self-Attention Neural Machine Translation for Automatic Software Repair
    Cao, He-Ling
    Liu, Yu
    Han, Dong
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2024, 52 (03) : 945 - 956
  • [27] Improving the Quality Trade-Off for Neural Machine Translation Multi-Domain Adaptation
    Hasler, Eva
    Domhan, Tobias
    Trenous, Jonay
    Tran, Ke
    Byrne, Bill
    Hieber, Felix
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 8470 - 8477
  • [28] Collaborative attention neural network for multi-domain sentiment classification
    Yue, Chunyi
    Cao, Hanqiang
    Xu, Guoping
    Dong, Youli
    [J]. APPLIED INTELLIGENCE, 2021, 51 (06) : 3174 - 3188
  • [30] DOMAIN-AWARE NEURAL LANGUAGE MODELS FOR SPEECH RECOGNITION
    Liu, Linda
    Gu, Yile
    Gourav, Aditya
    Gandhe, Ankur
    Kalmane, Shashank
    Filimonov, Denis
    Rastrow, Ariya
    Bulyko, Ivan
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7373 - 7377