Domain Adaptation and Multi-Domain Adaptation for Neural Machine Translation: A Survey

被引:0
|
作者
Saunders, Danielle [1 ]
机构
[1] Univ Cambridge, Engn Dept, Cambridge, England
基金
英国工程与自然科学研究理事会;
关键词
ENSEMBLE; MODELS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The development of deep learning techniques has allowed Neural Machine Translation (NMT) models to become extremely powerful, given sufficient training data and training time. However, systems struggle when translating text from a new domain with a distinct style or vocabulary. Fine-tuning on in-domain data allows good domain adaptation, but requires sufficient relevant bilingual data. Even if this is available, simple fine-tuning can cause overfitting to new data and catastrophic forgetting of previously learned behaviour. We survey approaches to domain adaptation for NMT, particularly where a system may need to translate across multiple domains. We divide techniques into those revolving around data selection or generation, model architecture, parameter adaptation procedure, and inference procedure. We finally highlight the benefits of domain adaptation and multi -domain adaptation techniques to other lines of NMT research.
引用
收藏
页码:351 / 424
页数:74
相关论文
共 50 条
  • [1] Domain Adaptation and Multi-Domain Adaptation for Neural Machine Translation: A Survey
    Saunders, Danielle
    [J]. Journal of Artificial Intelligence Research, 2022, 75 : 351 - 424
  • [2] Effective domain awareness and adaptation approach via mask substructure for multi-domain neural machine translation
    Huang, Shuanghong
    Guo, Junjun
    Yu, Zhengtao
    Wen, Yonghua
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (19): : 14047 - 14060
  • [3] Effective domain awareness and adaptation approach via mask substructure for multi-domain neural machine translation
    Shuanghong Huang
    Junjun Guo
    Zhengtao Yu
    Yonghua Wen
    [J]. Neural Computing and Applications, 2023, 35 : 14047 - 14060
  • [4] Improving the Quality Trade-Off for Neural Machine Translation Multi-Domain Adaptation
    Hasler, Eva
    Domhan, Tobias
    Trenous, Jonay
    Tran, Ke
    Byrne, Bill
    Hieber, Felix
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 8470 - 8477
  • [5] Vocabulary Adaptation for Domain Adaptation in Neural Machine Translation
    Sato, Shoetsu
    Sakuma, Jin
    Yoshinaga, Naoki
    Toyoda, Masashi
    Kitsuregawa, Masaru
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 4269 - 4279
  • [6] Learning Domain Specific Sub-layer Latent Variable for Multi-domain Adaptation Neural Machine Translation
    Huang, Shuanghong
    Feng, Chong
    Shi, Ge
    Li, Zhengjun
    Zhao, Xuan
    Li, Xinyan
    Wang, Xiaomei
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (06)
  • [7] A Domain Adaptation Method for Neural Machine Translation
    Tian, Xiaohu
    Liu, Jin
    Pu, Jiachen
    Wang, Jin
    [J]. ADVANCED MULTIMEDIA AND UBIQUITOUS ENGINEERING, MUE/FUTURETECH 2018, 2019, 518 : 321 - 326
  • [8] Unsupervised Domain Adaptation for Neural Machine Translation
    Yang, Zhen
    Chen, Wei
    Wang, Feng
    Xu, Bo
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 338 - 343
  • [9] Sentence Embedding for Neural Machine Translation Domain Adaptation
    Wang, Rui
    Finch, Andrew
    Utiyama, Masao
    Sumita, Eiichiro
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 2, 2017, : 560 - 566
  • [10] Curriculum Learning for Domain Adaptation in Neural Machine Translation
    Zhang, Xuan
    Shapiro, Pamela
    Kumar, Gaurav
    McNamee, Paul
    Carpuat, Marine
    Duh, Kevin
    [J]. 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 1903 - 1915