An Empirical Comparison of Domain Adaptation Methods for Neural Machine Translation

Cited by: 77
Authors
Chu, Chenhui [1 ,3 ]
Dabre, Raj [2 ]
Kurohashi, Sadao [2 ]
Affiliations
[1] Osaka Univ, Inst Databil Sci, Suita, Osaka, Japan
[2] Kyoto Univ, Grad Sch Informat, Kyoto, Japan
[3] Japan Sci & Technol Agcy, Kawaguchi, Saitama, Japan
DOI
10.18653/v1/P17-2061
Chinese Library Classification
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
In this paper, we propose a novel domain adaptation method named "mixed fine tuning" for neural machine translation (NMT). We combine two existing approaches, namely fine tuning and multi-domain NMT. We first train an NMT model on an out-of-domain parallel corpus and then fine-tune it on a parallel corpus that mixes the in-domain and out-of-domain corpora. All corpora are augmented with artificial tags that indicate their domains. We empirically compare the proposed method against the fine tuning and multi-domain methods and discuss its benefits and shortcomings.
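The data preparation step described in the abstract (tagging each corpus with its domain and mixing the two for the fine-tuning stage) might look roughly like the following. This is a minimal sketch, not the authors' implementation: the tag strings `<2in>` and `<2out>`, the function names, and the `oversample` parameter are all hypothetical, and the abstract does not state how the two corpora are balanced.

```python
import random


def tag_corpus(pairs, domain_tag):
    """Prepend an artificial domain tag to every source sentence.

    `pairs` is a list of (source, target) sentence pairs; targets are untouched.
    """
    return [(f"{domain_tag} {src}", tgt) for src, tgt in pairs]


def build_mixed_corpus(in_domain, out_of_domain,
                       in_tag="<2in>", out_tag="<2out>",
                       oversample=1, seed=0):
    """Build the mixed corpus used for the second (fine-tuning) stage.

    Assumptions (not taken from the abstract): the tag strings are
    placeholders, and `oversample` simply repeats the in-domain data so
    it is not swamped by a much larger out-of-domain corpus.
    """
    tagged_in = tag_corpus(in_domain, in_tag) * oversample
    tagged_out = tag_corpus(out_of_domain, out_tag)
    mixed = tagged_in + tagged_out
    # Shuffle with a fixed seed so the mix is reproducible.
    random.Random(seed).shuffle(mixed)
    return mixed


# Toy data standing in for real parallel corpora.
in_domain = [("hello", "bonjour"), ("thanks", "merci")]
out_of_domain = [("cat", "chat"), ("dog", "chien"), ("bird", "oiseau")]

mixed = build_mixed_corpus(in_domain, out_of_domain, oversample=2)
```

Under this sketch, stage one would train on `tag_corpus(out_of_domain, "<2out>")` alone, and stage two would continue training the same model on `mixed`.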
Pages: 385-391
Number of pages: 7