Incorporating Statistical Machine Translation Word Knowledge Into Neural Machine Translation

被引:28
|
作者
Wang, Xing [1 ]
Tu, Zhaopeng [2 ]
Zhang, Min [1 ]
机构
[1] Soochow Univ, Sch Comp Sci & Technol, Inst Artificial Intelligence, Suzhou 215006, Peoples R China
[2] Tencent AI Lab, Shenzhen 518000, Peoples R China
基金
中国国家自然科学基金;
关键词
Neural machine translation; statistical machine translation; hybrid translation; translation combination; NETWORKS;
D O I
10.1109/TASLP.2018.2860287
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Neural machine translation (NMT) has gained more and more attention in recent years, mainly due to its simplicity yet state-of-the-art performance. However, previous research has shown that NMT suffers from several limitations: source coverage guidance, translation of rare words, and the limited vocabulary, while statistical machine translation (SMT) has complementary properties that correspond well to these limitations. It is straightforward to improve the translation performance by combining the advantages of two kinds of models. This paper proposes a general framework for incorporating the SMT word knowledge into NMT to alleviate above word-level limitations. In our framework, the NMT decoder makes more accurate word prediction by referring to the SMT word recommendations in both training and testing phases. Specifically, the SMT model offers informative word recommendations based on the NMT decoding information. Then, we use the SMT word predictions as prior knowledge to adjust the NMT word generation probability, which unitizes a neural network based classifier to digest the discrete word knowledge. In this paper, we use two model variants to implement the framework, one with a gating mechanism and the other with a direct competition mechanism. Experimental results on Chinese-to-English and English-to-German translation tasks show that the proposed framework can take advantage of the SMT word knowledge and consistently achieve significant improvements over NMT and SMT baseline systems.
引用
收藏
页码:2255 / 2266
页数:12
相关论文
共 50 条
  • [1] Reduction of Neural Machine Translation Failures by Incorporating Statistical Machine Translation
    Dugonik, Jani
    Maucec, Mirjam Sepesy
    Verber, Domen
    Brest, Janez
    [J]. MATHEMATICS, 2023, 11 (11)
  • [2] Incorporating Word Reordering Knowledge into Attention-based Neural Machine Translation
    Zhang, Jinchao
    Wang, Mingxuan
    Liu, Qun
    Zhou, Jie
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1524 - 1534
  • [3] Neural Machine Translation Advised by Statistical Machine Translation
    Wang, Xing
    Lu, Zhengdong
    Tu, Zhaopeng
    Li, Hang
    Xiong, Deyi
    Zhang, Min
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3330 - 3336
  • [4] Incorporating Syntactic Knowledge in Neural Quality Estimation for Machine Translation
    Ye, Na
    Wang, Yuanyuan
    Cai, Dongfeng
    [J]. MACHINE TRANSLATION, CCMT 2019, 2019, 1104 : 23 - 34
  • [5] Integrating Prior Translation Knowledge Into Neural Machine Translation
    Chen, Kehai
    Wang, Rui
    Utiyama, Masao
    Sumita, Eiichiro
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 330 - 339
  • [6] Word Position Aware Translation Memory for Neural Machine Translation
    He, Qiuxiang
    Huang, Guoping
    Liu, Lemao
    Li, Li
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING (NLPCC 2019), PT I, 2019, 11838 : 367 - 379
  • [7] Generation of word graphs in statistical machine translation
    Ueffing, N
    Och, FJ
    Ney, H
    [J]. PROCEEDINGS OF THE 2002 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, 2002, : 156 - 163
  • [8] Analysing terminology translation errors in statistical and neural machine translation
    Haque, Rejwanul
    Hasanuzzaman, Mohammed
    Way, Andy
    [J]. MACHINE TRANSLATION, 2020, 34 (2-3) : 149 - 195
  • [9] On the Word Alignment from Neural Machine Translation
    Li, Xintong
    Li, Guanlin
    Liu, Lemao
    Meng, Max
    Shi, Shuming
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 1293 - 1303
  • [10] Content Word Aware Neural Machine Translation
    Chen, Kehai
    Wang, Rui
    Utiyama, Masao
    Sumita, Eiichiro
    [J]. 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 358 - 364