Improved Neural Machine Translation with SMT Features

Cited by: 0
Authors
He, Wei [1]
He, Zhongjun [1]
Wu, Hua [1]
Wang, Haifeng [1]
Affiliations
[1] Baidu Inc, 10 Shangdi 10th St, Beijing 100085, People's Republic of China
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
Neural machine translation (NMT) performs end-to-end translation with a source-language encoder and a target-language decoder, and achieves promising translation performance. However, as a newly emerged approach, the method has some limitations. An NMT system usually has to restrict its vocabulary to a fixed size to avoid time-consuming training and decoding, which causes a serious out-of-vocabulary problem. Furthermore, the decoder lacks a mechanism to guarantee that all source words are translated and usually favors short translations, resulting in fluent but inadequate output. To address these problems, we incorporate statistical machine translation (SMT) features, such as a translation model and an n-gram language model, with the NMT model under the log-linear framework. Our experiments show that the proposed method significantly improves the translation quality of a state-of-the-art NMT system on Chinese-to-English translation tasks. Our method produces a gain of up to 2.33 BLEU points on NIST open test sets.
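As an illustration of the log-linear combination mentioned in the abstract, the sketch below scores each candidate translation as a weighted sum of feature values (an NMT log-probability, a translation-model log-probability, and an n-gram language-model log-probability) and picks the highest-scoring candidate. It is a minimal sketch, not the authors' implementation: the function names, the feature names `nmt`, `tm`, and `lm`, the weights, and all numeric values are hypothetical and chosen only to show the mechanics.

```python
from typing import Dict, List

# Hypothetical feature weights (the lambdas of a log-linear model).
# In practice such weights would be tuned on a development set.
WEIGHTS: Dict[str, float] = {"nmt": 1.0, "tm": 0.5, "lm": 0.5}


def log_linear_score(features: Dict[str, float], weights: Dict[str, float]) -> float:
    """Return sum_i weights[i] * features[i] over the candidate's features."""
    return sum(weights[name] * value for name, value in features.items())


def pick_best(candidates: List[dict], weights: Dict[str, float]) -> dict:
    """Return the candidate with the highest log-linear score."""
    return max(candidates, key=lambda c: log_linear_score(c["features"], weights))


# Made-up log-probabilities for two candidate translations.
# The NMT model alone prefers the short candidate; the translation-model
# feature penalizes it for leaving source words uncovered.
candidates = [
    {"text": "short output",
     "features": {"nmt": -2.0, "tm": -6.0, "lm": -3.0}},
    {"text": "longer and more adequate output",
     "features": {"nmt": -2.5, "tm": -3.0, "lm": -3.5}},
]

best = pick_best(candidates, WEIGHTS)
print(best["text"])
```

With these illustrative numbers, the combined score favors the second candidate even though its NMT score is lower, which is the kind of correction toward adequacy that the abstract describes.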
Pages: 151-157
Number of pages: 7