Topic-based term translation models for statistical machine translation

Cited by: 8
Authors
Xiong, Deyi [1]
Meng, Fandong [2]
Liu, Qun [2, 3]
Affiliations
[1] Soochow Univ, Suzhou, Peoples R China
[2] Inst Comp Technol, Beijing, Peoples R China
[3] Dublin City Univ, Sch Comp, Dublin 9, Ireland
Funding
National Natural Science Foundation of China; Science Foundation Ireland
Keywords
Term; Term translation disambiguation; Term translation consistency; Term unithood; Statistical machine translation;
DOI
10.1016/j.artint.2015.12.002
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Term translation is of great importance for machine translation. In this article, we investigate three issues of term translation in the context of statistical machine translation and propose three corresponding models: (a) a term translation disambiguation model which selects desirable translations for terms in the source language with domain information, (b) a term translation consistency model that encourages consistent translations for terms with a high strength of translation consistency throughout a document, and (c) a term unithood model that rewards translation hypotheses where source terms are translated into target strings as a whole unit. We integrate the three models into hierarchical phrase-based SMT and evaluate their effectiveness on NIST Chinese-English translation with large-scale training data. Experimental results show that all three models can achieve substantial improvements over the baseline. Our analyses also suggest that the proposed models are capable of improving term translation. © 2015 Elsevier B.V. All rights reserved.
Pages: 54-75
Number of pages: 22
Related papers
50 in total
  • [21] Neural Machine Translation Advised by Statistical Machine Translation
    Wang, Xing
    Lu, Zhengdong
    Tu, Zhaopeng
    Li, Hang
    Xiong, Deyi
    Zhang, Min
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3330 - 3336
  • [22] Graph-based Lexicalized Reordering Models for Statistical Machine Translation
    Su Jinsong
    Liu Yang
    Liu Qun
    Dong Huailin
    CHINA COMMUNICATIONS, 2014, 11 (05) : 71 - 82
  • [23] Backward and trigger-based language models for statistical machine translation
    Xiong, Deyi
    Zhang, Min
    NATURAL LANGUAGE ENGINEERING, 2015, 21 (02) : 201 - 226
  • [24] Topic-aware pivot language approach for statistical machine translation
    Jin-song SU
    Xiao-dong SHI
    Yan-zhou HUANG
    Yang LIU
    Qing-qiang WU
    Yi-dong CHEN
    Huai-lin DONG
    Frontiers of Information Technology & Electronic Engineering, 2014, (04) : 241 - 253
  • [25] Statistical machine translation
    Sanchez-Martinez, Felipe
    Antonio Perez-Ortiz, Juan
    MACHINE TRANSLATION, 2010, 24 (3-4) : 273 - 278
  • [26] Statistical Machine Translation
    Vandeghinste, Vincent
    Van Eynde, Frank
    TARGET-INTERNATIONAL JOURNAL OF TRANSLATION STUDIES, 2012, 24 (01) : 157 - 159
  • [27] Statistical Machine Translation
    Vatsa, Mukesh G. S.
    Joshi, Nikita
    Goswami, Sumit
    DESIDOC JOURNAL OF LIBRARY & INFORMATION TECHNOLOGY, 2010, 30 (04) : 25 - 32
  • [28] Statistical Machine Translation
    Babhulgaonkar, A. R.
    Bharad, S. V.
    2017 1ST INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND INFORMATION MANAGEMENT (ICISIM), 2017, : 62 - 67
  • [29] Statistical machine translation
    Lopez, Adam
    ACM COMPUTING SURVEYS, 2008, 40 (03)
  • [30] Statistical Machine Translation
    Cherry, Colin
    COMPUTATIONAL LINGUISTICS, 2010, 36 (04) : 773 - 776