Topic-based term translation models for statistical machine translation

被引:8
|
作者
Xiong, Deyi [1 ]
Meng, Fandong [2 ]
Liu, Qun [2 ,3 ]
机构
[1] Soochow Univ, Suzhou, Peoples R China
[2] Inst Comp Technol, Beijing, Peoples R China
[3] Dublin City Univ, Sch Comp, Dublin 9, Ireland
基金
中国国家自然科学基金; 爱尔兰科学基金会;
关键词
Term; Term translation disambiguation; Term translation consistency; Term unithood; Statistical machine translation;
D O I
10.1016/j.artint.2015.12.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Term translation is of great importance for machine translation. In this article, we investigate three issues of term translation in the context of statistical machine translation and propose three corresponding models: (a) a term translation disambiguation model which selects desirable translations for terms in the source language with domain information, (b) a term translation consistency model that encourages consistent translations for terms with a high strength of translation consistency throughout a document, and (c) a term unithood model that rewards translation hypotheses where source terms are translated into target strings as a whole unit. We integrate the three models into hierarchical phrase-based SMT and evaluate their effectiveness on NIST Chinese-English translation with large-scale training data. Experiment results show that all three models can achieve substantial improvements over the baseline. Our analyses also suggest that the proposed models are capable of improving term translation. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:54 / 75
页数:22
相关论文
共 50 条
  • [1] Topic-based coherence modeling for statistical machine translation
    Institute for Infocomm Research, Singapore
    138632, Singapore
    不详
    215006, China
    IEEE Trans. Audio Speech Lang. Process., 3 (483-493):
  • [2] Topic-Based Coherence Modeling for Statistical Machine Translation
    Xiong, Deyi
    Zhang, Min
    Wang, Xing
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (03) : 483 - 493
  • [3] Improved statistical machine translation model with topic-based paraphrase
    Wu, Qing-Qiang, 1843, Zhejiang University (48):
  • [4] Topic-Based Dissimilarity and Sensitivity Models for Translation Rule Selection
    Zhang, Min
    Xiao, Xinyan
    Xiong, Deyi
    Liu, Qun
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2014, 50 : 1 - 30
  • [5] Topic-based dissimilarity and sensitivity models for translation rule selection
    Zhang, Min
    Xiao, Xinyan
    Xiong, Deyi
    Liu, Qun
    Journal of Artificial Intelligence Research, 2014, 50 : 1 - 30
  • [6] Topic Adaptation for Statistical Machine Translation
    Taraghi, Mina
    Khadivi, Shahram
    2017 25TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2017, : 2147 - 2152
  • [7] A Topic-Triggered Translation Model for Statistical Machine Translation
    SU Jinsong
    WANG Zhihao
    WU Qingqiang
    YAO Junfeng
    LONG Fei
    ZHANG Haiying
    ChineseJournalofElectronics, 2017, 26 (01) : 65 - 72
  • [8] A Topic-Triggered Translation Model for Statistical Machine Translation
    Su Jinsong
    Wang Zhihao
    Wu Qingqiang
    Yao Junfeng
    Long Fei
    Zhang Haiying
    CHINESE JOURNAL OF ELECTRONICS, 2017, 26 (01) : 65 - 72
  • [9] Statistical machine translation based on translation rules
    Yulian, H.
    Journal of Chemical and Pharmaceutical Research, 2014, 6 (07) : 1628 - 1635
  • [10] Bilingual cluster based models for statistical machine translation
    Yamamoto, Hirofumi
    Sumita, Eiichiro
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (03) : 588 - 597