Topic-based term translation models for statistical machine translation

被引:8
|
作者
Xiong, Deyi [1 ]
Meng, Fandong [2 ]
Liu, Qun [2 ,3 ]
机构
[1] Soochow Univ, Suzhou, Peoples R China
[2] Inst Comp Technol, Beijing, Peoples R China
[3] Dublin City Univ, Sch Comp, Dublin 9, Ireland
基金
中国国家自然科学基金; 爱尔兰科学基金会;
关键词
Term; Term translation disambiguation; Term translation consistency; Term unithood; Statistical machine translation;
D O I
10.1016/j.artint.2015.12.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Term translation is of great importance for machine translation. In this article, we investigate three issues of term translation in the context of statistical machine translation and propose three corresponding models: (a) a term translation disambiguation model which selects desirable translations for terms in the source language with domain information, (b) a term translation consistency model that encourages consistent translations for terms with a high strength of translation consistency throughout a document, and (c) a term unithood model that rewards translation hypotheses where source terms are translated into target strings as a whole unit. We integrate the three models into hierarchical phrase-based SMT and evaluate their effectiveness on NIST Chinese-English translation with large-scale training data. Experiment results show that all three models can achieve substantial improvements over the baseline. Our analyses also suggest that the proposed models are capable of improving term translation. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:54 / 75
页数:22
相关论文
共 50 条
  • [31] Statistical Machine Translation
    Zhang Xiaojun
    APPLIED LINGUISTICS, 2011, 32 (03) : 359 - 362
  • [32] MACHINE TRANSLATION: A CRITICAL LOOK AT THE PERFORMANCE OF RULE-BASED AND STATISTICAL MACHINE TRANSLATION
    Banitz, Brita
    CADERNOS DE TRADUCAO, 2020, 40 (01): : 54 - 71
  • [33] Linguistically motivated statistical machine translation: models and algorithms
    Vandeghinste, Vincent
    MACHINE TRANSLATION, 2015, 29 (3-4) : 291 - 294
  • [34] An Investigation on Statistical Machine Translation with Neural Language Models
    Zhao, Yinggong
    Huang, Shujian
    Chen, Huadong
    Chen, Jiajun
    CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA, CCL 2014, 2014, 8801 : 175 - 186
  • [35] Syntax-Based Statistical Machine Translation
    Hadiwinoto, Christian
    COMPUTATIONAL LINGUISTICS, 2017, 43 (04) : 893 - 896
  • [36] Phrase-based statistical machine translation
    Zens, R
    Och, FJ
    Ney, H
    KI2002: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2002, 2479 : 18 - 32
  • [37] Statistical machine translation decoder based on phrase
    ATR Spoken Language Translation Research Laboratories, 2-2-2 Hikaridai Seika-cho, Soraku-gun, Kyoto
    619-0288, Japan
    不详
    606-8501, Japan
    Int. Conf. Spok. Lang. Process., ICSLP, (1889-1892):
  • [38] What's in a Domain? Analyzing Genre and Topic Differences in Statistical Machine Translation
    van der Wees, Marlies
    Bisazza, Arianna
    Weerkamp, Wouter
    Monz, Christof
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, 2015, : 560 - 566
  • [39] Translation Model of Myanmar Phrases for Statistical Machine Translation
    Zin, Thet Thet
    Soe, Khin Mar
    Thein, Ni Lar
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF ARTIFICIAL INTELLIGENCE, 2012, 6839 : 235 - +
  • [40] Incorporating Statistical Machine Translation Word Knowledge Into Neural Machine Translation
    Wang, Xing
    Tu, Zhaopeng
    Zhang, Min
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (12) : 2255 - 2266