Topic-based term translation models for statistical machine translation

被引:8
|
作者
Xiong, Deyi [1 ]
Meng, Fandong [2 ]
Liu, Qun [2 ,3 ]
机构
[1] Soochow Univ, Suzhou, Peoples R China
[2] Inst Comp Technol, Beijing, Peoples R China
[3] Dublin City Univ, Sch Comp, Dublin 9, Ireland
基金
中国国家自然科学基金; 爱尔兰科学基金会;
关键词
Term; Term translation disambiguation; Term translation consistency; Term unithood; Statistical machine translation;
D O I
10.1016/j.artint.2015.12.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Term translation is of great importance for machine translation. In this article, we investigate three issues of term translation in the context of statistical machine translation and propose three corresponding models: (a) a term translation disambiguation model which selects desirable translations for terms in the source language with domain information, (b) a term translation consistency model that encourages consistent translations for terms with a high strength of translation consistency throughout a document, and (c) a term unithood model that rewards translation hypotheses where source terms are translated into target strings as a whole unit. We integrate the three models into hierarchical phrase-based SMT and evaluate their effectiveness on NIST Chinese-English translation with large-scale training data. Experiment results show that all three models can achieve substantial improvements over the baseline. Our analyses also suggest that the proposed models are capable of improving term translation. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:54 / 75
页数:22
相关论文
共 50 条
  • [41] Incorporating Statistical Machine Translation Word Knowledge Into Neural Machine Translation
    Wang, Xing
    Tu, Zhaopeng
    Zhang, Min
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (12) : 2255 - 2266
  • [42] Reduction of Neural Machine Translation Failures by Incorporating Statistical Machine Translation
    Dugonik, Jani
    Maucec, Mirjam Sepesy
    Verber, Domen
    Brest, Janez
    MATHEMATICS, 2023, 11 (11)
  • [43] Unsupervised Statistical Machine Translation
    Artetxe, Mikel
    Labaka, Gorka
    Agirre, Eneko
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3632 - 3642
  • [44] Discourse in Statistical Machine Translation
    Hardmeier, Christian
    DISCOURS-REVUE DE LINGUISTIQUE PSYCHOLINGUISTIQUE ET INFORMATIQUE, 2012, (11):
  • [45] A SomAgent statistical machine translation
    Lopez, V. F.
    Corchado, J. M.
    De Paz, J. F.
    Rodriguez, S.
    Bajo, J.
    APPLIED SOFT COMPUTING, 2011, 11 (02) : 2925 - 2933
  • [46] A critique of Statistical Machine Translation
    Way, Andy
    LINGUISTICA ANTVERPIENSIA NEW SERIES-THEMES IN TRANSLATION STUDIES, 2009, 8 : 17 - 41
  • [47] Statistical Alignment Models in Machine Translation from Slovenian to English
    Maucec, Mirjam Sepesy
    Brest, Janez
    Kaic, Zdravko
    ELEKTROTEHNISKI VESTNIK-ELECTROCHEMICAL REVIEW, 2006, 73 (05): : 273 - 278
  • [48] Statistical alignment models in machine translation from Slovenian to English
    University of Maribor, Faculty of Electrical Engineering and Computer Science, Smetanova 17, Maribor, Slovenia
    Elektroteh Vestn Electrotech Rev, 2006, 5 (273-278):
  • [49] Compositions of Tree-to-Tree Statistical Machine Translation Models
    Maletti, Andreas
    INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE, 2018, 29 (05) : 877 - 892
  • [50] Compositions of Tree-to-Tree Statistical Machine Translation Models
    Maletti, Andreas
    DEVELOPMENTS IN LANGUAGE THEORY, DLT 2016, 2016, 9840 : 293 - 305