Distortion Models For Statistical Machine Translation

Authors
Al-Onaizan, Yaser [1]
Papineni, Kishore [1]
Affiliations
[1] IBM Corp, TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we argue that n-gram language models are not sufficient to address word reordering required for Machine Translation. We propose a new distortion model that can be used with existing phrase-based SMT decoders to address those n-gram language model limitations. We present empirical results in Arabic to English Machine Translation that show statistically significant improvements when our proposed model is used. We also propose a novel metric to measure word order similarity (or difference) between any pair of languages based on word alignments.
Pages: 529-536
Page count: 8