Distortion Models For Statistical Machine Translation

被引:0
|
作者
Al-Onaizan, Yaser [1 ]
Papineni, Kishore [1 ]
机构
[1] IBM Corp, TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we argue that n-gram language models are not sufficient to address word reordering required for Machine Translation. We propose a new distortion model that can be used with existing phrase-based SMT decoders to address those n-gram language model limitations. We present empirical results in Arabic to English Machine Translation that show statistically significant improvements when our proposed model is used. We also propose a novel metric to measure word order similarity (or difference) between any pair of languages based on word alignments.
引用
收藏
页码:529 / 536
页数:8
相关论文
共 50 条
  • [31] A critique of Statistical Machine Translation
    Way, Andy
    LINGUISTICA ANTVERPIENSIA NEW SERIES-THEMES IN TRANSLATION STUDIES, 2009, 8 : 17 - 41
  • [32] Incorporating Statistical Machine Translation Word Knowledge Into Neural Machine Translation
    Wang, Xing
    Tu, Zhaopeng
    Zhang, Min
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (12) : 2255 - 2266
  • [33] Reduction of Neural Machine Translation Failures by Incorporating Statistical Machine Translation
    Dugonik, Jani
    Maucec, Mirjam Sepesy
    Verber, Domen
    Brest, Janez
    MATHEMATICS, 2023, 11 (11)
  • [34] Seal: Efficient Training Large Scale Statistical Machine Translation Models on Spark
    Gu, Rong
    Chen, Min
    Yang, Wenjia
    Yuan, Chunfeng
    Huang, Yihua
    2018 IEEE 24TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS 2018), 2018, : 118 - 125
  • [35] Discriminative Spoken Language Understanding Using Statistical Machine Translation Alignment Models
    Aliannejadi, Mohammad
    Khadivi, Shahram
    Ghidary, Saeed Shiry
    Bokaei, Mohammad Hadi
    ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING, AISP 2013, 2014, 427 : 194 - +
  • [36] A comparison of segmentation methods and extended lexicon models for Arabic statistical machine translation
    Hasan, Sasa
    Mansour, Saab
    Ney, Hermann
    MACHINE TRANSLATION, 2012, 26 (1-2) : 47 - 65
  • [37] Factored bilingual n-gram language models for statistical machine translation
    Crego, Josep M.
    Yvon, Francois
    MACHINE TRANSLATION, 2010, 24 (02) : 159 - 175
  • [38] A Survey of Word Reordering in Statistical Machine Translation: Computational Models and Language Phenomena
    Bisazza, Arianna
    Federico, Marcello
    COMPUTATIONAL LINGUISTICS, 2016, 42 (02) : 163 - 205
  • [39] Refined lexicon models for statistical machine translation using a maximum entropy approach
    Varea, IG
    Och, FJ
    Ney, H
    Casacuberta, F
    39TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2001, : 204 - 211
  • [40] Translation Model of Myanmar Phrases for Statistical Machine Translation
    Zin, Thet Thet
    Soe, Khin Mar
    Thein, Ni Lar
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF ARTIFICIAL INTELLIGENCE, 2012, 6839 : 235 - +