Increasing Translation Speed in Phrase-based Models via Suboptimal Segmentation

Cited by: 0
Authors
Sanchis-Trilles, German [1]
Casacuberta, Francisco [1]
Affiliations
[1] Inst Informat Technol, Valencia 46022, Spain
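Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract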
Phrase-based models are currently the core of the state of the art in the statistical pattern recognition approach to machine translation. Because they can introduce context information into the translation model, they usually produce translations whose quality is difficult to improve. However, these models usually have an important drawback: the translation speed they deliver is mostly insufficient for real-time tasks, and translating a single sentence can sometimes take several minutes. In this paper, we describe a novel technique for significantly reducing the size of the translation table by performing a Viterbi-style selection of the phrases that constitute the final phrase table. Even when the pruned phrase table contains only 6% of the segments of the original one, translation quality is not worsened: it remains the same in the worst case and increases by 0.3 BLEU in the best case.
Pages: 135-143
Page count: 9