Increasing Translation Speed in Phrase-based Models via Suboptimal Segmentation

Authors
Sanchis-Trilles, German [1]
Casacuberta, Francisco [1]
Affiliations
[1] Inst Informat Technol, Valencia 46022, Spain
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Phrase-based models are currently the core of the state of the art in the statistical pattern recognition approach to machine translation. Because they can introduce context information into the translation model, they usually produce translations whose quality is difficult to improve upon. However, these models usually have an important drawback: the translation speed they deliver is mostly insufficient for real-time tasks, and translating a single sentence can sometimes take several minutes. In this paper, we describe a novel technique for significantly reducing the size of the translation table by performing a Viterbi-style selection of the phrases that constitute the final phrase table. Even in cases where the pruned phrase table contains only 6% of the segments of the original one, translation quality is not worsened: it remains the same in the worst case and increases by 0.3 BLEU in the best case.
Pages: 135-143
Page count: 9