Phrase-based statistical machine translation using approximate matching

Authors
Tomas, Jesus [1]
Lloret, Jaime [2]
Casacuberta, Francisco [2]
Affiliations
[1] Univ Politecn Valencia, Inst Tecnol Informat, E-46071 Valencia, Spain
[2] Univ Politecn Valencia, Dept Comun, E-46071 Valencia, Spain
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Phrase-based statistical models are among the most competitive pattern-recognition approaches to machine translation. In these models, the source sentence is segmented into phrases, and each phrase is then translated using a stochastic dictionary. One shortcoming of the phrase-based model is its limited generalization capability: if a sequence of words has not been seen in training, it cannot be translated as a whole phrase. In this paper we try to overcome this drawback. The basic idea is that if a source phrase is not in our dictionary (that is, it has not been seen in training), we look for the most similar phrase in the dictionary and try to adapt its translation to the source phrase. We use the well-known edit distance as the measure of similarity. We present results on an English-Spanish task (XRCE).
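The approximate-matching idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how a translation is adapted, so the substitution-only adaptation step, the `word_lexicon` helper, the `max_distance` threshold, and the toy dictionary entries are all assumptions made for illustration. Only the use of edit distance to find the most similar known phrase comes from the abstract.

```python
from typing import Dict, List, Optional, Tuple


def edit_distance(a: List[str], b: List[str]) -> int:
    """Word-level Levenshtein distance between two token sequences."""
    prev = list(range(len(b) + 1))
    for i, wa in enumerate(a, start=1):
        curr = [i]
        for j, wb in enumerate(b, start=1):
            cost = 0 if wa == wb else 1
            curr.append(min(prev[j] + 1,          # delete wa
                            curr[j - 1] + 1,      # insert wb
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1]


def closest_phrase(source: str, phrase_table: Dict[str, str]) -> Tuple[str, int]:
    """Return the known source phrase nearest to `source` and its distance."""
    src = source.split()
    best = min(phrase_table, key=lambda p: edit_distance(src, p.split()))
    return best, edit_distance(src, best.split())


def translate(source: str,
              phrase_table: Dict[str, str],
              word_lexicon: Dict[str, str],
              max_distance: int = 1) -> Optional[str]:
    """Translate a phrase, falling back to an adapted near match.

    Assumption: adaptation handles word substitutions only, and swaps a
    target word when a (hypothetical) word lexicon covers both words.
    """
    if source in phrase_table:                # exact match: normal lookup
        return phrase_table[source]
    match, dist = closest_phrase(source, phrase_table)
    if dist > max_distance:
        return None                           # nothing similar enough
    src, ref = source.split(), match.split()
    if len(src) != len(ref):
        return None                           # insertions/deletions not handled here
    target = phrase_table[match].split()
    for s, r in zip(src, ref):
        if s != r:
            old, new = word_lexicon.get(r), word_lexicon.get(s)
            if old is None or new is None or old not in target:
                return None                   # cannot adapt this difference
            target[target.index(old)] = new
    return " ".join(target)


# Toy entries invented for illustration; not taken from the XRCE corpus.
phrase_table = {
    "the red printer": "la impresora roja",
    "press the button": "pulse el botón",
}
word_lexicon = {"red": "roja", "black": "negra"}

print(translate("the black printer", phrase_table, word_lexicon))
```

Here "the black printer" is not in the dictionary, so the sketch retrieves "the red printer" (edit distance 1) and swaps the translation of the differing word. A real system would also have to score the adapted translation and handle insertions and deletions, which this sketch deliberately leaves out.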
Pages: 475 / +
Page count: 2