Some improvements in phrase-based statistical machine translation

Authors
Yang, Zhendong [1 ]
Pang, Wei [1 ]
Du, Jinhua [1 ]
Wei, Wei [1 ]
Xu, Bo [1 ]
Affiliation
[1] Chinese Acad Sci, Hi Tech Innovat Ctr, Inst Automat, Beijing 100080, Peoples R China
Keywords
phrase-based translation; minimum error rate training; phrase-template; re-scoring
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In statistical machine translation, many of the top-performing systems are phrase-based. This paper describes a phrase-based translation system and several improvements to it. We use additional information to compute the translation probability. The scaling factors of the log-linear model are estimated by minimum error rate training with an evaluation criterion that balances BLEU and NIST scores. We extract phrase templates from the initial phrases to address data sparseness and the distortion problem during decoding. The system produces its final output by re-ranking the n-best list of translations generated in the first pass. Experiments show that all of these refinements improve the results.
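As an illustration of the log-linear scoring and n-best re-ranking mentioned in the abstract, the following is a minimal sketch and not the authors' implementation. It assumes the standard log-linear formulation, in which each hypothesis is scored by a weighted sum of feature values. The feature names (`tm`, `lm`, `length`), their values, and the weights are hypothetical; in a real system the weights would come from minimum error rate training.

```python
from typing import Dict, List, Tuple

# (translation, feature name -> log-domain feature value)
Hypothesis = Tuple[str, Dict[str, float]]


def loglinear_score(features: Dict[str, float], weights: Dict[str, float]) -> float:
    """Weighted sum of feature values: sum over m of lambda_m * h_m(e, f)."""
    return sum(weights.get(name, 0.0) * value for name, value in features.items())


def rerank(nbest: List[Hypothesis], weights: Dict[str, float]) -> List[Hypothesis]:
    """Sort an n-best list from best to worst under the log-linear score."""
    return sorted(nbest, key=lambda hyp: loglinear_score(hyp[1], weights), reverse=True)


# Toy n-best list: hypothetical features and values, not taken from the paper.
nbest = [
    ("he go to school", {"tm": -2.0, "lm": -9.0, "length": 4.0}),
    ("he goes to school", {"tm": -2.5, "lm": -6.0, "length": 4.0}),
]
# Hypothetical scaling factors standing in for MERT-tuned values.
weights = {"tm": 1.0, "lm": 0.5, "length": 0.1}

best_translation = rerank(nbest, weights)[0][0]
```

With these toy weights the second hypothesis wins because its better language-model score outweighs its slightly worse translation-model score; changing the weights changes the ranking, which is what tuning the scaling factors exploits.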
Pages: 704 / +
Page count: 3