Recognition and Segmentation of English Long and Short Sentences Based on Machine Translation

Cited by: 2
Authors
Zhang, Tiehu [1]
Affiliations
[1] Xian Aeronaut Univ, Sch Foreign Languages, Xian, Shaanxi, Peoples R China
Keywords
Machine translation; long sentence; regular match; error-driven method;
DOI
10.3991/ijet.v15i101.10182
Chinese Library Classification
G40 [Education];
Discipline classification code
040101 ; 120403 ;
Abstract
With the advent of the information age, long sentences, which contain many words and have complex structures, have become increasingly common, and their translation has always been a focus of research in English-Chinese machine translation. In this study, 400 long sentences were randomly selected from the NTCIR-9 patent corpus to test the recognition and segmentation performance of the regular matching method and the error-driven method, and the translation accuracy was compared on the Baidu Online Translation Platform. The results showed that the regular matching method was effective in recognizing and segmenting long sentences but had many defects, whereas the error-driven method was more effective. The former increased the BLEU value of the text translated on the Baidu Online Translation Platform by 4.8% and the latter by 12.1%, which indicates that the error-driven method was more effective for machine translation.
Pages: 152-162
Number of pages: 11