Comma Analysis and Processing for Improving Translation Quality of Long Sentences in Rule-based English-Korean Machine Translation

被引:0
|
作者
Kim, Sung-Dong [1 ]
机构
[1] Hansung Univ, Sch Comp Engn, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
English-Korean Machine Translation; Comma Usage Analysis; Natural Language Processing; Rule-based; Machine Translation;
D O I
10.5220/0007310604740479
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Current English-Korean machine translation system cannot provide practical translation quality mainly due to the difficulties in long sentence parsing. Long sentences generally include commas, resulting in lots of different possible sentence structures. It is very difficult to accurately parse the long sentences that have commas. The roles of the commas in constructing sentences have to be identified and then the syntactic analysis should be performed according to the roles of the commas for accurate parsing of the long sentences. This paper presents the analysis results of the comma usages and the comma processing methods for each comma usage. And it also proposes the comma usage classification method using machine learning technique. In experiment, some improved translation results, by identifying comma usage and processing the commas, are also presented.
引用
收藏
页码:474 / 479
页数:6
相关论文
共 50 条
  • [41] Research on Machine Translation Method of English-Chinese Long Sentences Based on Fuzzy Semantic Optimization
    Dong, Zhaofeng
    [J]. MOBILE INFORMATION SYSTEMS, 2022, 2022
  • [42] Classification-Based Approach for Hybridizing Statistical and Rule-Based Machine Translation
    Park, Eun-Jin
    Kwon, Oh-Woog
    Kim, Kangil
    Kim, Young-Kil
    [J]. ETRI JOURNAL, 2015, 37 (03) : 541 - 550
  • [43] An Exploration of English Translation Methods for Long Sentences Combined with Modal Analysis from the Perspective of Translation Purpose
    Xue, Jiao
    [J]. Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)
  • [44] Improving the Quality of English-Bharati Braille Machine Translation Using Syntax Analysis
    Joshi, Nisheeth
    Katyayan, Pragya
    Pandey, Anupriya
    [J]. PROCEEDINGS OF THIRD DOCTORAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE, DOSCI 2022, 2023, 479 : 671 - 683
  • [45] Addressing Limited Vocabulary and Long Sentences Constraints in English–Arabic Neural Machine Translation
    Safae Berrichi
    Azzeddine Mazroui
    [J]. Arabian Journal for Science and Engineering, 2021, 46 : 8245 - 8259
  • [46] Improving neural machine translation for low-resource Indian languages using rule-based feature extraction
    Singh, Muskaan
    Kumar, Ravinder
    Chana, Inderveer
    [J]. NEURAL COMPUTING & APPLICATIONS, 2021, 33 (04): : 1103 - 1122
  • [47] Improving neural machine translation for low-resource Indian languages using rule-based feature extraction
    Muskaan Singh
    Ravinder Kumar
    Inderveer Chana
    [J]. Neural Computing and Applications, 2021, 33 : 1103 - 1122
  • [48] Chinese complex long sentences processing method for Chinese-Japanese machine translation
    Yin, Dapeng
    Jiang, Peilin
    Ren, Fuji
    Kuroiwa, Shingo
    [J]. PROCEEDINGS OF THE 2007 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (NLP-KE'07), 2007, : 170 - +
  • [49] Matxin, an open-source rule-based machine translation system for Basque
    Mayor, Aingeru
    Alegria, Inaki
    de Ilarraza, Arantza Diaz
    Labaka, Gorka
    Lersundi, Mikel
    Sarasola, Kepa
    [J]. MACHINE TRANSLATION, 2011, 25 (01) : 53 - 82
  • [50] Apertium: a free/open-source platform for rule-based machine translation
    Forcada, Mikel L.
    Ginesti-Rosell, Mireia
    Nordfalk, Jacob
    O'Regan, Jim
    Ortiz-Rojas, Sergio
    Antonio Perez-Ortiz, Juan
    Sanchez-Martinez, Felipe
    Ramirez-Sanchez, Gema
    Tyers, Francis M.
    [J]. MACHINE TRANSLATION, 2011, 25 (02) : 127 - 144