Comma Analysis and Processing for Improving Translation Quality of Long Sentences in Rule-based English-Korean Machine Translation

Cited by: 0
Authors
Kim, Sung-Dong [1]
Affiliations
[1] Hansung Univ, Sch Comp Engn, Seoul, South Korea
Funding
National Research Foundation, Singapore;
Keywords
English-Korean Machine Translation; Comma Usage Analysis; Natural Language Processing; Rule-based; Machine Translation;
DOI
10.5220/0007310604740479
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Current English-Korean machine translation systems cannot provide practical translation quality, mainly because long sentences are difficult to parse. Long sentences generally include commas, which give rise to many possible sentence structures, so accurately parsing them is very difficult. To parse long sentences accurately, the role each comma plays in the sentence must first be identified, and syntactic analysis must then be performed according to those roles. This paper presents an analysis of comma usages and a processing method for each usage. It also proposes a method for classifying comma usage with a machine learning technique. Experiments show some translation results that improve when comma usage is identified and the commas are processed accordingly.
Pages: 474-479
Number of pages: 6