Word Alignment for English-Turkish Language Pair

被引:0
|
作者
Cakmak, M. Talha [1 ]
Acar, Suleyman [1 ]
Eryigit, Gulsen [1 ]
机构
[1] Istanbul Tech Univ, Dept Comp Engn, TR-34469 Istanbul, Turkey
关键词
Word Alignment; Machine Translation; Turkish;
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
Word alignment is an important step for machine translation systems. Although the alignment performance between grammatically similar languages is reported to be very high in many studies, the case is not the same for language pairs from different language families. In this study, we are focusing on English-Turkish language pairs. Turkish is a highly agglutinative language with a very productive and rich morphology whereas English has a very poor morphology when compared to this language. As a result of this, one Turkish word is usually aligned with several English words. The traditional models which use word-level alignment approaches generally fail in such circumstances. In this study, we evaluate a Giza++ system by splitting the words into their morphological units (stem and suffixes) and compare the model with the traditional one. For the first time, we evaluate the performance of our aligner on gold standard parallel sentences rather than in a real machine translation system. Our approach reduced the alignment error rate by 40% relative. Finally, a new test corpus of 300 manually aligned sentences is released together with this study.
引用
收藏
页码:2177 / 2180
页数:4
相关论文
共 50 条
  • [41] An introduction to English language. Sound, word and sentence
    Bowen, R
    [J]. MODERNA SPRAK, 1997, 91 (02): : 230 - 231
  • [42] The Last Word The English Language: Opinions and Prejudices.
    Ilson, Robert
    [J]. INTERNATIONAL JOURNAL OF LEXICOGRAPHY, 2009, 22 (01) : 105 - 111
  • [43] Socio-linguistic Competence of Turkish and Turkish Cypriot English as a Foreign Language Learners in the English Speech Act of Apology
    Atamturk, Hakan
    Atamturk, Nurdan
    [J]. INTERNATIONAL JOURNAL OF EDUCATIONAL SCIENCES, 2016, 15 (1-2) : 170 - 177
  • [44] Normalizing German and English inflectional morphology to improve statistical word alignment
    Corston-Oliver, S
    Gamon, M
    [J]. MACHINE TRANSLATION: FROM REAL USERS TO RESEARCH, PROCEEDINGS, 2004, 3265 : 48 - 57
  • [45] An annotation scheme and Gold Standard for Dutch-English word alignment
    Macken, Lieve
    [J]. LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 3369 - 3374
  • [46] Research of English-Chinese alignment at word granularity on parallel corpora
    Xu Yang
    Wang Hou-feng
    Lue Xue-qiang
    [J]. 7TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE IN CONJUNCTION WITH 2ND IEEE/ACIS INTERNATIONAL WORKSHOP ON E-ACTIVITY, PROCEEDINGS, 2008, : 223 - +
  • [47] Improved Word Alignment System for Myanmar-English Machine Translation
    Han, Nway Nway
    Thida, Aye
    [J]. PROCEEDINGS OF 2020 23RD CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (ORIENTAL-COCOSDA 2020), 2020, : 178 - 182
  • [48] Learning Word Alignment Models for Kazakh-English Machine Translation
    Kartbayev, Amandyk
    [J]. INTEGRATED UNCERTAINTY IN KNOWLEDGE MODELLING AND DECISION MAKING, IUKM 2015, 2015, 9376 : 326 - 335
  • [49] Word alignment of English-Chinese bilingual corpus based on chunks
    Le, S
    Youbing, J
    Du, L
    Wang, SF
    [J]. PROCEEDINGS OF THE 2000 JOINT SIGDAT CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND VERY LARGE CORPORA, 2000, : 110 - 116
  • [50] A comparative study on evaluation of Turkish and English foreign language textbooks
    Tosun, Sezen
    [J]. AKDENIZ LANGUAGE STUDIES CONFERENCE, 2013, 70 : 1374 - 1380