Word Alignment for English-Turkish Language Pair

被引：0

作者：

Cakmak, M. Talha ^{[1
]}

Acar, Suleyman ^{[1
]}

Eryigit, Gulsen ^{[1
]}

机构：

[1] Istanbul Tech Univ, Dept Comp Engn, TR-34469 Istanbul, Turkey

来源：

LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2012年

关键词：

Word Alignment; Machine Translation; Turkish;

D O I：

暂无

中图分类号：

H0 [语言学];

学科分类号：

030303 ; 0501 ; 050102 ;

摘要：

Word alignment is an important step for machine translation systems. Although the alignment performance between grammatically similar languages is reported to be very high in many studies, the case is not the same for language pairs from different language families. In this study, we are focusing on English-Turkish language pairs. Turkish is a highly agglutinative language with a very productive and rich morphology whereas English has a very poor morphology when compared to this language. As a result of this, one Turkish word is usually aligned with several English words. The traditional models which use word-level alignment approaches generally fail in such circumstances. In this study, we evaluate a Giza++ system by splitting the words into their morphological units (stem and suffixes) and compare the model with the traditional one. For the first time, we evaluate the performance of our aligner on gold standard parallel sentences rather than in a real machine translation system. Our approach reduced the alignment error rate by 40% relative. Finally, a new test corpus of 300 manually aligned sentences is released together with this study.

引用

页码：2177 / 2180

页数：4

共 50 条

[1] STRATEGIES AND ERRORS IN SIMULTANEOUS INTERPRETING: A STUDENT-ORIENTED EXPERIMENT IN ENGLISH-TURKISH LANGUAGE PAIR
Bozok, Nazligul
Kincal, Seyda
[J]. CURRENT TRENDS IN TRANSLATION TEACHING AND LEARNING E, 2022, 9 : 32 - 75
[2] The English-Turkish Conflict of Mosul
Von Elbe, Joachim
[J]. KURDISH STUDIES, 2018, 6 (02) : 217 - 241
[3] A LIST OF ENGLISH-TURKISH COGNATES AND FALSE-COGNATES
Uzun, Levent
Salihoglu, Umut M.
[J]. POZNAN STUDIES IN CONTEMPORARY LINGUISTICS, 2021, 57 (02): : 325 - 327
[4] Evaluating the English-Turkish parallel treebank for machine translation
Gorgun, Onur
Yildiz, Olcay Taner
[J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2022, 30 (01) : 184 - 199
[5] Bitext alignment for the English-Ukrainian language pair
Paliy, Zoryana
Romanyuk, Andriy
[J]. EXPERIENCE OF DESIGNING AND APPLICATION OF CAD SYSTEMS IN MICROELECTRONICS: PROCEEDINGS OF THE XTH INTERNATIONAL CONFERENCE CADSM 2009, 2009, : 548 - +
[6] English-Turkish Literary Translation Through Human-Machine Interaction
Sahin, Mehmet
Gurses, Sabri
[J]. TRADUMATICA-TRADUCCIO I TECNOLOGIES DE LA INFORMACIO I LA COMUNICACIO, 2021, (19): : 179 - 203
[7] Syntax-pragmatic and morphology-pragmatic interfaces in sequential bilingual language acquisition: The case of Russia-Turkish and English-Turkish bilingual children
Antonova-Unlu, Elena
[J]. INTERNATIONAL JOURNAL OF BILINGUALISM, 2019, 23 (05) : 1137 - 1158
[8] Morpho-syntactic properties of simultaneous bilingualism: Evidence from bilingual English-Turkish
Haznedar, Belma
[J]. INTERNATIONAL JOURNAL OF BILINGUALISM, 2019, 23 (04) : 793 - 803
[9] ENGLISH-TURKISH COGNATES AND FALSE COGNATES: COMPILING A CORPUS AND TESTING HOW THEY ARE TRANSLATED BY COMPUTER PROGRAMS
Uzun, Levent
Salihoglu, Umut M.
[J]. POZNAN STUDIES IN CONTEMPORARY LINGUISTICS, 2009, 45 (04): : 569 - 593
[10] REDHOUSE ENGLISH-TURKISH DICTIONARY - AVERY,R, BEZMEZ,S, EDMONDS,AG, YAYALI,M
KELLY, JM
[J]. JOURNAL OF THE AMERICAN ORIENTAL SOCIETY, 1976, 96 (01) : 151 - 152

← 1 2 3 4 5 →