Rewriting Turkish texts written in English alphabet using Turkish alphabet

被引:0
|
作者
Okur, Burak Cagri [1 ]
Takci, Hidayet [2 ]
Akgul, Yusuf Sinan [3 ]
机构
[1] TUBITAK BILGEM, Bilisim & Bilgi Guvenligi Ileri Teknol Arastirma, TR-41470 Kocaeli, Turkey
[2] Cumhuriyet Univ, Dept Comp Engn, Sivas, Turkey
[3] Dept Comp Engn, Comp Vis Lab, Kocaeli, Turkey
关键词
Natural Language Processing; Text Mining; Word Sense Disambiguation; Machine Learning;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Turkish texts written by English characters are easily comprehended by people, although performing this process by machines is still one of the unsolved Word Sense Disambiguation problems. Rewriting texts in English characters using Turkish characters is a natural language processing problem special to Turkish. Choosing the right Turkish word among different alternatives requires consideration of the text semantically. In this study, the effect of examination of the text either sentence or whole text based, on the right word determination is investigated. Performance of machine learning methods and statistical methods in right word determination is examined. The study is tested on randomly selected news texts. It is shown that examination of the text as a whole provides more information compared to sentence based methods and machine learning methods provides better results compared to statistical studies.
引用
收藏
页数:4
相关论文
共 50 条
  • [21] Recognition of Two-Handed Posture Finger Turkish Sign Language Alphabet
    Katilmis, Zekeriya
    Karakuzu, Cihan
    [J]. 2020 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2020, : 181 - 186
  • [22] Detection of the Turkish Sign Language Alphabet with Strain Sensor Based Data Glove
    Kaya, Fatih
    Tuncer, Ahmet Furkan
    Yildiz, Solen Kumbay
    [J]. 2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [23] The alphabet of emotions as written in cardiac changes
    Sabelli, H
    Patel, M
    CarlsonSabelli, L
    Sugerman, A
    [J]. INTERNATIONAL JOURNAL OF PSYCHOLOGY, 1996, 31 (3-4) : 4161 - 4161
  • [24] WRITTEN MIXE AND THE MIRAGE OF THE GOOD ALPHABET
    Sagi-Vela Gonzalez, Ana
    [J]. REVISTA DE LLENGUA I DRET-JOURNAL OF LANGUAGE AND LAW, 2019, (71) : 146 - 157
  • [25] An analysis of written errors of Turkish adult learners of English
    Kirkgoz, Yasemin
    [J]. INNOVATION AND CREATIVITY IN EDUCATION, 2010, 2 (02): : 4352 - 4358
  • [26] The Effectiveness of Homogenous Ensemble Classifiers for Turkish and English Texts
    Kilimci, Zeynep Hilal
    Akyokus, Selim
    Omurca, Sevinc Ilhan
    [J]. PROCEEDINGS OF THE 2016 INTERNATIONAL SYMPOSIUM ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (INISTA), 2016,
  • [27] Frame Markers in Master Thesis Abstracts Written in English and Turkish
    Atasever Belli, Serap
    [J]. CUKUROVA UNIVERSITY FACULTY OF EDUCATION JOURNAL, 2019, 48 (02): : 994 - 1011
  • [28] Chunked Texts in Reading Class: The Case of Turkish Learners of English
    Kiroglu, Kasim
    Demirel, Melek
    [J]. PAMUKKALE UNIVERSITESI EGITIM FAKULTESI DERGISI-PAMUKKALE UNIVERSITY JOURNAL OF EDUCATION, 2012, (32): : 65 - 76
  • [29] Aligning Turkish and English parallel texts for statistical machine translation
    El-Kahlout, ID
    Oflazer, K
    [J]. COMPUTER AND INFORMATION SCIENCES - ISCIS 2005, PROCEEDINGS, 2005, 3733 : 616 - 625
  • [30] Decoding English Alphabet Letters Using EEG Phase Information
    Wang, YiYan
    Wang, Pingxiao
    Yu, Yuguo
    [J]. FRONTIERS IN NEUROSCIENCE, 2018, 12