Turkish Treebank as a Gold Standard for Morphological Disambiguation and Its Influence on Parsing

被引:0
|
作者
Cetinoglu, Oezlem [1 ]
机构
[1] Univ Stuttgart, IMS, Stuttgart, Germany
关键词
Morphological Disambiguation; Parsing; Turkish;
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
So far predicted scenarios for Turkish dependency parsing have used a morphological disambiguator that is trained on the data distributed with the tool(Sak et al., 2008). Although models trained on this data have high accuracy scores on the test and development data of the same set, the accuracy drastically drops when the model is used in the preprocessing of Turkish Treebank parsing experiments. We propose to use the Turkish Treebank(Oflazer et al., 2003) as a morphological resource to overcome this problem and convert the treebank to the morphological disambiguator's format. The experimental results show that we achieve improvements in disambiguating the Turkish Treebank and the results also carry over to parsing. With the help of better morphological analysis, we present the best labelled dependency parsing scores to date on Turkish.
引用
收藏
页码:3360 / 3365
页数:6
相关论文
共 50 条
  • [1] A Gold Standard Dependency Treebank for Turkish
    Kayadelen, Tolga
    Ozturel, Adnan
    Bohnet, Bernd
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 5156 - 5163
  • [2] The Impact of Automatic Morphological Analysis & Disambiguation on Dependency Parsing of Turkish
    Eryigit, Gulsen
    [J]. LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 1960 - 1965
  • [3] Parsing Modern Standard Arabic using Treebank Resources
    Al-Emran, Mostafa
    Zaza, Sarween
    Shaalan, Khaled
    [J]. 2015 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY RESEARCH (ICTRC), 2015, : 80 - 83
  • [4] Prosodic Disambiguation of Morphological Ambiguities in Turkish
    Nazik Dinçtopal Deniz
    [J]. Journal of Psycholinguistic Research, 2020, 49 : 1083 - 1111
  • [5] Prosodic Disambiguation of Morphological Ambiguities in Turkish
    Deniz, Nazik Dinctopal
    [J]. JOURNAL OF PSYCHOLINGUISTIC RESEARCH, 2020, 49 (06) : 1083 - 1111
  • [6] A Novel Approach to Morphological Disambiguation for Turkish
    Gorgun, Onur
    Yildiz, Olcay Taner
    [J]. COMPUTER AND INFORMATION SCIENCES II, 2012, : 77 - 83
  • [7] Comparing the Influence of Different Treebank Annotations on Dependency Parsing
    Bosco, C.
    Montemagni, S.
    Mazzei, A.
    Lombardo, V.
    Dell'Orletta, F.
    Lenci, A.
    Lesmo, L.
    Attardi, G.
    Simi, M.
    Lavelli, A.
    Hall, J.
    Nilsson, J.
    Nivre, J.
    [J]. LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 1794 - 1801
  • [8] Morphological disambiguation of Turkish text with perceptron algorithm
    Sak, Hasim
    Gungor, Tunga
    Saraclar, Murat
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2007, 4394 : 107 - +
  • [9] Resources for Turkish dependency parsing: introducing the BOUN Treebank and the BoAT annotation tool
    Utku Türk
    Furkan Atmaca
    Şaziye Betül Özateş
    Gözde Berk
    Seyyit Talha Bedir
    Abdullatif Köksal
    Balkız Öztürk Başaran
    Tunga Güngör
    Arzucan Özgür
    [J]. Language Resources and Evaluation, 2022, 56 : 259 - 307
  • [10] Resources for Turkish dependency parsing: introducing the BOUN Treebank and the BoAT annotation tool
    Turk, Utku
    Atmaca, Furkan
    Ozates, Saziye Betul
    Berk, Gozde
    Bedir, Seyyit Talha
    Koksal, Abdullatif
    Basaran, Balkiz Ozturk
    Gungor, Tunga
    Ozgur, Arzucan
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2022, 56 (01) : 259 - 307