Resources for Turkish natural language processing

Cited by: 6
Authors
Coltekin, Cagri [1]
Dogruoz, A. Seza [2]
Cetinoglu, Ozlem [3]
Affiliations
[1] Univ Tubingen, Tubingen, Germany
[2] Univ Ghent, Ghent, Belgium
[3] Univ Stuttgart, Stuttgart, Germany
Keywords
Turkish; Corpora; Lexical resources; NLP; Linguistics; TEXT; DATASET; LEXICON; CONSTRUCTION; RECOGNITION; MORPHOLOGY; BENCHMARK; CORPORA; AUTHOR; TOOLS
DOI
10.1007/s10579-022-09605-4
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
This paper presents a comprehensive survey of corpora and lexical resources available for Turkish. We review a broad range of resources, focusing on the ones that are publicly available. In addition to providing information about the available linguistic resources, we present a set of recommendations, and identify gaps in the data available for conducting research and building applications in Turkish Linguistics and Natural Language Processing.
Pages: 449-488
Number of pages: 40
Related papers
  • [1] Resources for Turkish natural language processing: A critical survey
    Çağrı Çöltekin
    A. Seza Doğruöz
    Özlem Çetinoğlu
    Language Resources and Evaluation, 2023, 57 : 449 - 488
  • [2] TULAP - An Accessible and Sustainable Platform for Turkish Natural Language Processing Resources
    Uskudarli, Susan
    Sen, Muhammet
    Akkurt, Furkan
    Gurbuz, Merve
    Gungor, Onur
    Ozgur, Arzucan
    Gungor, Tunga
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 219 - 227
  • [3] A natural language processing infrastructure for Turkish
    Department of Computer Engineering, Boğaziçi University, Bebek, İstanbul, Turkey
    Unknown
    1600, (2004):
  • [4] ADVANCES IN CHINESE NATURAL LANGUAGE PROCESSING AND LANGUAGE RESOURCES
    Tao, Jianhua
    Zheng, Fang
    Li, Aijun
    Li, Ya
    ORIENTAL COCOSDA 2009 - INTERNATIONAL CONFERENCE ON SPEECH DATABASE AND ASSESSMENTS, 2009, : 13 - +
  • [5] Development of language resources for natural language processing in deep level
    Zhang, Yujie
    Kuroda, Kow
    Izumi, Emi
    Nozawa, Hajime
    Journal of the National Institute of Information and Communications Technology, 2007, 54 (03): : 53 - 61
  • [6] Resources for Turkish morphological processing
    Haşim Sak
    Tunga Güngör
    Murat Saraçlar
    Language Resources and Evaluation, 2011, 45 : 249 - 261
  • [7] Resources for Turkish morphological processing
    Sak, Hasim
    Gungor, Tunga
    Saraclar, Murat
    LANGUAGE RESOURCES AND EVALUATION, 2011, 45 (02) : 249 - 261
  • [8] A set of open-source tools for Turkish natural language processing
    Coltekin, Cagri
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 1079 - 1086
  • [9] Use of Natural Language Processing Methods in Teaching Turkish Proverbs and Idioms
    Erdagi, Erturk
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (07) : 1056 - 1063
  • [10] The contribution of lexical resources to natural language processing of CJK languages
    Halpern, Jack
    Chinese Spoken Language Processing, Proceedings, 2006, 4274 : 768 - 780