Resources for Turkish natural language processing

Cited by: 6
Authors
Coltekin, Cagri [1]
Dogruoz, A. Seza [2]
Cetinoglu, Ozlem [3]
Affiliations
[1] Univ Tubingen, Tubingen, Germany
[2] Univ Ghent, Ghent, Belgium
[3] Univ Stuttgart, Stuttgart, Germany
Keywords
Turkish; Corpora; Lexical resources; NLP; Linguistics; TEXT; DATASET; LEXICON; CONSTRUCTION; RECOGNITION; MORPHOLOGY; BENCHMARK; CORPORA; AUTHOR; TOOLS;
DOI
10.1007/s10579-022-09605-4
Chinese Library Classification
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
This paper presents a comprehensive survey of corpora and lexical resources available for Turkish. We review a broad range of resources, focusing on the ones that are publicly available. In addition to providing information about the available linguistic resources, we present a set of recommendations, and identify gaps in the data available for conducting research and building applications in Turkish Linguistics and Natural Language Processing.
Pages: 449-488
Page count: 40