A Detailed Analysis and Improvement of Feature-Based Named Entity Recognition for Turkish

被引:2
|
作者
Akdemir, Arda [1 ]
Gungor, Tunga [2 ]
机构
[1] Univ Tokyo, Tokyo, Japan
[2] Bogazici Univ, Istanbul, Turkey
来源
关键词
Named Entity Recognition; Conditional Random Fields; Dependency Parsing; Turkish; TEXT;
D O I
10.1007/978-3-030-26061-3_2
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Named Entity Recognition (NER) is an important task in Natural Language Processing (NLP) with a wide range of applications. Recently, word embedding based systems that does not rely on hand-crafted features dominate the task as in the case of many other sequence labeling tasks in NLP. However, we are also observing the emergence of hybrid models that make use of hand crafted features through data augmentation to improve performance of such NLP systems. Such hybrid systems are especially important for less resourced languages such as Turkish as deep learning models require a large dataset to achieve good performance. In this paper, we first give a detailed analysis of the effect of various syntactic, semantic and orthographic features on NER for Turkish. We also improve the performance of the best feature based models for Turkish using additional features. We believe that our results will guide the research in this area and help making use of the key features for data augmentation.
引用
收藏
页码:9 / 19
页数:11
相关论文
共 50 条
  • [1] Named entity recognition in Turkish: A comparative study with detailed error analysis
    Ozcelik, Oguzhan
    Toraman, Cagri
    INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (06)
  • [2] A Word Similarity Feature-based Semi-supervised Approach for Named Entity Recognition
    Wang, Ze
    Han, Zhongyang
    Zhao, Jun
    Wang, Wei
    Jin, Feng
    PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON SYSTEM SCIENCE AND ENGINEERING (ICSSE), 2019, : 136 - 141
  • [3] Named Entity Recognition on Turkish Tweets
    Kuecuek, Dilek
    Jacquet, Guillaume
    Steinberger, Ralf
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 450 - 454
  • [4] A Named Entity Recognition Dataset for Turkish
    Kucuk, Dilek
    Kucuk, Dogan
    Arici, Nursal
    2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 329 - 332
  • [5] Named Entity Recognition Model Based on Feature Fusion
    Sun, Zhen
    Li, Xinfu
    INFORMATION, 2023, 14 (02)
  • [6] Wikipedia-based Named Entity Recognition System for Turkish
    Kucuk, Dogan
    Arici, Nursal
    JOURNAL OF POLYTECHNIC-POLITEKNIK DERGISI, 2016, 19 (03): : 325 - 332
  • [7] Named Entity Recognition Experiments on Turkish Texts
    Kuecuek, Dilek
    Yazici, Adnan
    FLEXIBLE QUERY ANSWERING SYSTEMS: 8TH INTERNATIONAL CONFERENCE, FQAS 2009, 2009, 5822 : 524 - 535
  • [8] A Twitter Corpus for Named Entity Recognition in Turkish
    Carik, Buse
    Yeniterzi, Reyyan
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 4546 - 4551
  • [9] Named Entity Recognition in Turkish: Approaches and Issues
    Kucuk, Dogan
    Arici, Nursal
    Kucuk, Dilek
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, NLDB 2017, 2017, 10260 : 176 - 181
  • [10] Turkish Named Entity Recognition with Deep Learning
    Gunes, Asim
    Tantug, A. Cuneyd
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,