A Detailed Analysis and Improvement of Feature-Based Named Entity Recognition for Turkish

被引:2
|
作者
Akdemir, Arda [1 ]
Gungor, Tunga [2 ]
机构
[1] Univ Tokyo, Tokyo, Japan
[2] Bogazici Univ, Istanbul, Turkey
来源
关键词
Named Entity Recognition; Conditional Random Fields; Dependency Parsing; Turkish; TEXT;
D O I
10.1007/978-3-030-26061-3_2
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Named Entity Recognition (NER) is an important task in Natural Language Processing (NLP) with a wide range of applications. Recently, word embedding based systems that does not rely on hand-crafted features dominate the task as in the case of many other sequence labeling tasks in NLP. However, we are also observing the emergence of hybrid models that make use of hand crafted features through data augmentation to improve performance of such NLP systems. Such hybrid systems are especially important for less resourced languages such as Turkish as deep learning models require a large dataset to achieve good performance. In this paper, we first give a detailed analysis of the effect of various syntactic, semantic and orthographic features on NER for Turkish. We also improve the performance of the best feature based models for Turkish using additional features. We believe that our results will guide the research in this area and help making use of the key features for data augmentation.
引用
收藏
页码:9 / 19
页数:11
相关论文
共 50 条
  • [31] Improving feature extraction in named entity recognition based on maximum entropy model
    Jiang, Wei
    Guan, Yi
    Wang, Xiao-Long
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 2630 - +
  • [32] Named entity recognition in aerospace based on multi-feature fusion transformer
    Chu, Jing
    Liu, Yumeng
    Yue, Qi
    Zheng, Zixuan
    Han, Xiaokai
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [33] Feature selection techniques for maximum entropy based biomedical named entity recognition
    Saha, Sujan Kumar
    Sarkar, Sudeshna
    Mitra, Pabitra
    JOURNAL OF BIOMEDICAL INFORMATICS, 2009, 42 (05) : 905 - 911
  • [34] A probabilistic feature based Maximum Entropy model for Chinese named entity recognition
    Zhang, Suxiang
    Wang, Xiaojie
    Wen, Juan
    Qin, Ying
    Zhong, Yixin
    COMPUTER PROCESSING OF ORIENTAL LANGUAGES, PROCEEDINGS: BEYOND THE ORIENT: THE RESEARCH CHALLENGES AHEAD, 2006, 4285 : 189 - +
  • [35] Combinatorial feature embedding based on CNN and LSTM for biomedical named entity recognition
    Cho, Minsoo
    Ha, Jihwan
    Park, Chihyun
    Park, Sanghyun
    JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 103
  • [36] Automated Testing and Improvement of Named Entity Recognition Systems
    Yu, Boxi
    Hu, Yiyan
    Mang, Qiuyang
    Hu, Wenhan
    He, Pinjia
    PROCEEDINGS OF THE 31ST ACM JOINT MEETING EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, ESEC/FSE 2023, 2023, : 883 - 894
  • [37] Named Entity Recognition of Zhuang Language Based on the Feature of Initial Letter in Word
    Zhang, Weiquan
    Tang, Suqin
    He, Danni
    Li, Tinghui
    Pan, Changchun
    6TH INTERNATIONAL CONFERENCE ON INNOVATION IN ARTIFICIAL INTELLIGENCE, ICIAI2022, 2022, : 44 - 49
  • [38] Chinese Named Entity Recognition Based on BERT and Lightweight Feature Extraction Model
    Yang, Ruisen
    Gan, Yong
    Zhang, Chenfang
    INFORMATION, 2022, 13 (11)
  • [39] A novel feature integration and entity boundary detection for named entity recognition in cybersecurity
    Wang, Xiaodi
    Liu, Jiayong
    KNOWLEDGE-BASED SYSTEMS, 2023, 260
  • [40] Arabic Named Entity Recognition: A Feature-Driven Study
    Benajiba, Yassine
    Diab, Mona
    Rosso, Paolo
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (05): : 926 - 934