Hybrid Framework for Named Entity Recognition in Turkish Social Media

被引:0
|
作者
Yilmaz, Selim F. [1 ]
Balaban, Ismail [2 ,3 ]
Tekin, Selim F. [1 ,3 ]
Kozat, Suleyman S. [1 ,3 ]
机构
[1] Bilkent Univ, Elekt & Elekt Muhendisligi Bolumu, Ankara, Turkey
[2] Orta Dogu Tekn Univ, Coklu Ortam Bilisimi Bolumu, Ankara, Turkey
[3] DataBoss AS, Ankara, Turkey
关键词
Turkish named entity recognition; social media; informal texts;
D O I
10.1109/siu49456.2020.9302335
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Named Entity Recognition (NER) is a task of extracting entities such as person, location, and organization from texts. NER is more challenging in the social media texts compared to the formal texts due to the noisy language including grammatical errors and abbreviations. However, the problem of NER in the social media gained significant attention in the literature due to the amount of information flow in the social media. In this paper, we propose a comprehensive model for NER in Turkish texts of distinct social media domains, i.e. Twitter, Facebook, and Donanimhaber Forum. The model employs Conditional Random Fields followed by Bidirectional Long Short Term Memory. To overcome the challenges of social media texts, we incorporate word embeddings, character representations, morphology, domain information, pattern-matching, dictionary, part-of-speech, and casing based features to our model. We perform ablation studies to analyze the effect of these features. We demonstrate the success of our model for tagging Turkish social media texts through the largest Turkish NER database.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Dynamic Graph Construction Framework for Multimodal Named Entity Recognition in Social Media
    Mai, Weixing
    Zhang, Zhengxuan
    Li, Kuntao
    Xue, Yun
    Li, Fenghuan
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (02): : 2513 - 2522
  • [2] Named Entity Recognition in Turkish with Bayesian Learning and Hybrid Approaches
    RehaYavuz, Sermet
    Kucuk, Dilek
    Yazici, Adnan
    [J]. INFORMATION SCIENCES AND SYSTEMS 2013, 2013, 264 : 129 - 138
  • [3] Grounded Multimodal Named Entity Recognition on Social Media
    Yu, Jianfei
    Li, Ziyan
    Wang, Jieming
    Xia, Rui
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 9141 - 9154
  • [4] Named Entity Recognition on Turkish Tweets
    Kuecuek, Dilek
    Jacquet, Guillaume
    Steinberger, Ralf
    [J]. LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 450 - 454
  • [5] A Named Entity Recognition Dataset for Turkish
    Kucuk, Dilek
    Kucuk, Dogan
    Arici, Nursal
    [J]. 2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 329 - 332
  • [6] A hybrid named entity recognizer for Turkish
    Kucuk, Dilek
    Yazici, Adnan
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (03) : 2733 - 2742
  • [7] A Hybrid Deep Learning Framework for Bacterial Named Entity Recognition
    Li, Xusheng
    Wang, Xiaoyan
    Zhong, Ran
    Zhong, Duo
    He, Tingting
    Hu, Xiaohua
    Jiang, Xingpeng
    [J]. PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 428 - 433
  • [8] Named Entity Recognition for Code Mixed Social Media Sentences
    Sharma, Yashvardhan
    Bhargava, Rupal
    Tadikonda, Bapiraju Vamsi
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE SCIENCE AND COMPUTATIONAL INTELLIGENCE-IJSSCI, 2021, 13 (02): : 23 - 36
  • [9] Improving Named Entity Recognition for Social Media with Data Augmentation
    Liu, Wenzhong
    Cui, Xiaohui
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (09):
  • [10] Named Entity Recognition for Social Media Texts with Semantic Augmentation
    Nie, Yuyang
    Tian, Yuanhe
    Wan, Xiang
    Yan Song
    Bo Dai
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 1383 - 1391