Named Entity Recognition of Tunisian Arabic Using the Bi-LSTM-CRF Model

Cited by: 2
Authors
Mekki, Asma [1]
Zribi, Ines [2]
Ellouze, Mariem [1]
Belguith, Lamia Hadrich [1]
Affiliations
[1] Univ Sfax, ANLP Res Grp, MIRACL, Sfax, Tunisia
[2] Univ Monastir, ANLP Res Grp, MIRACL, Monastir, Tunisia
Keywords
Named entity recognition; Arabic dialect; Tunisian Arabic; Bi-LSTM-CRF
DOI
10.1142/S0218213023500628
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Named Entity Recognition (NER) is an NLP task that consists of recognizing and classifying entities in written text. Most Arabic NER studies address Modern Standard Arabic (MSA). However, dialectal Arabic text is increasingly present in social media, blogs, TV shows, etc. Handling named entities in dialectal Arabic is therefore rapidly becoming a necessity. In this paper, we collect and annotate a corpus and build an NER system for Tunisian Arabic (TA), named TUNER. To the best of our knowledge, this is the first study that uses the suggested method for this purpose. We adopt a hybrid method that combines a Bi-LSTM-CRF model with a rule-based method. The proposed TUNER system yields an F-measure of 91.43%, an improvement over comparable dialectal Arabic NER systems in related work.
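As an illustration of the F-measure reported in the abstract, the sketch below computes precision, recall, and F1 over entity spans. It assumes BIO tagging and exact-match, entity-level scoring; the abstract does not state which tagging scheme or scoring variant the authors used, and the function names and example tags are hypothetical.

```python
def extract_spans(tags):
    """Return a set of (start, end, label) spans from a BIO tag sequence.

    Assumption: BIO scheme. An I- tag that does not continue an open
    span of the same label is ignored (treated like O).
    """
    spans = set()
    start, label = None, None
    # A trailing "O" sentinel closes any span still open at the end.
    for i, tag in enumerate(list(tags) + ["O"]):
        inside = tag.startswith("I-") and label == tag[2:]
        if not inside:
            if label is not None:
                spans.add((start, i, label))
            start, label = None, None
            if tag.startswith("B-"):
                start, label = i, tag[2:]
    return spans


def f_measure(gold_tags, pred_tags):
    """Exact-match, entity-level precision, recall, and F1."""
    gold = extract_spans(gold_tags)
    pred = extract_spans(pred_tags)
    true_positives = len(gold & pred)
    precision = true_positives / len(pred) if pred else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical example: the prediction finds the person but misses the location.
gold = ["B-PER", "I-PER", "O", "B-LOC"]
pred = ["B-PER", "I-PER", "O", "O"]
precision, recall, f1 = f_measure(gold, pred)
```

Here one of the two gold entities is recovered and nothing spurious is predicted, so precision is 1.0, recall is 0.5, and F1 is their harmonic mean, 2/3.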
Pages: 17