A Survey of Arabic Named Entity Recognition and Classification

被引:0
|
作者
Shaalan, Khaled [1 ,2 ]
机构
[1] Univ Edinburgh, Sch Informat, Edinburgh EH8 9YL, Midlothian, Scotland
[2] British Univ Dubai, Dubai, U Arab Emirates
关键词
TEXT; SYSTEM;
D O I
10.1162/COLI_a_00178
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As more and more Arabic textual information becomes available through the Web in homes and businesses, via Internet and Intranet services, there is an urgent need for technologies and tools to process the relevant information. Named Entity Recognition (NER) is an Information Extraction task that has become an integral part of many other Natural Language Processing (NLP) tasks, such as Machine Translation and Information Retrieval. Arabic NER has begun to receive attention in recent years. The characteristics and peculiarities of Arabic, a member of the Semitic languages family, make dealing with NER a challenge. The performance of an Arabic NER component affects the overall performance of the NLP system in a positive manner. This article attempts to describe and detail the recent increase in interest and progress made in Arabic NER research. The importance of the NER task is demonstrated, the main characteristics of the Arabic language are highlighted, and the aspects of standardization in annotating named entities are illustrated. Moreover, the different Arabic linguistic resources are presented and the approaches used in Arabic NER field are explained. The features of common tools used in Arabic NER are described, and standard evaluation metrics are illustrated. In addition, a review of the state of the art of Arabic NER research is discussed. Finally, we present our conclusions. Throughout the presentation, illustrative examples are used for clarification.
引用
收藏
页码:469 / 510
页数:42
相关论文
共 50 条
  • [41] Improving Arabic Named Entity Recognition by Global Features and Triggers
    AlGahtani, Shabib
    McNaught, John
    KNOWLEDGE MANAGEMENT AND INNOVATION IN ADVANCING ECONOMIES-ANALYSES & SOLUTIONS, VOLS 1-3, 2009, : 1554 - 1560
  • [42] Deep learning for named entity recognition: a survey
    Hu Z.
    Hou W.
    Liu X.
    Neural Comput. Appl., 16 (8995-9022): : 8995 - 9022
  • [43] Named Entity Recognition: a Survey for the Portuguese Language
    Albuquerque, Hidelberg O.
    Souza, Ellen
    Gomes, Carlos
    Pinto, Matheus Henrique de C.
    Filho, Ricardo P. S.
    Costa, Rosimeire
    Lopes, Vinicius Teixeira de M.
    da Silva, Nadia F. F.
    de Carvalho, Andre C. P. L. F.
    Oliveira, Adriano L. I.
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2023, (70): : 171 - 185
  • [44] Survey of Chinese Named Entity Recognition Research
    Zhao, Jigui
    Qian, Yurong
    Wang, Kui
    Hou, Shuxiang
    Chen, Jiaying
    Computer Engineering and Applications, 2024, 60 (01) : 15 - 27
  • [45] Hybrid Feature Selection Approach for Arabic Named Entity Recognition
    Shahine, Miran
    Sakre, Mohamed
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, (CICLING 2016), PT I, 2018, 9623 : 452 - 464
  • [46] Integrating Semantic Features for Enhancing Arabic Named Entity Recognition
    Alsayadi, Hamzah A.
    ElKorany, Abeer M.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (03) : 128 - 136
  • [47] Transfer Learning for Arabic Named Entity Recognition With Deep Neural Networks
    Al-Smadi, Mohammad
    Al-Zboon, Saad
    Jararweh, Yaser
    Juola, Patrick
    IEEE ACCESS, 2020, 8 : 37736 - 37745
  • [48] Bidirectional Encoder-Decoder Model for Arabic Named Entity Recognition
    Ali, Mohammed N. A.
    Tan, Guanzheng
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2019, 44 (11) : 9693 - 9701
  • [49] Multiobjective Optimization for Biomedical Named Entity Recognition and Classification
    Ekbal, Asif
    Saha, Sriparna
    Sikdar, Utpal Kumar
    2ND INTERNATIONAL CONFERENCE ON COMMUNICATION, COMPUTING & SECURITY [ICCCS-2012], 2012, 1 : 206 - 213
  • [50] Arabic Named Entity Recognition: A Bidirectional GRU-CRF Approach
    Gridach, Mourad
    Haddad, Hatem
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2017), PT I, 2018, 10761 : 264 - 275