A Survey of Arabic Named Entity Recognition and Classification

被引:0
|
作者
Shaalan, Khaled [1 ,2 ]
机构
[1] Univ Edinburgh, Sch Informat, Edinburgh EH8 9YL, Midlothian, Scotland
[2] British Univ Dubai, Dubai, U Arab Emirates
关键词
TEXT; SYSTEM;
D O I
10.1162/COLI_a_00178
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As more and more Arabic textual information becomes available through the Web in homes and businesses, via Internet and Intranet services, there is an urgent need for technologies and tools to process the relevant information. Named Entity Recognition (NER) is an Information Extraction task that has become an integral part of many other Natural Language Processing (NLP) tasks, such as Machine Translation and Information Retrieval. Arabic NER has begun to receive attention in recent years. The characteristics and peculiarities of Arabic, a member of the Semitic languages family, make dealing with NER a challenge. The performance of an Arabic NER component affects the overall performance of the NLP system in a positive manner. This article attempts to describe and detail the recent increase in interest and progress made in Arabic NER research. The importance of the NER task is demonstrated, the main characteristics of the Arabic language are highlighted, and the aspects of standardization in annotating named entities are illustrated. Moreover, the different Arabic linguistic resources are presented and the approaches used in Arabic NER field are explained. The features of common tools used in Arabic NER are described, and standard evaluation metrics are illustrated. In addition, a review of the state of the art of Arabic NER research is discussed. Finally, we present our conclusions. Throughout the presentation, illustrative examples are used for clarification.
引用
收藏
页码:469 / 510
页数:42
相关论文
共 50 条
  • [1] Named entity recognition and classification for text in arabic
    Abuleil, S
    Evens, M
    INTELLIGENT AND ADAPTIVE SYSTEMS AND SOFTWARE ENGINEERING, 2004, : 89 - 94
  • [2] A survey of named entity recognition and classification
    Nadeau, David
    Sekine, Satoshi
    LINGUISTICAE INVESTIGATIONES, 2007, 30 (01): : 3 - 26
  • [3] Arabic Named Entity Recognition-A Survey and Analysis
    Dandashi, Amal
    Al Jaam, Jihad
    Foufou, Sebti
    INTELLIGENT INTERACTIVE MULTIMEDIA SYSTEMS AND SERVICES 2016, 2016, 55 : 83 - 96
  • [4] Arabic Named Entity Recognition
    Benajiba, Yassine
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2010, (44): : 151 - 152
  • [5] A recent survey of Arabic named entity recognition on social media
    Ali B.A.B.
    Mihi S.
    Bazi I.E.
    Laachfoubi N.
    Revue d'Intelligence Artificielle, 2020, 34 (02) : 125 - 135
  • [6] Named Entity Recognition and Classification in Historical Documents: A Survey
    Ehrmann, Maud
    Hamdi, Ahmed
    Pontes, Elvys Linhares
    Romanello, Matteo
    Doucet, Antoine
    ACM COMPUTING SURVEYS, 2024, 56 (02)
  • [7] A Contribution to Arabic Named Entity Recognition
    Koulali, Rim
    Meziane, Abdelouafi
    2012 TENTH INTERNATIONAL CONFERENCE ON ICT AND KNOWLEDGE ENGINEERING, 2012, : 46 - 52
  • [8] NERA: Named Entity Recognition for Arabic
    Shaalan, Khaled
    Raza, Hafsa
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2009, 60 (08): : 1652 - 1663
  • [9] A New Approach for Arabic Named Entity Recognition
    Karaa, Wahiba
    Slimani, Thabet
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2017, 14 (03) : 332 - 338
  • [10] RENA: A Named Entity Recognition System for Arabic
    El Bazi, Ismail
    Laachfoubi, Nabil
    TEXT, SPEECH, AND DIALOGUE (TSD 2015), 2015, 9302 : 396 - 404