A real time Named Entity Recognition system for Arabic text mining

Authors
Harith Al-Jumaily
Paloma Martínez
José L. Martínez-Fernández
Erik Van der Goot
Affiliations
[1] Carlos III University of Madrid, Computer Science Department
[2] DAEDALUS – Data, Decisions and Language S.A.
[3] EC Joint Research Centre
Keywords
Arabic language; Text mining; Named Entity Recognition; Event detection; Morphological analysis; Root extraction
Abstract
Arabic is the most widely spoken language in the Arab World, and most people in the Islamic World understand Classical Arabic because it is the language of the Qur’an. Although the number of Arabic-speaking Internet users (in the Middle East and in North and East Africa) has increased considerably over the last decade, systems for automatically analyzing Arabic digital resources are not as readily available as they are for English. This work therefore attempts to build a real-time Named Entity Recognition system that can be used in web applications to detect specific named entities and events in news written in Arabic. Because Arabic is a highly inflectional language, we try to minimize the impact of Arabic affixes on the quality of the pattern recognition model applied to identify named entities. These patterns are built by processing and integrating different gazetteers, from DBPedia (http://dbpedia.org/About, 2009) to GATE (A general architecture for text engineering, 2009) and ANERGazet (http://users.dsic.upv.es/grupos/nle/?file=kop4.php).
Pages: 543-563
Page count: 20