A real time Named Entity Recognition system for Arabic text mining

被引:0
|
作者
Harith Al-Jumaily
Paloma Martínez
José L. Martínez-Fernández
Erik Van der Goot
机构
[1] Carlos III University of Madrid,Computer Science Department
[2] DAEDALUS – Data,undefined
[3] Decisions and Language S.A.,undefined
[4] EC Joint Research Centre,undefined
来源
关键词
Arabic language; Text mining; Named Entity Recognition; Event detection; Morphological analysis; Root extraction;
D O I
暂无
中图分类号
学科分类号
摘要
Arabic is the most widely spoken language in the Arab World. Most people of the Islamic World understand the Classic Arabic language because it is the language of the Qur’an. Despite the fact that in the last decade the number of Arabic Internet users (Middle East and North and East of Africa) has increased considerably, systems to analyze Arabic digital resources automatically are not as easily available as they are for English. Therefore, in this work, an attempt is made to build a real time Named Entity Recognition system that can be used in web applications to detect the appearance of specific named entities and events in news written in Arabic. Arabic is a highly inflectional language, thus we will try to minimize the impact of Arabic affixes on the quality of the pattern recognition model applied to identify named entities. These patterns are built up by processing and integrating different gazetteers, from DBPedia (http://dbpedia.org/About, 2009) to GATE (A general architecture for text engineering, 2009) and ANERGazet (http://users.dsic.upv.es/grupos/nle/?file=kop4.php).
引用
收藏
页码:543 / 563
页数:20
相关论文
共 50 条
  • [1] A real time Named Entity Recognition system for Arabic text mining
    Al-Jumaily, Harith
    Martinez, Paloma
    Martinez-Fernandez, Jose L.
    Van der Goot, Erik
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2012, 46 (04) : 543 - 563
  • [2] Named entity recognition and classification for text in arabic
    Abuleil, S
    Evens, M
    [J]. INTELLIGENT AND ADAPTIVE SYSTEMS AND SOFTWARE ENGINEERING, 2004, : 89 - 94
  • [3] RENA: A Named Entity Recognition System for Arabic
    El Bazi, Ismail
    Laachfoubi, Nabil
    [J]. TEXT, SPEECH, AND DIALOGUE (TSD 2015), 2015, 9302 : 396 - 404
  • [4] Arabic Named Entity Recognition from diverse text types
    Shaalan, Khaled
    Raza, Hafsa
    [J]. ADVANCES IN NATURAL LANGUAGE PROCESSING, PROCEEDINGS, 2008, 5221 : 440 - 451
  • [5] Arabic Named Entity Recognition
    Benajiba, Yassine
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2010, (44): : 151 - 152
  • [6] Cross Domains Arabic Named Entity Recognition System
    Al-Ahmari, S. Saad
    Al-Johar, B. Abdullatif
    [J]. FIRST INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION, 2016, 0011
  • [7] A Contribution to Arabic Named Entity Recognition
    Koulali, Rim
    Meziane, Abdelouafi
    [J]. 2012 TENTH INTERNATIONAL CONFERENCE ON ICT AND KNOWLEDGE ENGINEERING, 2012, : 46 - 52
  • [8] NERA: Named Entity Recognition for Arabic
    Shaalan, Khaled
    Raza, Hafsa
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2009, 60 (08): : 1652 - 1663
  • [9] A Hybrid Named Entity Recognition System for Aviation Text
    Bharathi, A.
    Ramdin, Robin
    Babu, Preeja
    Menon, Vijay Krishna
    Jayaramakrishnan, Chandrasekhar
    Lakshmikumar, Sudarsan
    [J]. EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2024, 11 (01):
  • [10] A New Approach for Arabic Named Entity Recognition
    Karaa, Wahiba
    Slimani, Thabet
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2017, 14 (03) : 332 - 338