A real time Named Entity Recognition system for Arabic text mining

Cited by: 0
Authors
Harith Al-Jumaily
Paloma Martínez
José L. Martínez-Fernández
Erik Van der Goot
Affiliations
[1] Carlos III University of Madrid, Computer Science Department
[2] DAEDALUS – Data, Decisions and Language S.A.
[3] EC Joint Research Centre
Keywords
Arabic language; Text mining; Named Entity Recognition; Event detection; Morphological analysis; Root extraction
DOI
Not available
Abstract
Arabic is the most widely spoken language in the Arab world, and most people in the Islamic world understand Classical Arabic because it is the language of the Qur’an. Although the number of Arabic-speaking Internet users (in the Middle East and in North and East Africa) has increased considerably over the last decade, systems that automatically analyze Arabic digital resources are not as readily available as they are for English. This work therefore attempts to build a real-time Named Entity Recognition system that can be used in web applications to detect specific named entities and events in news written in Arabic. Because Arabic is a highly inflectional language, we try to minimize the impact of Arabic affixes on the quality of the pattern recognition model applied to identify named entities. The patterns are built by processing and integrating different gazetteers, from DBPedia (http://dbpedia.org/About, 2009) to GATE (A general architecture for text engineering, 2009) and ANERGazet (http://users.dsic.upv.es/grupos/nle/?file=kop4.php).
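The abstract's central idea, matching tokens against gazetteers while reducing the effect of Arabic affixes, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the prefix list, the three gazetteer entries, and the function names `candidate_forms` and `tag_entities` are hypothetical and are not taken from the paper, whose actual patterns and resources (DBPedia, GATE, ANERGazet) are not reproduced.

```python
# Hypothetical sketch: gazetteer lookup after light Arabic prefix stripping.
# The prefix list and gazetteer entries are illustrative assumptions,
# not the paper's actual resources or method.

# Common proclitics (conjunctions, prepositions, definite article),
# longest first so compound prefixes are tried before single letters.
PREFIXES = ["وال", "بال", "كال", "فال", "لل", "ال", "و", "ب", "ل", "ف", "ك"]

# Toy gazetteer mapping a surface form to an entity label.
GAZETTEER = {
    "مدريد": "LOCATION",
    "بغداد": "LOCATION",
    "اليونسكو": "ORGANIZATION",
}


def candidate_forms(token):
    """Yield the token itself, then forms with one known prefix removed."""
    yield token
    for prefix in PREFIXES:
        # Keep at least two characters so short words are not over-stripped.
        if token.startswith(prefix) and len(token) - len(prefix) >= 2:
            yield token[len(prefix):]


def tag_entities(text):
    """Return (surface token, matched entry, label) for each gazetteer hit."""
    hits = []
    for token in text.split():
        for form in candidate_forms(token):
            label = GAZETTEER.get(form)
            if label is not None:
                hits.append((token, form, label))
                break  # first match wins; the unstripped token is tried first
    return hits
```

Trying the unstripped token first keeps entries that legitimately begin with a prefix-like sequence (such as the definite article) from being damaged. A real system would also need suffix handling, normalization of letter variants, and multi-word entries, none of which this sketch attempts.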
Pages: 543-563
Page count: 20