ANERsys: An Arabic Named Entity Recognition system based on maximum entropy

被引:0
|
作者
Benajiba, Yassine [1 ]
Rosso, Paolo [1 ]
Ruiz, Jose Miguel Benedi [1 ]
机构
[1] Univ Politecn Valencia, DSIC, E-46071 Valencia, Spain
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The task of Named Entity Recognition (NER) allows to identify proper names as well as temporal and numeric expressions, in an open-domain text. NER systems proved to be very important for many tasks in Natural Language Processing (NLP) such as Information Retrieval and Question Answering tasks. Unfortunately, the main efforts to build reliable NER systems for the Arabic language have been made in a commercial frame and the approach used as well as the accuracy of the performance are not known. In this paper, we present ANERsys: a NER system built exclusively for Arabic texts based-on n-grams and maximum entropy. Furthermore, we present both the specific Arabic language dependent heuristic and the gazetteers we used to boost our system. We developed our own training and test corpora (ANERcorp) and gazetteers (ANERgazet) to train, evaluate and boost the implemented technique. A major effort was conducted to make sure all the experiments are carried out in the same framework of the CONLL 2002 conference. We carried out several experiments and the preliminary results showed that this approach allows to tackle successfully the problem of NER for the Arabic language.
引用
收藏
页码:143 / +
页数:3
相关论文
共 50 条
  • [1] Method of Chinese Named Entity Recognition Based on Maximum Entropy Model
    Ning Hui
    Yang Hua
    Tan Ya-zhou
    Wu Hao
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, VOLS 1-7, CONFERENCE PROCEEDINGS, 2009, : 2472 - 2477
  • [2] RENA: A Named Entity Recognition System for Arabic
    El Bazi, Ismail
    Laachfoubi, Nabil
    [J]. TEXT, SPEECH, AND DIALOGUE (TSD 2015), 2015, 9302 : 396 - 404
  • [3] Maximum Entropy Named Entity Recognition for Czech Language
    Konkol, Michal
    Konopik, Miloslav
    [J]. TEXT, SPEECH AND DIALOGUE, TSD 2011, 2011, 6836 : 203 - 210
  • [4] Hungarian named entity recognition with a maximum entropy approach
    Varga, Daniel
    Simon, Eszter
    [J]. ACTA CYBERNETICA, 2007, 18 (02): : 293 - 301
  • [5] Arabic Named Entity Recognition
    Benajiba, Yassine
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2010, (44): : 151 - 152
  • [6] Multiobjective Approach for Feature Selection in Maximum Entropy based Named Entity Recognition
    Ekbal, Asif
    Saha, Sriparna
    Hasanuzzaman, Md
    [J]. 22ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2010), PROCEEDINGS, VOL 1, 2010,
  • [7] Improving feature extraction in named entity recognition based on maximum entropy model
    Jiang, Wei
    Guan, Yi
    Wang, Xiao-Long
    [J]. PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 2630 - +
  • [8] Feature selection techniques for maximum entropy based biomedical named entity recognition
    Saha, Sujan Kumar
    Sarkar, Sudeshna
    Mitra, Pabitra
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2009, 42 (05) : 905 - 911
  • [9] A probabilistic feature based Maximum Entropy model for Chinese named entity recognition
    Zhang, Suxiang
    Wang, Xiaojie
    Wen, Juan
    Qin, Ying
    Zhong, Yixin
    [J]. COMPUTER PROCESSING OF ORIENTAL LANGUAGES, PROCEEDINGS: BEYOND THE ORIENT: THE RESEARCH CHALLENGES AHEAD, 2006, 4285 : 189 - +
  • [10] Cross Domains Arabic Named Entity Recognition System
    Al-Ahmari, S. Saad
    Al-Johar, B. Abdullatif
    [J]. FIRST INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION, 2016, 0011