Arabic Named Entity Recognition on Social Media based on feature selection techniques using SVM-RFE

Authors
Ali, Brahim Ait Ben [1]
Mihi, Soukaina [1]
Bazi, Ismail El [2]
Laachfoubi, Nahil [1]
Affiliations
[1] Hassan First Univ Settat, Fac Sci & Tech, IR2M Lab, Settat, Morocco
[2] Sultan Moulay Slimane Univ, Natl Sch Business & Management, Beni Mellal, Morocco
Keywords
Named entity recognition; Natural Language Processing (NLP); Feature selection; Support Vector Machine; Recursive Feature Elimination; Arabic language; Social Media
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
With the massive expansion of information on social media, demand for information retrieval techniques is high. Recognizing named entities (NEs) such as persons, locations, and organizations has emerged as one of the main tasks in natural language processing. Using the entire feature set is often time-consuming and can also degrade performance, and when the number of features is large it is difficult to identify the subset relevant to a given task. In this paper, we apply feature selection based on support vector machine recursive feature elimination (SVM-RFE) to find an optimized feature set. The optimized feature set is then used to identify and classify NEs with a Support Vector Machine (SVM). The proposed method is evaluated on Darwish's dataset, a publicly available benchmark for Arabic NER on social media. Experimental results show that feature selection improves performance and that the proposed method outperforms state-of-the-art systems.
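The two-stage idea in the abstract, ranking features with a linear SVM and recursively eliminating the weakest one, can be sketched on toy data. This is a minimal illustration only, not the authors' implementation: the abstract does not state the kernel, the feature set, or how multiple entity classes are handled, so the binary labels, the batch subgradient trainer, the hyperparameters, and all names (`train_linear_svm`, `svm_rfe`, `NAMES`) are assumptions made for the example.

```python
# Toy SVM-RFE sketch in pure Python (all names and data are hypothetical).

def train_linear_svm(X, y, lam=0.1, lr=0.1, steps=200):
    """Fit weights by batch subgradient descent on the L2-regularized
    hinge loss. Labels must be +1 or -1; no bias term is used."""
    n, d = len(X), len(X[0])
    w = [0.0] * d
    for _ in range(steps):
        grad = [lam * wj for wj in w]  # gradient of the regularizer
        for xi, yi in zip(X, y):
            margin = yi * sum(wj * xj for wj, xj in zip(w, xi))
            if margin < 1:  # sample violates the margin
                for j in range(d):
                    grad[j] -= yi * xi[j] / n
        w = [wj - lr * gj for wj, gj in zip(w, grad)]
    return w


def svm_rfe(X, y, n_keep):
    """Repeatedly retrain on the surviving features and drop the one
    with the smallest squared weight until n_keep features remain."""
    active = list(range(len(X[0])))
    eliminated = []
    while len(active) > n_keep:
        X_sub = [[row[j] for j in active] for row in X]
        w = train_linear_svm(X_sub, y)
        worst = min(range(len(active)), key=lambda k: w[k] ** 2)
        eliminated.append(active.pop(worst))
    return active, eliminated


# Toy data: two columns follow the label, two are uncorrelated with it.
NAMES = ["signal_a", "signal_b", "noise_a", "noise_b"]
y = [1, 1, 1, 1, -1, -1, -1, -1]
noise_a = [1, -1, 1, -1, 1, -1, 1, -1]
noise_b = [1, 1, -1, -1, 1, 1, -1, -1]
X = [[2.0 * yi, 1.5 * yi, float(a), float(b)]
     for yi, a, b in zip(y, noise_a, noise_b)]

kept, eliminated = svm_rfe(X, y, n_keep=2)
print([NAMES[j] for j in kept])  # ['signal_a', 'signal_b']
```

In a real NER setting the surviving columns would then feed the final SVM classifier, and a multi-class scheme such as one-vs-rest would be needed for several entity types; the sketch covers only the ranking and elimination loop.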
Pages: 7