A Comparative Study of Feature Selection Methods for Dialectal Arabic Sentiment Classification Using Support Vector Machine

被引:0
|
作者
Al-Harbi, Omar [1 ]
机构
[1] Jazan Univ, Jizan, Saudi Arabia
关键词
Arabic sentiment analysis; Dialectal sentiment analysis; Opinion mining; Feature selection methods; Dimensionality; SVM;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unlike other languages, the Arabic language has a morphological complexity which makes the Arabic sentiment analysis is a challenging task. Moreover, the presence of the dialects in the Arabic texts have made the sentiment analysis task is more challenging, due to the absence of specific rules that govern the writing or speaking system. Generally, one of the problems of sentiment analysis is the high dimensionality of the feature vector. To resolve this problem, many feature selection methods have been proposed. In contrast to the dialectal Arabic language, these selection methods have been investigated widely for the English language. This work investigated the effect of feature selection methods and their combinations on dialectal Arabic sentiment classification. The feature selection methods are Information Gain (IG), Correlation, Support Vector Machine (SVM), Gini Index (GI), and Chi-Square. A number of experiments were carried out on dialectical Jordanian reviews with using an SVM classifier. Furthermore, the effect of different term weighting schemes, stemmers, stop words removal, and feature models on the performance were investigated. The experimental results showed that the best performance of the SVM classifier was obtained after the SVM and correlation feature selection methods had been combined with the uni-gram model.
引用
收藏
页码:167 / 176
页数:10
相关论文
共 50 条
  • [41] An Enhanced Hybrid Feature Selection Technique Using Term Frequency-Inverse Document Frequency and Support Vector Machine-Recursive Feature Elimination for Sentiment Classification
    Nafis, Nur Syafiqah Mohd
    Awang, Suryanti
    [J]. IEEE ACCESS, 2021, 9 : 52177 - 52192
  • [42] Feature selection in the Laplacian support vector machine
    Lee, Sangjun
    Park, Changyi
    Koo, Ja-Yong
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2011, 55 (01) : 567 - 577
  • [43] A Semisupervised Feature Selection with Support Vector Machine
    Dai, Kun
    Yu, Hong-Yi
    Li, Qing
    [J]. JOURNAL OF APPLIED MATHEMATICS, 2013,
  • [44] On the Probability of Feature Selection in Support Vector Classification
    Liu, Qunfeng
    Yao, Lan
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON SERVICE OPERATIONS AND LOGISTICS, AND INFORMATICS (SOLI), 2013, : 334 - 339
  • [45] Genetic Support Vector Classification and Feature Selection
    Mejia-Guevaara, Ivan
    Kuri-Morales, Angel
    [J]. PROCEEDINGS OF THE SPECIAL SESSION OF THE SEVENTH MEXICAN INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE - MICAI 2008, 2008, : 75 - +
  • [46] A Performance Comparison of Feature Selection Methods for Sentiment Classification
    Hung, Lai Po
    Alfred, Rayner
    Hijazi, Mohd Hanafi Ahmad
    [J]. COMPUTATIONAL SCIENCE AND TECHNOLOGY, ICCST 2017, 2018, 488 : 21 - 30
  • [47] A Comparative study of Classification techniques: Support vector Machine, Fuzzy Support vector Machine & Decision Trees
    Pandey, Priyank
    Jain, Amita
    [J]. PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 3620 - 3624
  • [48] Support Vector Machine Ensembles Using Feature-Subset Selection for Enhancing Microarray Data Classification
    Ahmed, Eman
    El Gayar, Neamat
    El Azab, Iman A.
    [J]. INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS & STATISTICS, 2012, 28 (04): : 1 - 11
  • [49] Tweet Sentiment Classification Using an Ensemble of Machine Learning Supervised Classifiers Employing Statistical Feature Selection Methods
    Devi, K. Lakshmi
    Subathra, P.
    Kumar, P. N.
    [J]. PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON FUZZY AND NEURO COMPUTING (FANCCO - 2015), 2015, 415 : 1 - 13
  • [50] Feature Selection and Classification for Urban Data Using Improved F-Score with Support Vector Machine
    Zemmoudj, Salah
    Kemmouche, Akila
    Chibani, Youcef
    [J]. 2014 6TH INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), 2014, : 371 - 375