Application of Rough Set-Based Feature Selection for Arabic Sentiment Analysis

被引:22
|
作者
Al-Radaideh, Qasem A. [1 ]
Al-Qudah, Ghufran Y. [1 ]
机构
[1] Yarmouk Univ, Dept Comp Informat Syst, Irbid, Jordan
关键词
Rough set theory; Reduct generation; Arabic sentiment analysis; Arabic text classification; Feature selection;
D O I
10.1007/s12559-017-9477-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment analysis is considered as one of the recent applications of text categorization that categories the emotions expressed in text as negative, positive, and natural. Rough set theory is a mathematical tool used to analyze uncertainty, incomplete information, and data reduction. Indiscernibility, reduct, and core are essential concepts in rough set theory that can be employed for data classification and knowledge reduction. This paper proposes to use the rough set-based methods for sentiment analysis to classify tweets that are written in the Arabic language. The paper investigates the application of the reduct concept of rough set theory as a feature selection method for sentiment analysis. This paper investigates four reduct computation techniques to generate the set of reducts. For classification purposes, two rule generation algorithms have been studied to build the rough set rule-based classifier. An Arabic data set of 4800 tweets is used in the experiments to validate the use of reduct computation for Arabic sentiment analysis. The results of the experiments showed that using rough set reducts techniques lead to different results and some of them can perform better than non-rough set classifier. The best classification accuracy rate was for rough set classifier using the full attribute weighting reduct generation algorithm which achieved an accuracy of 74%. The primary results indicate that using the rough set theory framework for sentiment analysis is an appealing option where it can enhance the overall accuracy and reduce the number of used terms for classification which in turn will lead to a faster classification process, especially with a large dataset.
引用
收藏
页码:436 / 445
页数:10
相关论文
共 50 条
  • [41] Application of rough set theory to feature selection for unsupervised clustering
    Questier, F
    Arnaut-Rollier, I
    Walczak, B
    Massart, DL
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2002, 63 (02) : 155 - 167
  • [42] Improved whale optimization algorithm for feature selection in Arabic sentiment analysis
    Mohammad Tubishat
    Mohammad A. M. Abushariah
    Norisma Idris
    Ibrahim Aljarah
    [J]. Applied Intelligence, 2019, 49 : 1688 - 1707
  • [43] Improved whale optimization algorithm for feature selection in Arabic sentiment analysis
    Tubishat, Mohammad
    Abushariah, Mohammad A. M.
    Idris, Norisma
    Aljarah, Ibrahim
    [J]. APPLIED INTELLIGENCE, 2019, 49 (05) : 1688 - 1707
  • [44] A rough set-based CBR approach for feature and document reduction in text categorization
    Li, Y
    Shiu, SCK
    Pal, SK
    Liu, JNK
    [J]. PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 2438 - 2443
  • [45] A rough set-based fuzzy clustering
    Zhao, YQ
    Zhou, XZ
    Tang, GZ
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2005, 3689 : 401 - 409
  • [46] A Genetic Algorithm Feature Selection Based Approach for Arabic Sentiment Classification
    Aliane, A. A.
    Aliane, H.
    Ziane, M.
    Bensaou, N.
    [J]. 2016 IEEE/ACS 13TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2016,
  • [47] A rough set approach to feature selection based on power set tree
    Chen, Yumin
    Miao, Duoqian
    Wang, Ruizhi
    Wu, Keshou
    [J]. KNOWLEDGE-BASED SYSTEMS, 2011, 24 (02) : 275 - 281
  • [48] Design of metaheuristic rough set-based feature selection and rule-based medical data classification model on MapReduce framework
    Bhukya, Hanumanthu
    Manchala, Sadanandam
    [J]. JOURNAL OF INTELLIGENT SYSTEMS, 2022, 31 (01) : 1002 - 1013
  • [49] Rough Set-Based Analysis of Characteristic Features for ANN Classifier
    Stanczyk, Urszula
    [J]. HYBRID ARTIFICIAL INTELLIGENCE SYSTEMS, PT 1, 2010, 6076 : 565 - 572
  • [50] Joint application of rough set-based feature reduction and Fuzzy LS-SVM classifier in motion classification
    Yan, Zhiguo
    Wang, Zhizhong
    Xie, Hongbo
    [J]. MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2008, 46 (06) : 519 - 527