Application of Rough Set-Based Feature Selection for Arabic Sentiment Analysis

被引:0
|
作者
Qasem A. Al-Radaideh
Ghufran Y. Al-Qudah
机构
[1] Yarmouk University,Department of Computer Information Systems
来源
Cognitive Computation | 2017年 / 9卷
关键词
Rough set theory; Reduct generation; Arabic sentiment analysis; Arabic text classification; Feature selection;
D O I
暂无
中图分类号
学科分类号
摘要
Sentiment analysis is considered as one of the recent applications of text categorization that categories the emotions expressed in text as negative, positive, and natural. Rough set theory is a mathematical tool used to analyze uncertainty, incomplete information, and data reduction. Indiscernibility, reduct, and core are essential concepts in rough set theory that can be employed for data classification and knowledge reduction. This paper proposes to use the rough set-based methods for sentiment analysis to classify tweets that are written in the Arabic language. The paper investigates the application of the reduct concept of rough set theory as a feature selection method for sentiment analysis. This paper investigates four reduct computation techniques to generate the set of reducts. For classification purposes, two rule generation algorithms have been studied to build the rough set rule-based classifier. An Arabic data set of 4800 tweets is used in the experiments to validate the use of reduct computation for Arabic sentiment analysis. The results of the experiments showed that using rough set reducts techniques lead to different results and some of them can perform better than non-rough set classifier. The best classification accuracy rate was for rough set classifier using the full attribute weighting reduct generation algorithm which achieved an accuracy of 74%. The primary results indicate that using the rough set theory framework for sentiment analysis is an appealing option where it can enhance the overall accuracy and reduce the number of used terms for classification which in turn will lead to a faster classification process, especially with a large dataset.
引用
收藏
页码:436 / 445
页数:9
相关论文
共 50 条
  • [41] A Rough Set Based Hybrid Method to Feature Selection
    Ming, He
    KAM: 2008 INTERNATIONAL SYMPOSIUM ON KNOWLEDGE ACQUISITION AND MODELING, PROCEEDINGS, 2008, : 585 - 588
  • [42] Feature selection based on rough set and information entropy
    Han, JC
    2005 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2005, : 153 - 158
  • [43] Application of rough set theory to feature selection for unsupervised clustering
    Questier, F
    Arnaut-Rollier, I
    Walczak, B
    Massart, DL
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2002, 63 (02) : 155 - 167
  • [44] Improved whale optimization algorithm for feature selection in Arabic sentiment analysis
    Mohammad Tubishat
    Mohammad A. M. Abushariah
    Norisma Idris
    Ibrahim Aljarah
    Applied Intelligence, 2019, 49 : 1688 - 1707
  • [45] Improved whale optimization algorithm for feature selection in Arabic sentiment analysis
    Tubishat, Mohammad
    Abushariah, Mohammad A. M.
    Idris, Norisma
    Aljarah, Ibrahim
    APPLIED INTELLIGENCE, 2019, 49 (05) : 1688 - 1707
  • [46] A rough set-based fuzzy clustering
    Zhao, YQ
    Zhou, XZ
    Tang, GZ
    INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2005, 3689 : 401 - 409
  • [47] A rough set-based CBR approach for feature and document reduction in text categorization
    Li, Y
    Shiu, SCK
    Pal, SK
    Liu, JNK
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 2438 - 2443
  • [48] A Genetic Algorithm Feature Selection Based Approach for Arabic Sentiment Classification
    Aliane, A. A.
    Aliane, H.
    Ziane, M.
    Bensaou, N.
    2016 IEEE/ACS 13TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2016,
  • [49] A rough set approach to feature selection based on power set tree
    Chen, Yumin
    Miao, Duoqian
    Wang, Ruizhi
    Wu, Keshou
    KNOWLEDGE-BASED SYSTEMS, 2011, 24 (02) : 275 - 281
  • [50] Design of metaheuristic rough set-based feature selection and rule-based medical data classification model on MapReduce framework
    Bhukya, Hanumanthu
    Manchala, Sadanandam
    JOURNAL OF INTELLIGENT SYSTEMS, 2022, 31 (01) : 1002 - 1013