Application of Rough Set-Based Feature Selection for Arabic Sentiment Analysis

被引:0
|
作者
Qasem A. Al-Radaideh
Ghufran Y. Al-Qudah
机构
[1] Yarmouk University,Department of Computer Information Systems
来源
Cognitive Computation | 2017年 / 9卷
关键词
Rough set theory; Reduct generation; Arabic sentiment analysis; Arabic text classification; Feature selection;
D O I
暂无
中图分类号
学科分类号
摘要
Sentiment analysis is considered as one of the recent applications of text categorization that categories the emotions expressed in text as negative, positive, and natural. Rough set theory is a mathematical tool used to analyze uncertainty, incomplete information, and data reduction. Indiscernibility, reduct, and core are essential concepts in rough set theory that can be employed for data classification and knowledge reduction. This paper proposes to use the rough set-based methods for sentiment analysis to classify tweets that are written in the Arabic language. The paper investigates the application of the reduct concept of rough set theory as a feature selection method for sentiment analysis. This paper investigates four reduct computation techniques to generate the set of reducts. For classification purposes, two rule generation algorithms have been studied to build the rough set rule-based classifier. An Arabic data set of 4800 tweets is used in the experiments to validate the use of reduct computation for Arabic sentiment analysis. The results of the experiments showed that using rough set reducts techniques lead to different results and some of them can perform better than non-rough set classifier. The best classification accuracy rate was for rough set classifier using the full attribute weighting reduct generation algorithm which achieved an accuracy of 74%. The primary results indicate that using the rough set theory framework for sentiment analysis is an appealing option where it can enhance the overall accuracy and reduce the number of used terms for classification which in turn will lead to a faster classification process, especially with a large dataset.
引用
收藏
页码:436 / 445
页数:9
相关论文
共 50 条
  • [31] Rough Set-Based Feature Weighted Kernels for Support Vector Machine
    Li, Xiangjun
    Rao, Fen
    Wang, Tinghua
    Qiu, Taorong
    JOURNAL OF COMPUTATIONAL AND THEORETICAL NANOSCIENCE, 2012, 9 (12) : 2250 - 2254
  • [32] Feature selection by ordered rough set based feature weighting
    Al-Radaideh, QA
    Sulaiman, MN
    Selamat, MH
    Ibrahim, HT
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2005, 3588 : 105 - 112
  • [33] Rough set-based SAR analysis: An inductive method
    Dong, Ying
    Xiang, Bingren
    Wang, Teng
    Liu, Hao
    Qu, Lingbo
    EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (07) : 5032 - 5039
  • [34] Rough set-based logics for multicriteria decision analysis
    Fan, Tuan-Fang
    Liu, Duen-Ren
    Tzeng, Gwo-Hshiung
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2007, 182 (01) : 340 - 355
  • [35] PSO-based feature selection and neighborhood rough set-based classification for BCI multiclass motor imagery task
    Kumar, S. Udhaya
    Inbarani, H. Hannah
    NEURAL COMPUTING & APPLICATIONS, 2017, 28 (11): : 3239 - 3258
  • [36] PSO-based feature selection and neighborhood rough set-based classification for BCI multiclass motor imagery task
    S. Udhaya Kumar
    H. Hannah Inbarani
    Neural Computing and Applications, 2017, 28 : 3239 - 3258
  • [37] ROUGH SET-BASED DESIGN RULE SELECTION FOR COLLABORATIVE ASSEMBLY DESIGN
    Kim, Kyoung-Yun
    Choi, Keunho
    DETC 2008: PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, VOL 1, PTS A AND B: 34TH DESIGN AUTOMATION CONFERENCE, 2009, : 53 - 59
  • [38] Intelligent temporal classification and fuzzy rough set-based feature selection algorithm for intrusion detection system in WSNs
    Selvakumar, K.
    Karuppiah, Marimuthu
    SaiRamesh, L.
    Islam, S. K. Hafizul
    Hassan, Mohammad Mehedi
    Fortino, Giancarlo
    Choo, Kim-Kwang Raymond
    INFORMATION SCIENCES, 2019, 497 : 77 - 90
  • [39] Degrees of conditional (in)dependence: A framework for approximate Bayesian networks and examples related to the rough set-based feature selection
    Slezak, Dominik
    INFORMATION SCIENCES, 2009, 179 (03) : 197 - 209
  • [40] Feature-Based Sentiment Analysis for Arabic Language
    Alhamad, Ghady
    Kurdy, Mohamad-Bassam
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (11) : 455 - 462