Application of Rough Set-Based Feature Selection for Arabic Sentiment Analysis

被引:0
|
作者
Qasem A. Al-Radaideh
Ghufran Y. Al-Qudah
机构
[1] Yarmouk University,Department of Computer Information Systems
来源
Cognitive Computation | 2017年 / 9卷
关键词
Rough set theory; Reduct generation; Arabic sentiment analysis; Arabic text classification; Feature selection;
D O I
暂无
中图分类号
学科分类号
摘要
Sentiment analysis is considered as one of the recent applications of text categorization that categories the emotions expressed in text as negative, positive, and natural. Rough set theory is a mathematical tool used to analyze uncertainty, incomplete information, and data reduction. Indiscernibility, reduct, and core are essential concepts in rough set theory that can be employed for data classification and knowledge reduction. This paper proposes to use the rough set-based methods for sentiment analysis to classify tweets that are written in the Arabic language. The paper investigates the application of the reduct concept of rough set theory as a feature selection method for sentiment analysis. This paper investigates four reduct computation techniques to generate the set of reducts. For classification purposes, two rule generation algorithms have been studied to build the rough set rule-based classifier. An Arabic data set of 4800 tweets is used in the experiments to validate the use of reduct computation for Arabic sentiment analysis. The results of the experiments showed that using rough set reducts techniques lead to different results and some of them can perform better than non-rough set classifier. The best classification accuracy rate was for rough set classifier using the full attribute weighting reduct generation algorithm which achieved an accuracy of 74%. The primary results indicate that using the rough set theory framework for sentiment analysis is an appealing option where it can enhance the overall accuracy and reduce the number of used terms for classification which in turn will lead to a faster classification process, especially with a large dataset.
引用
收藏
页码:436 / 445
页数:9
相关论文
共 50 条
  • [21] Random Reducts: A Monte Carlo Rough Set-based Method for Feature Selection in Large Datasets
    Kruczyk, Marcin
    Baltzer, Nicholas
    Mieczkowski, Jakub
    Draminski, Michal
    Koronacki, Jacek
    Komorowski, Jan
    FUNDAMENTA INFORMATICAE, 2013, 127 (1-4) : 273 - 288
  • [22] Rough Set Theory for Arabic Sentiment Classification
    Al-Radaideh, Qasem A.
    Twaiq, Laila M.
    2014 INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD (FICLOUD), 2014, : 559 - 564
  • [23] A rough set-based hybrid feature selection method for topic-specific text filtering
    Li, Q
    Li, JH
    Liu, GS
    Li, SH
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1464 - 1468
  • [24] Feature selection using rough set-based direct dependency calculation by avoiding the positive region
    Raza, Muhammad Summair
    Qamar, Usman
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2018, 92 : 175 - 197
  • [25] Noise-resistant fuzzy multineighbourhood rough set-based feature selection with label enhancement and its application for multilabel classification
    Sun, Lin
    Du, Wenjuan
    Xu, Jiucheng
    Chang, Baofang
    Applied Soft Computing, 2024, 167
  • [26] Rough Set Based Feature Selection: A Review
    Anaraki, Javad Rahimipour
    Eftekhari, Mahdi
    2013 5TH CONFERENCE ON INFORMATION AND KNOWLEDGE TECHNOLOGY (IKT), 2013, : 301 - 306
  • [27] An Analysis of Rough Set-Based Application Tools in the Decision-Making Process
    Mohamad, Masurah
    Selamat, Ali
    RECENT TRENDS IN INFORMATION AND COMMUNICATION TECHNOLOGY, 2018, 5 : 467 - 474
  • [28] Feature selection in mixed data: A method using a novel fuzzy rough set-based information entropy
    Zhang, Xiao
    Mei, Changlin
    Chen, Degang
    Li, Jinhai
    PATTERN RECOGNITION, 2016, 56 : 1 - 15
  • [29] A neural network classifier with rough set-based feature selection to classify multiclass IC package products
    Hung, Y. H.
    ADVANCED ENGINEERING INFORMATICS, 2009, 23 (03) : 348 - 357
  • [30] RSFD: A rough set-based feature discretization method for meteorological data
    Zeng, Lirong
    Chen, Qiong
    Huang, Mengxing
    FRONTIERS IN ENVIRONMENTAL SCIENCE, 2022, 10