Application of Rough Set-Based Feature Selection for Arabic Sentiment Analysis

被引:22
|
作者
Al-Radaideh, Qasem A. [1 ]
Al-Qudah, Ghufran Y. [1 ]
机构
[1] Yarmouk Univ, Dept Comp Informat Syst, Irbid, Jordan
关键词
Rough set theory; Reduct generation; Arabic sentiment analysis; Arabic text classification; Feature selection;
D O I
10.1007/s12559-017-9477-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment analysis is considered as one of the recent applications of text categorization that categories the emotions expressed in text as negative, positive, and natural. Rough set theory is a mathematical tool used to analyze uncertainty, incomplete information, and data reduction. Indiscernibility, reduct, and core are essential concepts in rough set theory that can be employed for data classification and knowledge reduction. This paper proposes to use the rough set-based methods for sentiment analysis to classify tweets that are written in the Arabic language. The paper investigates the application of the reduct concept of rough set theory as a feature selection method for sentiment analysis. This paper investigates four reduct computation techniques to generate the set of reducts. For classification purposes, two rule generation algorithms have been studied to build the rough set rule-based classifier. An Arabic data set of 4800 tweets is used in the experiments to validate the use of reduct computation for Arabic sentiment analysis. The results of the experiments showed that using rough set reducts techniques lead to different results and some of them can perform better than non-rough set classifier. The best classification accuracy rate was for rough set classifier using the full attribute weighting reduct generation algorithm which achieved an accuracy of 74%. The primary results indicate that using the rough set theory framework for sentiment analysis is an appealing option where it can enhance the overall accuracy and reduce the number of used terms for classification which in turn will lead to a faster classification process, especially with a large dataset.
引用
收藏
页码:436 / 445
页数:10
相关论文
共 50 条
  • [1] Application of Rough Set-Based Feature Selection for Arabic Sentiment Analysis
    Qasem A. Al-Radaideh
    Ghufran Y. Al-Qudah
    [J]. Cognitive Computation, 2017, 9 : 436 - 445
  • [2] Rough set-based feature selection method
    Zhan, YM
    Zeng, XY
    Sun, JC
    [J]. PROGRESS IN NATURAL SCIENCE-MATERIALS INTERNATIONAL, 2005, 15 (03) : 280 - 284
  • [3] Rough set-based feature selection method
    ZHAN Yanmei
    [J]. Progress in Natural Science:Materials International, 2005, (03) : 88 - 92
  • [4] A novel rough set-based feature selection method
    Xu, Yan
    Li, Jintao
    Wang, Bin
    Ding, Fan
    Sun, Chunming
    Wang, Xiaoleng
    [J]. RECENT ADVANCE OF CHINESE COMPUTING TECHNOLOGIES, 2007, : 226 - 231
  • [5] Rough set-based feature selection for weakly labeled data
    Campagner, Andrea
    Ciucci, Davide
    Huellermeier, Eyke
    [J]. INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2021, 136 : 150 - 167
  • [6] Rough set-based approach to feature selection in customer relationship management
    Tseng, Tzu-Liang
    Huang, Chun-Che
    [J]. OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 2007, 35 (04): : 365 - 383
  • [7] A noise resistant dependency measure for rough set-based feature selection
    Javidi, Mohammad Masoud
    Eskandari, Sadegh
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2017, 33 (03) : 1613 - 1626
  • [8] A Novel Neighborhood Rough Set-Based Feature Selection Method and Its Application to Biomarker Identification of Schizophrenia
    Xing, Ying
    Kochunov, Peter
    van Erp, Theo G. M.
    Ma, Tianzhou
    Calhoun, Vince D.
    Du, Yuhui
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (01) : 215 - 226
  • [9] Covering Rough Set-based Three-way Decision Feature Selection
    Ren, Mengyuan
    Qu, Yanpeng
    Deng, Ansheng
    [J]. PROCEEDINGS OF 2018 TENTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2018, : 782 - 787
  • [10] Covering rough set-based incremental feature selection for mixed decision system
    Yang, Yanyan
    Chen, Degang
    Zhang, Xiao
    Ji, Zhenyan
    [J]. SOFT COMPUTING, 2022, 26 (06) : 2651 - 2669