GAWA-A Feature Selection Method for Hybrid Sentiment Classification

被引:26
|
作者
Rasool, Abdur [1 ,2 ]
Tao, Ran [1 ]
Kamyab, Marjan [1 ]
Hayat, Shoaib [1 ]
机构
[1] Donghua Univ, Sch Comp Sci & Technol, Shanghai 201620, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen Key Lab High Performance Data Min, Shenzhen 518055, Peoples R China
来源
IEEE ACCESS | 2020年 / 8卷
关键词
Feature extraction; Genetic algorithms; Classification algorithms; Dictionaries; Sentiment analysis; Twitter; Machine learning algorithms; Feature selection; genetic algorithm; hybrid sentiment classification; machine learning algorithms; wrapper approach; OPTIMIZATION; ALGORITHM;
D O I
10.1109/ACCESS.2020.3030642
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sentiment analysis or opinion mining is the key to natural language processing for the extraction of useful information from the text documents of numerous sources. Several different techniques, i.e., simple rule-based to lexicon-based and more sophisticated machine learning algorithms, have been widely used with different classifiers to get the factual analysis of sentiment. However, lexicon-based sentiment classification is still suffering from low accuracies, mainly due to the deficiency of domain-oriented competitive dictionaries. Similarly, machine learning-based sentiment is also tackling the accuracy constraints because of feature ambiguity from social data. One of the best ways to deal with the accuracy issue is to select the best feature-set and reduce the volume of the feature. This paper proposes a method (namely, GAWA) for feature selection by utilizing the Wrapper Approaches (WA) to select the premier features and the Genetic Algorithm (GA) to reduce the size of the premier features. The novelty of this work is the modified fitness function of heuristic GA to compute the optimal features by reducing the redundancy for better accuracy. This work aims to present a comprehensive model of hybrid sentiment by using the proposed method, GAWA. It will be valued in developing a new approach for the selection of feature-set with a better accuracy level. The experiments revealed that these techniques could reduce the feature-set up-to 61.95% without negotiating the accuracy level. The new optimal feature sets enhanced the efficiency of the Naive Bayes algorithm up to 92%. This work is compared with the conventional method of feature selection and concluded the 11%; better accuracy than PCA and 8%; better than PSO. Furthermore, the results are compared with the literature work and found that the proposed method outperformed the previous research.
引用
收藏
页码:191850 / 191861
页数:12
相关论文
共 50 条
  • [1] Hybrid Filter–Wrapper Feature Selection Method for Sentiment Classification
    Gunjan Ansari
    Tanvir Ahmad
    Mohammad Najmud Doja
    [J]. Arabian Journal for Science and Engineering, 2019, 44 : 9191 - 9208
  • [2] A hybrid method of feature selection for Chinese text sentiment classification
    Wang, Suge
    Wei, Yingjie
    Li, Deyu
    Zhang, Wu
    Li, Wei
    [J]. FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 3, PROCEEDINGS, 2007, : 435 - +
  • [3] Hybrid Filter-Wrapper Feature Selection Method for Sentiment Classification
    Ansari, Gunjan
    Ahmad, Tanvir
    Doja, Mohammad Najmud
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2019, 44 (11) : 9191 - 9208
  • [4] Intelligent Hybrid Feature Selection for Textual Sentiment Classification
    Khan, Jawad
    Alam, Aftab
    Lee, Youngmoon
    [J]. IEEE ACCESS, 2021, 9 : 140590 - 140608
  • [5] Sentiment classification using hybrid feature selection and ensemble classifier
    Jain, Achin
    Jain, Vanita
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (02) : 659 - 668
  • [6] Hybrid Ensemble Learning With Feature Selection for Sentiment Classification in Social Media
    Sharma, Sanur
    Jain, Anurag
    [J]. INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2020, 10 (02) : 40 - 58
  • [7] A Hybrid Feature Selection Method for Classification Purposes
    Cateni, Silvia
    Colla, Valentina
    Vannucci, Marco
    [J]. UKSIM-AMSS EIGHTH EUROPEAN MODELLING SYMPOSIUM ON COMPUTER MODELLING AND SIMULATION (EMS 2014), 2014, : 39 - 44
  • [8] Feature selection based on genetic algorithm and hybrid model for sentiment polarity classification
    Kalaivani, P.
    Shunmuganathan, K. L.
    [J]. INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2016, 8 (04) : 315 - 329
  • [9] A Hybrid Feature Selection Method For Vietnamese Text Classification
    Nguyen Tri Hai
    Tuan Dinh Le
    Nguyen Hoang Nghia
    Vu Thanh Nguyen
    [J]. 2015 SEVENTH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE), 2015, : 91 - 96
  • [10] Optimizing feature selection techniques for sentiment classification
    Uribe, Diego
    [J]. 2011 IEEE ELECTRONICS, ROBOTICS AND AUTOMOTIVE MECHANICS CONFERENCE (CERMA 2011), 2011, : 103 - 107