Efficient feature selection techniques for sentiment analysis

被引:29
|
作者
Madasu, Avinash [1 ]
Elango, Sivasankar [2 ]
机构
[1] Samsung R&D Inst India, Bagmane Constellat Business Pk,Outer Ring Rd, Bengaluru 560037, Karnataka, India
[2] Natl Inst Technol, Dept Comp Sci, Tanjore Main Rd,Natl Highway 67,Near BHEL Trichy, Tiruchirappalli 620015, Tamil Nadu, India
关键词
Feature selection; Ensemble techniques; Sentiment analysis; Machine learning; CLASSIFICATION;
D O I
10.1007/s11042-019-08409-z
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sentiment analysis is a domain of study that focuses on identifying and classifying the ideas expressed in the form of text into positive, negative and neutral polarities. Feature selection is a crucial process in machine learning. In this paper, we aim to study the performance of different feature selection techniques for sentiment analysis. Term Frequency Inverse Document Frequency (TF-IDF) is used as the feature extraction technique for creating feature vocabulary. Various Feature Selection (FS) techniques are experimented to select the best set of features from feature vocabulary. The selected features are trained using different machine learning classifiers Logistic Regression (LR), Support Vector Machines (SVM), Decision Tree (DT) and Naive Bayes (NB). Ensemble techniques Bagging and Random Subspace are applied on classifiers to enhance the performance on sentiment analysis. We show that, when the best FS techniques are trained using ensemble methods achieve remarkable results on sentiment analysis. We also compare the performance of FS methods trained using Bagging, Random Subspace with varied neural network architectures. We show that FS techniques trained using ensemble classifiers outperform neural networks requiring significantly less training time and parameters thereby eliminating the need for extensive hyper-parameter tuning.
引用
收藏
页码:6313 / 6335
页数:23
相关论文
共 50 条
  • [1] Efficient feature selection techniques for sentiment analysis
    Avinash Madasu
    Sivasankar Elango
    [J]. Multimedia Tools and Applications, 2020, 79 : 6313 - 6335
  • [2] A review of feature selection techniques in sentiment analysis
    Ahmad, Siti Rohaidah
    Abu Bakar, Azuraliza
    Yaakub, Mohd Ridzwan
    [J]. INTELLIGENT DATA ANALYSIS, 2019, 23 (01) : 159 - 189
  • [3] Sentiment Analysis of IMDb Movie Reviews: A Comparative Analysis of Feature Selection and Feature Extraction Techniques
    Karak, Gahina
    Mishra, Shubham
    Bandyopadhyay, Arkadyuti
    Rohith, Pavirala Ranga Sai
    Rathore, Hemant
    [J]. HYBRID INTELLIGENT SYSTEMS, HIS 2021, 2022, 420 : 283 - 294
  • [4] Optimizing feature selection techniques for sentiment classification
    Uribe, Diego
    [J]. 2011 IEEE ELECTRONICS, ROBOTICS AND AUTOMOTIVE MECHANICS CONFERENCE (CERMA 2011), 2011, : 103 - 107
  • [5] Preprocessing and Feature Selection Approach for Efficient Sentiment Analysis on Product Reviews
    Ghosh, Monalisa
    Sanyal, Gautam
    [J]. PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON FRONTIERS IN INTELLIGENT COMPUTING: THEORY AND APPLICATIONS, FICTA 2016, VOL 1, 2017, 515 : 721 - 730
  • [6] Efficient Twitter Sentiment Analysis System with Feature Selection and Classifier Ensemble
    Fouad, Mohammed M.
    Gharib, Tarek F.
    Mashat, Abdulfattah S.
    [J]. INTERNATIONAL CONFERENCE ON ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS (AMLTA2018), 2018, 723 : 516 - 527
  • [7] Metaheuristic Algorithms for Feature Selection in Sentiment Analysis
    Ahmad, Siti Rohaidah
    Abu Bakar, Azuraliza
    Yaakub, Mohd Ridzwan
    [J]. 2015 SCIENCE AND INFORMATION CONFERENCE (SAI), 2015, : 222 - 226
  • [8] Evolutionary Multiobjective Feature Selection for Sentiment Analysis
    Deniz, Ayca
    Angin, Merih
    Angin, Pelin
    [J]. IEEE ACCESS, 2021, 9 : 142982 - 142996
  • [9] Firefly Algorithm for Feature Selection in Sentiment Analysis
    Kumar, Akshi
    Khorwal, Renu
    [J]. COMPUTATIONAL INTELLIGENCE IN DATA MINING, CIDM 2016, 2017, 556 : 693 - 703
  • [10] Comparison of Feature Selection Methods for Sentiment Analysis
    Nicholls, Chris
    Song, Fei
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2010, 6085 : 286 - 289