Evolutionary Multiobjective Feature Selection for Sentiment Analysis

被引:7
|
作者
Deniz, Ayca [1 ]
Angin, Merih [2 ]
Angin, Pelin [1 ]
机构
[1] Middle East Tech Univ, Dept Comp Engn, TR-06800 Ankara, Turkey
[2] Koc Univ, Dept Int Relat, TR-34450 Istanbul, Turkey
关键词
Feature extraction; Sentiment analysis; Task analysis; Machine learning; Analytical models; Measurement; Data mining; Binary classification; evolutionary computation; feature selection; multiobjective optimization; sentiment analysis; PARTICLE SWARM OPTIMIZATION; FEATURE SUBSET-SELECTION; CLASSIFICATION; ALGORITHM;
D O I
10.1109/ACCESS.2021.3118961
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sentiment analysis is one of the prominent research areas in data mining and knowledge discovery, which has proven to be an effective technique for monitoring public opinion. The big data era with a high volume of data generated by a variety of sources has provided enhanced opportunities for utilizing sentiment analysis in various domains. In order to take best advantage of the high volume of data for accurate sentiment analysis, it is essential to clean the data before the analysis, as irrelevant or redundant data will hinder extracting valuable information. In this paper, we propose a hybrid feature selection algorithm to improve the performance of sentiment analysis tasks. Our proposed sentiment analysis approach builds a binary classification model based on two feature selection techniques: an entropy-based metric and an evolutionary algorithm. We have performed comprehensive experiments in two different domains using a benchmark dataset, Stanford Sentiment Treebank, and a real-world dataset we have created based on World Health Organization (WHO) public speeches regarding COVID-19. The proposed feature selection model is shown to achieve significant performance improvements in both datasets, increasing classification accuracy for all utilized machine learning and text representation technique combinations. Moreover, it achieves over 70% reduction in feature size, which provides efficiency in computation time and space.
引用
收藏
页码:142982 / 142996
页数:15
相关论文
共 50 条
  • [21] Efficient feature selection techniques for sentiment analysis
    Avinash Madasu
    Sivasankar Elango
    [J]. Multimedia Tools and Applications, 2020, 79 : 6313 - 6335
  • [22] Efficient feature selection techniques for sentiment analysis
    Madasu, Avinash
    Elango, Sivasankar
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (9-10) : 6313 - 6335
  • [23] Feature selection and weighting methods in sentiment analysis
    O'Keefe, Tim
    Koprinska, Irena
    [J]. ADCS 2009 - Proceedings of the Fourteenth Australasian Document Computing Symposium, 2009, : 67 - 74
  • [24] Comparison of Feature Selection Methods for Sentiment Analysis
    Nicholls, Chris
    Song, Fei
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2010, 6085 : 286 - 289
  • [25] Firefly Algorithm for Feature Selection in Sentiment Analysis
    Kumar, Akshi
    Khorwal, Renu
    [J]. COMPUTATIONAL INTELLIGENCE IN DATA MINING, CIDM 2016, 2017, 556 : 693 - 703
  • [26] A review of feature selection techniques in sentiment analysis
    Ahmad, Siti Rohaidah
    Abu Bakar, Azuraliza
    Yaakub, Mohd Ridzwan
    [J]. INTELLIGENT DATA ANALYSIS, 2019, 23 (01) : 159 - 189
  • [27] A Review on Feature Selection Methods for Sentiment Analysis
    Hung, Lai Po
    Alfred, Rayner
    Hijazi, Mohd Hanafi Ahmad
    [J]. ADVANCED SCIENCE LETTERS, 2015, 21 (10) : 2952 - 2956
  • [28] A Multiobjective Evolutionary Nonlinear Ensemble Learning With Evolutionary Feature Selection for Silicon Prediction in Blast Furnace
    Wang, Xianpeng
    Hu, Tenghui
    Tang, Lixin
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (05) : 2080 - 2093
  • [29] A multiobjective evolutionary setting for feature selection and a commonality-based crossover operator
    Emmanouilidis, C
    Hunter, A
    MacIntyre, J
    [J]. PROCEEDINGS OF THE 2000 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1 AND 2, 2000, : 309 - 316
  • [30] A Feature-Based Performance Analysis in Evolutionary Multiobjective Optimization
    Liefooghe, Arnaud
    Verel, Sebastien
    Daolio, Fabio
    Aguirre, Hernan
    Tanaka, Kiyoshi
    [J]. EVOLUTIONARY MULTI-CRITERION OPTIMIZATION, PT II, 2015, 9019 : 95 - 109