Evolutionary Multiobjective Feature Selection for Sentiment Analysis

被引:7
|
作者
Deniz, Ayca [1 ]
Angin, Merih [2 ]
Angin, Pelin [1 ]
机构
[1] Middle East Tech Univ, Dept Comp Engn, TR-06800 Ankara, Turkey
[2] Koc Univ, Dept Int Relat, TR-34450 Istanbul, Turkey
关键词
Feature extraction; Sentiment analysis; Task analysis; Machine learning; Analytical models; Measurement; Data mining; Binary classification; evolutionary computation; feature selection; multiobjective optimization; sentiment analysis; PARTICLE SWARM OPTIMIZATION; FEATURE SUBSET-SELECTION; CLASSIFICATION; ALGORITHM;
D O I
10.1109/ACCESS.2021.3118961
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sentiment analysis is one of the prominent research areas in data mining and knowledge discovery, which has proven to be an effective technique for monitoring public opinion. The big data era with a high volume of data generated by a variety of sources has provided enhanced opportunities for utilizing sentiment analysis in various domains. In order to take best advantage of the high volume of data for accurate sentiment analysis, it is essential to clean the data before the analysis, as irrelevant or redundant data will hinder extracting valuable information. In this paper, we propose a hybrid feature selection algorithm to improve the performance of sentiment analysis tasks. Our proposed sentiment analysis approach builds a binary classification model based on two feature selection techniques: an entropy-based metric and an evolutionary algorithm. We have performed comprehensive experiments in two different domains using a benchmark dataset, Stanford Sentiment Treebank, and a real-world dataset we have created based on World Health Organization (WHO) public speeches regarding COVID-19. The proposed feature selection model is shown to achieve significant performance improvements in both datasets, increasing classification accuracy for all utilized machine learning and text representation technique combinations. Moreover, it achieves over 70% reduction in feature size, which provides efficiency in computation time and space.
引用
收藏
页码:142982 / 142996
页数:15
相关论文
共 50 条
  • [1] Evolutionary Multiobjective Feature Selection in Multiresolution Analysis for BCI
    Ortega, Julio
    Asensio-Cubero, Javier
    Gan, John Q.
    Ortiz, Andres
    [J]. BIOINFORMATICS AND BIOMEDICAL ENGINEERING (IWBBIO 2015), PT I, 2015, 9043 : 347 - 359
  • [2] A Comparative Study of Evolutionary Methods for Feature Selection in Sentiment Analysis
    Garg, Shikhar
    Verma, Sukriti
    [J]. IJCCI: PROCEEDINGS OF THE 11TH INTERNATIONAL JOINT CONFERENCE ON COMPUTATIONAL INTELLIGENCE, 2019, : 131 - 138
  • [3] An evolutionary parallel multiobjective feature selection framework
    Kiziloz, Hakan Ezgi
    Deniz, Ayca
    [J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 159 (159)
  • [4] Interactive evolutionary approaches to multiobjective feature selection
    Ozmen, Muberra
    Karakaya, Gulsah
    Koksalan, Murat
    [J]. INTERNATIONAL TRANSACTIONS IN OPERATIONAL RESEARCH, 2018, 25 (03) : 1027 - 1052
  • [5] Evolutionary Multitasking for Multiobjective Feature Selection in Classification
    Lin, Jiabin
    Chen, Qi
    Xue, Bing
    Zhang, Mengjie
    [J]. IEEE Transactions on Evolutionary Computation, 2024, 28 (06) : 1852 - 1866
  • [6] Multiobjective Evolutionary Feature Selection for Fuzzy Classification
    Jimenez, Fernando
    Martinez, Carlos
    Marzano, Enrico
    Tomas Palma, Jose
    Sanchez, Gracia
    Sciavicco, Guido
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2019, 27 (05) : 1085 - 1099
  • [7] Metaheuristic and evolutionary methods for Feature Selection in Sentiment Analysis (a comparative study)
    Ighazran, Hasna
    Alaoui, Larbi
    Boujiha, Tarik
    [J]. 2018 INTERNATIONAL SYMPOSIUM ON ADVANCED ELECTRICAL AND COMMUNICATION TECHNOLOGIES (ISAECT), 2018,
  • [8] MOFSRank: A Multiobjective Evolutionary Algorithm for Feature Selection in Learning to Rank
    Cheng, Fan
    Guo, Wei
    Zhang, Xingyi
    [J]. COMPLEXITY, 2018,
  • [9] Evolutionary multiobjective ensemble learning based on Bayesian feature selection
    Chen, Huanhuan
    Yao, Xin
    [J]. 2006 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-6, 2006, : 267 - +
  • [10] An evolutionary multiobjective method based on dominance and decomposition for feature selection in classification
    Liang, Jing
    Zhang, Yuyang
    Chen, Ke
    Qu, Boyang
    Yu, Kunjie
    Yue, Caitong
    Suganthan, Ponnuthurai Nagaratnam
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (02)