Evolutionary Multiobjective Feature Selection for Sentiment Analysis

被引:7
|
作者
Deniz, Ayca [1 ]
Angin, Merih [2 ]
Angin, Pelin [1 ]
机构
[1] Middle East Tech Univ, Dept Comp Engn, TR-06800 Ankara, Turkey
[2] Koc Univ, Dept Int Relat, TR-34450 Istanbul, Turkey
关键词
Feature extraction; Sentiment analysis; Task analysis; Machine learning; Analytical models; Measurement; Data mining; Binary classification; evolutionary computation; feature selection; multiobjective optimization; sentiment analysis; PARTICLE SWARM OPTIMIZATION; FEATURE SUBSET-SELECTION; CLASSIFICATION; ALGORITHM;
D O I
10.1109/ACCESS.2021.3118961
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sentiment analysis is one of the prominent research areas in data mining and knowledge discovery, which has proven to be an effective technique for monitoring public opinion. The big data era with a high volume of data generated by a variety of sources has provided enhanced opportunities for utilizing sentiment analysis in various domains. In order to take best advantage of the high volume of data for accurate sentiment analysis, it is essential to clean the data before the analysis, as irrelevant or redundant data will hinder extracting valuable information. In this paper, we propose a hybrid feature selection algorithm to improve the performance of sentiment analysis tasks. Our proposed sentiment analysis approach builds a binary classification model based on two feature selection techniques: an entropy-based metric and an evolutionary algorithm. We have performed comprehensive experiments in two different domains using a benchmark dataset, Stanford Sentiment Treebank, and a real-world dataset we have created based on World Health Organization (WHO) public speeches regarding COVID-19. The proposed feature selection model is shown to achieve significant performance improvements in both datasets, increasing classification accuracy for all utilized machine learning and text representation technique combinations. Moreover, it achieves over 70% reduction in feature size, which provides efficiency in computation time and space.
引用
收藏
页码:142982 / 142996
页数:15
相关论文
共 50 条
  • [41] Enhanced Twitter Sentiment Analysis by Using Feature Selection and Combination
    Yang, Ang
    Zhang, Jun
    Pan, Lei
    Xiang, Yang
    [J]. 2015 INTERNATIONAL SYMPOSIUM ON SECURITY AND PRIVACY IN SOCIAL NETWORKS AND BIG DATA (SOCIALSEC 2015), 2015, : 52 - 57
  • [42] Ant colony optimization for text feature selection in sentiment analysis
    Ahmad, Siti Rohaidah
    Abu Bakar, Azuraliza
    Yaaku, Mohd Ridzwan
    [J]. INTELLIGENT DATA ANALYSIS, 2019, 23 (01) : 133 - 158
  • [43] A Review of Feature Selection and Sentiment Analysis Technique in Issues of Propaganda
    Ahmad, Siti Rohaidah
    Rodzi, Muhammad Zakwan Muhammad
    Shapiei, Nurlaila Syafira
    Yusop, Nurhafizah Moziyana Mohd
    Ismail, Suhaila
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (11) : 240 - 245
  • [44] A Review of Feature Selection Algorithms in Sentiment Analysis for Drug Reviews
    Ahmad, Siti Rohaidah
    Yusop, Nurhafizah Moziyana Mohd
    Asri, Afifah Mohd
    Amran, Mohd Fahmi Muhamad
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (12) : 126 - 132
  • [45] A New Feature Selection Method for Sentiment Analysis of Turkish Reviews
    Parlar, Tuba
    Ozel, Selma Ayse
    [J]. PROCEEDINGS OF THE 2016 INTERNATIONAL SYMPOSIUM ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (INISTA), 2016,
  • [46] Examining the Impact of Feature Selection on Sentiment Analysis for the Greek Language
    Spatiotis, Nikolaos
    Paraskevas, Michael
    Perikos, Isidoros
    Mporas, Iosif
    [J]. SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 353 - 361
  • [47] A New Feature Selection Method for Sentiment Analysis in Short Text
    Kumar, H. M. Keerthi
    Harish, B. S.
    [J]. JOURNAL OF INTELLIGENT SYSTEMS, 2020, 29 (01) : 1122 - 1134
  • [48] Sentimental feature selection for sentiment analysis of Chinese online reviews
    Lijuan Zheng
    Hongwei Wang
    Song Gao
    [J]. International Journal of Machine Learning and Cybernetics, 2018, 9 : 75 - 84
  • [49] Sentiment Analysis of IMDb Movie Reviews: A Comparative Analysis of Feature Selection and Feature Extraction Techniques
    Karak, Gahina
    Mishra, Shubham
    Bandyopadhyay, Arkadyuti
    Rohith, Pavirala Ranga Sai
    Rathore, Hemant
    [J]. HYBRID INTELLIGENT SYSTEMS, HIS 2021, 2022, 420 : 283 - 294
  • [50] Multimodal Multiobjective Optimization in Feature Selection
    Yue, C. T.
    Liang, J. J.
    Qu, B. Y.
    Yu, K. J.
    Song, H.
    [J]. 2019 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2019, : 302 - 309