Evolutionary Multiobjective Feature Selection for Sentiment Analysis

Cited by: 7
Authors
Deniz, Ayca [1]
Angin, Merih [2]
Angin, Pelin [1]
Affiliations
[1] Middle East Tech Univ, Dept Comp Engn, TR-06800 Ankara, Turkey
[2] Koc Univ, Dept Int Relat, TR-34450 Istanbul, Turkey
Keywords
Feature extraction; Sentiment analysis; Task analysis; Machine learning; Analytical models; Measurement; Data mining; Binary classification; evolutionary computation; feature selection; multiobjective optimization; sentiment analysis; PARTICLE SWARM OPTIMIZATION; FEATURE SUBSET-SELECTION; CLASSIFICATION; ALGORITHM;
DOI
10.1109/ACCESS.2021.3118961
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Sentiment analysis is one of the prominent research areas in data mining and knowledge discovery, which has proven to be an effective technique for monitoring public opinion. The big data era with a high volume of data generated by a variety of sources has provided enhanced opportunities for utilizing sentiment analysis in various domains. In order to take best advantage of the high volume of data for accurate sentiment analysis, it is essential to clean the data before the analysis, as irrelevant or redundant data will hinder extracting valuable information. In this paper, we propose a hybrid feature selection algorithm to improve the performance of sentiment analysis tasks. Our proposed sentiment analysis approach builds a binary classification model based on two feature selection techniques: an entropy-based metric and an evolutionary algorithm. We have performed comprehensive experiments in two different domains using a benchmark dataset, Stanford Sentiment Treebank, and a real-world dataset we have created based on World Health Organization (WHO) public speeches regarding COVID-19. The proposed feature selection model is shown to achieve significant performance improvements in both datasets, increasing classification accuracy for all utilized machine learning and text representation technique combinations. Moreover, it achieves over 70% reduction in feature size, which provides efficiency in computation time and space.
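The two-stage idea in the abstract (an entropy-based filter followed by an evolutionary search) can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not name the specific metric, classifier, or evolutionary operators, so the sketch assumes information gain as the entropy-based metric, a leave-one-out 1-nearest-neighbour error as the accuracy objective, and a mutation-only loop that keeps non-dominated feature masks. The toy data and all function names are hypothetical.

```python
import math
import random
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def information_gain(column, labels):
    """Reduction in label entropy after splitting on one feature column."""
    n = len(labels)
    gain = entropy(labels)
    for value in set(column):
        subset = [y for x, y in zip(column, labels) if x == value]
        gain -= (len(subset) / n) * entropy(subset)
    return gain

def filter_features(X, y, threshold):
    """Stage 1 (assumed): keep features whose information gain exceeds threshold."""
    return [j for j in range(len(X[0]))
            if information_gain([row[j] for row in X], y) > threshold]

def objectives(mask, X, y, candidates):
    """Return (error, n_selected); both objectives are minimized."""
    selected = [j for j, bit in zip(candidates, mask) if bit]
    if not selected:
        return (1.0, 0)  # no features: treat as worst possible error
    errors = 0
    for i, xi in enumerate(X):
        # Leave-one-out 1-NN with Hamming distance over the selected features.
        nearest = min((k for k in range(len(X)) if k != i),
                      key=lambda k: sum(xi[j] != X[k][j] for j in selected))
        errors += y[nearest] != y[i]
    return (errors / len(X), len(selected))

def dominates(a, b):
    """True if a is no worse than b on every objective and differs from b."""
    return all(p <= q for p, q in zip(a, b)) and a != b

def pareto_front(population, score):
    scored = [(m, score(m)) for m in population]
    return [m for m, s in scored if not any(dominates(t, s) for _, t in scored)]

def evolve(X, y, candidates, generations=20, pop_size=8, seed=0):
    """Stage 2 (assumed): mutation-only search; candidates must be non-empty."""
    rng = random.Random(seed)
    score = lambda mask: objectives(mask, X, y, candidates)
    population = [tuple(rng.randint(0, 1) for _ in candidates)
                  for _ in range(pop_size)]
    for _ in range(generations):
        children = []
        for parent in population:
            flip = rng.randrange(len(candidates))  # flip one random bit
            children.append(tuple(b ^ (i == flip) for i, b in enumerate(parent)))
        pool = sorted(set(population + children))
        front = pareto_front(pool, score)
        rest = [m for m in pool if m not in front]
        population = (front + rest)[:pop_size]  # non-dominated masks first
    return pareto_front(population, score)

# Hypothetical toy data: rows are documents, columns are binary word features.
X = [[1, 0, 1, 1], [1, 0, 1, 0], [1, 0, 0, 1],
     [0, 1, 1, 1], [0, 1, 1, 0], [0, 1, 0, 1]]
y = [1, 1, 1, 0, 0, 0]

candidates = filter_features(X, y, threshold=0.5)
front = evolve(X, y, candidates)
for mask in front:
    print(mask, objectives(mask, X, y, candidates))
```

Each mask on the returned front is a trade-off between classification error and the number of selected features; a practitioner would pick one according to how much accuracy they are willing to give up for a smaller feature set.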
Pages: 142982-142996
Page count: 15