A feature selection model based on genetic rank aggregation for text sentiment classification

被引:298
|
作者
Onan, Aytug [1 ]
Korukoglu, Serdar [2 ]
机构
[1] Celal Bayar Univ, Manisa, Turkey
[2] Ege Univ, Izmir, Turkey
关键词
Feature selection; rank aggregation; sentiment classification; TRAVELING SALESMAN PROBLEM; ALGORITHMS;
D O I
10.1177/0165551515613226
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sentiment analysis is an important research direction of natural language processing, text mining and web mining which aims to extract subjective information in source materials. The main challenge encountered in machine learning method-based sentiment classification is the abundant amount of data available. This amount makes it difficult to train the learning algorithms in a feasible time and degrades the classification accuracy of the built model. Hence, feature selection becomes an essential task in developing robust and efficient classification models whilst reducing the training time. In text mining applications, individual filter-based feature selection methods have been widely utilized owing to their simplicity and relatively high performance. This paper presents an ensemble approach for feature selection, which aggregates the several individual feature lists obtained by the different feature selection methods so that a more robust and efficient feature subset can be obtained. In order to aggregate the individual feature lists, a genetic algorithm has been utilized. Experimental evaluations indicated that the proposed aggregation model is an efficient method and it outperforms individual filter-based feature selection methods on sentiment classification.
引用
收藏
页码:25 / 38
页数:14
相关论文
共 50 条
  • [1] Rank Aggregation based Text Feature Selection
    Wu, Ou
    Zuo, Haiqiang
    Zhu, Mingliang
    Hu, Weiming
    Gao, Jun
    Wang, Hanzi
    [J]. 2009 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 1, 2009, : 165 - +
  • [2] Feature selection based on genetic algorithm and hybrid model for sentiment polarity classification
    Kalaivani, P.
    Shunmuganathan, K. L.
    [J]. INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2016, 8 (04) : 315 - 329
  • [3] A Genetic Algorithm Feature Selection Based Approach for Arabic Sentiment Classification
    Aliane, A. A.
    Aliane, H.
    Ziane, M.
    Bensaou, N.
    [J]. 2016 IEEE/ACS 13TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2016,
  • [4] A hybrid method of feature selection for Chinese text sentiment classification
    Wang, Suge
    Wei, Yingjie
    Li, Deyu
    Zhang, Wu
    Li, Wei
    [J]. FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 3, PROCEEDINGS, 2007, : 435 - +
  • [5] Text feature selection for sentiment classification of Chinese online reviews
    Wang, Hongwei
    Yin, Pei
    Yao, Jiani
    Liu, James N. K.
    [J]. JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2013, 25 (04) : 425 - 439
  • [6] A Feature Selection Method Based on Fisher's Discriminant Ratio for Text Sentiment Classification
    Wang, Suge
    Li, Deyu
    Wei, Yingjie
    Li, Hongxia
    [J]. WEB INFORMATION SYSTEMS AND MINING, PROCEEDINGS, 2009, 5854 : 88 - +
  • [7] Feature Enhancement Based Text Sentiment Classification using Deep Learning Model
    Janardhana, D. R.
    Vijay, C. P.
    Swamy, G. B. Janardhana
    Ganaraj, K.
    [J]. PROCEEDINGS OF THE 2020 5TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND SECURITY (ICCCS-2020), 2020,
  • [8] Feature selection and machine learning algorithms for uyghur text sentiment classification
    Turhuntay, Raxida
    Slamu, Wushour
    Dawut, Abdusalam
    Hamdulla, Askar
    Turhun, Erxat
    [J]. Boletin Tecnico/Technical Bulletin, 2017, 55 (13): : 56 - 66
  • [9] Feature Selection For Text Classification Using Genetic Algorithms
    Bidi, Noria
    Elberrichi, Zakaria
    [J]. PROCEEDINGS OF 2016 8TH INTERNATIONAL CONFERENCE ON MODELLING, IDENTIFICATION & CONTROL (ICMIC 2016), 2016, : 806 - 810
  • [10] A feature selection method based on improved fisher's discriminant ratio for text sentiment classification
    Wang, Suge
    Li, Deyu
    Song, Xiaolei
    Wei, Yingjie
    Li, Hongxia
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (07) : 8696 - 8702