A hybrid method of feature selection for Chinese text sentiment classification

被引:11
|
作者
Wang, Suge [1 ,2 ]
Wei, Yingjie [2 ]
Li, Deyu
Zhang, Wu [1 ]
Li, Wei [2 ]
机构
[1] Shanghai Univ, Sch Engn & Comp Sci, Shanghai 200072, Peoples R China
[2] Shanxi Univ, Sch Math Sci, Taiyuan 030006, Peoples R China
关键词
D O I
10.1109/FSKD.2007.49
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text sentiment classification can be extensively applied to information retrieval, text filtering, online tracking evaluation, the diagnoses of public opinions and chat systems. In this paper, a kinds of hybrid methods, based on category distinguishing ability of words and information gain, is adopted to feature selection. For examining the impact of varying the feature dimension to classification results, using corpus of car reviews, feature dimensions, 1000, 2000 and 3000 are adopted in our experiments. The experiments classification results indicate that the hybrid methods are best with feature dimension equal to 3000, and the result by using hybrid methods is superior to that by directly using information gain. In our experiments F value can achieve over 80%. Finally, some mistake examples are employed to indicate the limitations of methods in this paper.
引用
收藏
页码:435 / +
页数:3
相关论文
共 50 条
  • [31] Statera: A Balanced Feature Selection Method for Text Classification
    Gama Bispo, Braian Varjao
    Rios, Tatiane Nogueira
    [J]. 2018 7TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2018, : 260 - 265
  • [32] A parallel feature selection method study for text classification
    Li, Zhao
    Lu, Wei
    Sun, Zhanquan
    Xing, Weiwei
    [J]. NEURAL COMPUTING & APPLICATIONS, 2017, 28 : S513 - S524
  • [33] A parallel feature selection method study for text classification
    Zhao Li
    Wei Lu
    Zhanquan Sun
    Weiwei Xing
    [J]. Neural Computing and Applications, 2017, 28 : 513 - 524
  • [34] A novel probabilistic feature selection method for text classification
    Uysal, Alper Kursat
    Gunal, Serkan
    [J]. KNOWLEDGE-BASED SYSTEMS, 2012, 36 : 226 - 235
  • [35] Hybrid Ensemble Learning With Feature Selection for Sentiment Classification in Social Media
    Sharma, Sanur
    Jain, Anurag
    [J]. INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2020, 10 (02) : 40 - 58
  • [36] A Hybrid Feature Selection Method for Classification Purposes
    Cateni, Silvia
    Colla, Valentina
    Vannucci, Marco
    [J]. UKSIM-AMSS EIGHTH EUROPEAN MODELLING SYMPOSIUM ON COMPUTER MODELLING AND SIMULATION (EMS 2014), 2014, : 39 - 44
  • [37] Hybrid ACO and TOFA Feature Selection Approach for Text Classification
    Alghamdi, Hanan S.
    Tang, H. Lilian
    Alshomrani, Saleh
    [J]. 2012 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2012,
  • [38] A Hybrid Method for Sentiment Classification in Chinese Movie Reviews Based on Sentiment Labels
    Zhao, Kai
    Jin, Yaohong
    [J]. PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, 2015, : 86 - 89
  • [39] Feature Selection in Text Classification
    Sahin, Durmus Ozkan
    Ates, Nurullah
    Kilic, Erdal
    [J]. 2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 1777 - 1780
  • [40] A Novel Feature Selection Method Based on Probability Latent Semantic Analysis for Chinese Text Classification
    Zhong Jiang
    Sun Qigan
    Li Xue
    Wen Luosheng
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2011, 20 (02) : 228 - 232