Feature selection for sentiment analysis based on content and syntax models

被引:92
|
作者
Duric, Adnan [1 ]
Song, Fei [1 ]
机构
[1] Univ Guelph, Sch Comp Sci, Guelph, ON N1G 2W1, Canada
关键词
Sentiment analysis; Text classification; Feature selection; Maximum entropy modeling; Topic modeling; Content and Syntax models;
D O I
10.1016/j.dss.2012.05.023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent solutions for sentiment analysis have relied on feature selection methods ranging from lexicon-based approaches where the set of features are generated by humans, to approaches that use general statistical measures where features are selected solely on empirical evidence. The advantage of statistical approaches is that they are fully automatic, however, they often fail to separate features that carry sentiment from those that do not. In this paper we propose a set of new feature selection schemes that use a Content and Syntax model to automatically learn a set of features in a review document by separating the entities that are being reviewed from the subjective expressions that describe those entities in terms of polarities. By focusing only on the subjective expressions and ignoring the entities, we can choose more salient features for document-level sentiment analysis. The results obtained from using these features in a maximum entropy classifier are competitive with the state-of-the-art machine learning approaches. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:704 / 711
页数:8
相关论文
共 50 条
  • [21] Application of Rough Set-Based Feature Selection for Arabic Sentiment Analysis
    Al-Radaideh, Qasem A.
    Al-Qudah, Ghufran Y.
    [J]. COGNITIVE COMPUTATION, 2017, 9 (04) : 436 - 445
  • [22] NICFS: A novel feature selection method applied to lexicon based sentiment analysis
    Mehta, Poornima
    Chandra, Satish
    [J]. INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2019, 13 (01): : 41 - 48
  • [23] Application of Rough Set-Based Feature Selection for Arabic Sentiment Analysis
    Qasem A. Al-Radaideh
    Ghufran Y. Al-Qudah
    [J]. Cognitive Computation, 2017, 9 : 436 - 445
  • [24] Feature Selection Based Classification of Sentiment Analysis using Biogeography Optimization Algorithm
    Shahid, Ramsha
    Javed, Sobia Tariq
    Zafar, Kashif
    [J]. 2017 INTERNATIONAL CONFERENCE ON INNOVATIONS IN ELECTRICAL ENGINEERING AND COMPUTATIONAL TECHNOLOGIES (ICIEECT), 2017,
  • [25] Research and Improvement of CHI Feature Selection in Sentiment Analysis
    Li Danyang
    Fan Huimin
    [J]. 2018 INTERNATIONAL SYMPOSIUM ON POWER ELECTRONICS AND CONTROL ENGINEERING (ISPECE 2018), 2019, 1187
  • [26] Feature Selection for Twitter Sentiment Analysis: An Experimental Study
    Mansour, Riham
    Hady, Mohamed Farouk Abdel
    Hosam, Eman
    Amr, Hani
    Ashour, Ahmed
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2015), PT II, 2015, 9042 : 92 - 103
  • [27] QER: a new feature selection method for sentiment analysis
    Parlar, Tuba
    Ozel, Selma Ayse
    Song, Fei
    [J]. HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2018, 8
  • [28] Latent Sentiment Representation for Sentiment Feature Selection
    Liang, Jiguang
    Zhou, Xiaofei
    Liu, Ping
    Guo, Li
    [J]. 2015 IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2015, : 71 - 72
  • [29] Ordinal-based and frequency-based integration of feature selection methods for sentiment analysis
    Yousefpour, Alireza
    Ibrahim, Roliana
    Hamed, Haza Nuzly Abdel
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2017, 75 : 80 - 93
  • [30] Feature Selection Using Multi-objective Optimization for Aspect Based Sentiment Analysis
    Akhtar, Md Shad
    Kohail, Sarah
    Kumar, Amit
    Ekbal, Asif
    Biemann, Chris
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, NLDB 2017, 2017, 10260 : 15 - 27