SWIMS: Semi-supervised subjective feature weighting and intelligent model selection for sentiment analysis

被引:36
|
作者
Khan, Farhan Hassan [1 ]
Qamar, Usman [1 ]
Bashir, Saba [1 ]
机构
[1] Natl Univ Sci & Technol, Coll Elect & Mech Engn, Dept Comp Engn, Islamabad, Pakistan
关键词
Sentiment analysis; Natural Language Processing (NLP); Movie reviews; Cornell; Feature selection; Support Vector Machine; CLASSIFICATION; LEXICON; ALGORITHMS; EXTRACTION; FRAMEWORK;
D O I
10.1016/j.knosys.2016.02.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment Analysis, also called Opinion Mining, is currently one of the most studied research fields. Its aim is to analyze publics' sentiments, opinions, attitudes etc., towards different elements such as topics, products, individuals, organizations, or services. Sentiment classification can be achieved by machine learning or lexical based methodologies or a combination of both. In an effort to improve the performance of domain independent lexicons, this research incorporates machine learning with a lexical based approach introducing a new framework called SWIMS to determine the feature weight based on a well-known general-purpose sentiment lexicon, SentiWordNet. Support vector machine is used to learn the feature weights and an intelligent model selection approach is employed in order to enhance the classification performance. The features are selected based on their subjectivity and the effects of feature selection with respect to their part of speech information are studied extensively. Seven benchmark datasets have been used in this research including large movie review dataset, multi-domain sentiment dataset and Cornell movie review dataset, all of which are available online. In-depth performance comparison is conducted with the state of art machine learning approaches and lexical based methodologies. The evaluation of performance measures proves that the proposed framework outperforms other techniques for sentiment analysis. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:97 / 111
页数:15
相关论文
共 50 条
  • [41] Semi-supervised local feature selection for data classification
    Zechao LI
    Jinhui TANG
    [J]. Science China(Information Sciences), 2021, 64 (09) : 127 - 138
  • [42] SEMI-SUPERVISED EVALUATION OF CONSTRAINT SCORES FOR FEATURE SELECTION
    Kalakech, Mariam
    Biela, Philippe
    Hamad, Denis
    Macaire, Ludovic
    [J]. NCTA 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON NEURAL COMPUTATION THEORY AND APPLICATIONS, 2011, : 175 - 182
  • [43] Semi-supervised Sentiment Classification with Self-training on Feature Subspaces
    Gao, Wei
    Li, Shoushan
    Xue, Yunxia
    Wang, Meng
    Zhou, Guodong
    [J]. CHINESE LEXICAL SEMANTICS, 2014, 8922 : 231 - 239
  • [44] Semi-supervised dimensional sentiment analysis with variational autoencoder
    Wu, Chuhan
    Wu, Fangzhao
    Wu, Sixing
    Yuan, Zhigang
    Liu, Junxin
    Huang, Yongfeng
    [J]. KNOWLEDGE-BASED SYSTEMS, 2019, 165 : 30 - 39
  • [45] Efficient semi-supervised feature selection based on eigenspace model and manifold regularization
    Gu N.
    Li L.
    Shi C.
    Chen Q.
    [J]. Xitong Gongcheng Lilun yu Shijian/System Engineering Theory and Practice, 2020, 40 (11): : 2968 - 2980
  • [46] Attention Aware Semi-supervised Framework for Sentiment Analysis
    Liu, Jingshuang
    Rong, Wenge
    Tian, Chuan
    Gao, Min
    Xiong, Zhang
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 208 - 215
  • [47] Semi-supervised Multi-view Sentiment Analysis
    Lazarova, Gergana
    Koychev, Ivan
    [J]. COMPUTATIONAL COLLECTIVE INTELLIGENCE (ICCCI 2015), PT I, 2015, 9329 : 181 - 190
  • [48] Semi-supervised distributed representations of documents for sentiment analysis
    Park, Saerom
    Lee, Jaewook
    Kim, Kyoungok
    [J]. NEURAL NETWORKS, 2019, 119 : 139 - 150
  • [49] Semi-supervised co-selection: features and instances by a weighting approach
    Makkhongkaew, Raywat
    Benabdeslem, Khalid
    Elghazel, Haytham
    [J]. 2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3477 - 3484
  • [50] Semi-Supervised Feature Selection of Educational Data Mining for Student Performance Analysis
    Yu, Shanshan
    Cai, Yiran
    Pan, Baicheng
    Leung, Man-Fai
    [J]. ELECTRONICS, 2024, 13 (03)