SWIMS: Semi-supervised subjective feature weighting and intelligent model selection for sentiment analysis

被引:36
|
作者
Khan, Farhan Hassan [1 ]
Qamar, Usman [1 ]
Bashir, Saba [1 ]
机构
[1] Natl Univ Sci & Technol, Coll Elect & Mech Engn, Dept Comp Engn, Islamabad, Pakistan
关键词
Sentiment analysis; Natural Language Processing (NLP); Movie reviews; Cornell; Feature selection; Support Vector Machine; CLASSIFICATION; LEXICON; ALGORITHMS; EXTRACTION; FRAMEWORK;
D O I
10.1016/j.knosys.2016.02.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment Analysis, also called Opinion Mining, is currently one of the most studied research fields. Its aim is to analyze publics' sentiments, opinions, attitudes etc., towards different elements such as topics, products, individuals, organizations, or services. Sentiment classification can be achieved by machine learning or lexical based methodologies or a combination of both. In an effort to improve the performance of domain independent lexicons, this research incorporates machine learning with a lexical based approach introducing a new framework called SWIMS to determine the feature weight based on a well-known general-purpose sentiment lexicon, SentiWordNet. Support vector machine is used to learn the feature weights and an intelligent model selection approach is employed in order to enhance the classification performance. The features are selected based on their subjectivity and the effects of feature selection with respect to their part of speech information are studied extensively. Seven benchmark datasets have been used in this research including large movie review dataset, multi-domain sentiment dataset and Cornell movie review dataset, all of which are available online. In-depth performance comparison is conducted with the state of art machine learning approaches and lexical based methodologies. The evaluation of performance measures proves that the proposed framework outperforms other techniques for sentiment analysis. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:97 / 111
页数:15
相关论文
共 50 条
  • [21] A recursive feature retention method for semi-supervised feature selection
    Qingqing Pang
    Li Zhang
    [J]. International Journal of Machine Learning and Cybernetics, 2021, 12 : 2639 - 2657
  • [22] A recursive feature retention method for semi-supervised feature selection
    Pang, Qingqing
    Zhang, Li
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (09) : 2639 - 2657
  • [23] Assessment of Iterative Semi-Supervised Feature Selection Learning for Sentiment Analyses:Digital Currency Markets
    Akba, Firat
    Medeni, Ihsan Tolga
    Guzel, Mehmet Serdar
    Askerzade, Iman
    [J]. 2020 IEEE 14TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2020), 2020, : 459 - 463
  • [24] Adaptive Feature Selection and Feature Fusion for Semi-supervised Classification
    Wei Du
    Ronald Phlypo
    Tülay Adalı
    [J]. Journal of Signal Processing Systems, 2019, 91 : 521 - 537
  • [25] Adaptive Feature Selection and Feature Fusion for Semi-supervised Classification
    Du, Wei
    Phlypo, Ronald
    Adali, Tulay
    [J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2019, 91 (05): : 521 - 537
  • [26] Sentiment analysis in Turkish: Supervised, semi-supervised, and unsupervised techniques
    Aydin, Cem Rifki
    Gungor, Tunga
    [J]. NATURAL LANGUAGE ENGINEERING, 2021, 27 (04) : 455 - 483
  • [27] Local-to-Global Semi-Supervised Feature Selection
    Hindawi, Mohammed
    Benabdeslem, Khalid
    [J]. PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 2159 - 2167
  • [28] Semi-supervised feature selection via multiobjective optimization
    Handl, Julia
    Knowles, Joshua
    [J]. 2006 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORK PROCEEDINGS, VOLS 1-10, 2006, : 3319 - +
  • [29] Semi-Supervised Learning with Auto-Weighting Feature and Adaptive Graph
    Nie, Feiping
    Shi, Shaojun
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (06) : 1167 - 1178
  • [30] Semi-supervised neighborhood discrimination index for feature selection
    Pang, Qing-Qing
    Zhang, Li
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 204