10SENT: A stable sentiment analysis method based on the combination of off-the-shelf approaches

被引:11
|
作者
Melo, Philipe F. [1 ]
Dalip, Daniel H. [2 ]
Junior, Manoel M. [1 ]
Goncalves, Marcos A. [1 ]
Benevenuto, Fabricio [1 ]
机构
[1] Univ Fed Minas Gerais, Dept Ciencia Comp, Belo Horizonte, MG, Brazil
[2] Ctr Fed Educ Tecnol Minas Gerais, Belo Horizonte, MG, Brazil
关键词
D O I
10.1002/asi.24117
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sentiment analysis has become a very important tool for analysis of social media data. There are several methods developed, covering distinct aspects of the problem and disparate strategies. However, no single technique fits well in all cases or for all data sources. Supervised approaches may be able to adapt to specific situations, but require manually labeled training, which is very cumbersome and expensive to acquire, mainly for a new application. In this context, we propose to combine several popular and effective state-of-the-practice sentiment analysis methods by means of an unsupervised bootstrapped strategy. One of our main goals is to reduce the large variability (low stability) of the unsupervised methods across different domains. The experimental results demonstrate that our combined method (aka, 10SENT) improves the effectiveness of the classification task, considering thirteen different data sets. Also, it tackles the key problem of cross-domain low stability and produces the best (or close to best) results in almost all considered contexts, without any additional costs (e.g., manual labeling). Finally, we also investigate a transfer learning approach for sentiment analysis to gather additional (unsupervised) information for the proposed approach, and we show the potential of this technique to improve our results.
引用
收藏
页码:242 / 255
页数:14
相关论文
共 20 条
  • [1] Off-the-Shelf Technologies for Sentiment Analysis of Social Media Data: Two Empirical Studies
    Carvalho, Arthur
    Harris, Lucas
    [J]. AMCIS 2020 PROCEEDINGS, 2020,
  • [2] Conflict Analysis in Commercial Off-The-Shelf (COTS) Based Development
    Ibrahim, Hamdy
    Wanyama, Tom
    Eberlein, Armin
    Far, Behrouz H.
    [J]. 22ND INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING & KNOWLEDGE ENGINEERING (SEKE 2010), 2010, : 686 - 691
  • [3] Assessment of off-the-shelf SE-specific sentiment analysis tools: An extended replication study
    Nicole Novielli
    Fabio Calefato
    Filippo Lanubile
    Alexander Serebrenik
    [J]. Empirical Software Engineering, 2021, 26
  • [4] Assessment of off-the-shelf SE-specific sentiment analysis tools: An extended replication study
    Novielli, Nicole
    Calefato, Fabio
    Lanubile, Filippo
    Serebrenik, Alexander
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2021, 26 (04)
  • [5] Off-The-Shelf Artificial Intelligence Technologies for Sentiment and Emotion Analysis: A Tutorial on Using IBM Natural Language Processing
    Carvalho, Arthur
    Levitt, Adam
    Levitt, Seth
    Khaddam, Edward
    Benamati, John
    [J]. COMMUNICATIONS OF THE ASSOCIATION FOR INFORMATION SYSTEMS, 2019, 44 (01): : 918 - 943
  • [6] A computational design of experiments based method for evaluation of off-the-shelf total knee replacement implants
    Burge, Thomas A.
    Jeffers, Jonathan R. T.
    Myant, Connor W.
    [J]. COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING, 2023, 26 (06) : 629 - 638
  • [7] What?s the Tone? Easy Doesn?t Do It: Analyzing Performance and Agreement Between Off-the-Shelf Sentiment Analysis Tools
    Boukes, Mark
    van de Velde, Bob
    Araujo, Theo
    Vliegenthart, Rens
    [J]. COMMUNICATION METHODS AND MEASURES, 2020, 14 (02) : 83 - 104
  • [8] Off-the-Shelf Mobile Handset Environments for Deploying Accelerometer based Gait and Activity Analysis Algorithms
    Hynes, Martin
    Wang, Han
    Kilmartin, Liam
    [J]. 2009 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-20, 2009, : 5187 - 5190
  • [9] A Compact, 10-kV, 2-ns Risetime Pulsed-Power Circuit Based on Off-the-Shelf Components
    Kesar, Amit S.
    [J]. IEEE TRANSACTIONS ON PLASMA SCIENCE, 2018, 46 (03) : 594 - 597
  • [10] TopicTracker - An advanced software pipeline for text mining on PubMed data: Bridging the gap between off-the-shelf tools and code based approaches
    Spitale, Giovanni
    Germani, Federico
    Biller-Andorno, Nikola
    [J]. HELIYON, 2024, 10 (17)