Machine Learning and Deep Learning Sentiment Analysis Models: Case Study on the SENT-COVID Corpus of Tweets in Mexican Spanish

被引:0
|
作者
Gomez-Adorno, Helena [1 ]
Bel-Enguix, Gemma [2 ]
Sierra, Gerardo [2 ]
Barajas, Juan-Carlos [3 ]
Alvarez, William [1 ]
机构
[1] Univ Nacl Autonoma Mexico, Inst Invest Matemat Aplicadas & Sistemas, Mexico City 04510, Mexico
[2] Univ Nacl Autonoma Mexico, Inst Ingn, Mexico City 04510, Mexico
[3] Univ Nacl Autonoma Mex, Fac Ciencias, Mexico City 04510, Mexico
来源
INFORMATICS-BASEL | 2024年 / 11卷 / 02期
关键词
sentiment analysis; COVID-19; machine learning; social media; Spanish; CLASSIFICATION;
D O I
10.3390/informatics11020024
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This article presents a comprehensive evaluation of traditional machine learning and deep learning models in analyzing sentiment trends within the SENT-COVID Twitter corpus, curated during the COVID-19 pandemic. The corpus, filtered by COVID-19 related keywords and manually annotated for polarity, is a pivotal resource for conducting sentiment analysis experiments. Our study investigates various approaches, including classic vector-based systems such as word2vec, doc2vec, and diverse phrase modeling techniques, alongside Spanish pre-trained BERT models. We assess the performance of readily available sentiment analysis libraries for Python users, including TextBlob, VADER, and Pysentimiento. Additionally, we implement and evaluate traditional classification algorithms such as Logistic Regression, Naive Bayes, Support Vector Machines, and simple neural networks like Multilayer Perceptron. Throughout the research, we explore different dimensionality reduction techniques. This methodology enables a precise comparison among classification methods, with BETO-uncased achieving the highest accuracy of 0.73 on the test set. Our findings underscore the efficacy and applicability of traditional machine learning and deep learning models in analyzing sentiment trends within the context of low-resource Spanish language scenarios and emerging topics like COVID-19.
引用
收藏
页数:27
相关论文
共 50 条
  • [1] Sentiment Analysis of COVID-19 Tweets by Machine Learning and Deep Learning Classifiers
    Jain, Ritanshi
    Bawa, Seema
    Sharma, Seemu
    [J]. ADVANCES IN DATA AND INFORMATION SCIENCES, 2022, 318 : 329 - 339
  • [2] A Deep Learning Approach for Sentiment Analysis in Spanish Tweets
    Vizcarra, Gerson
    Mauricio, Antoni
    Mauricio, Leonidas
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III, 2018, 11141 : 622 - 629
  • [3] Sentiment Analysis on COVID-19 Vaccine Tweets using Machine Learning and Deep Learning Algorithms
    Jain, Tarun
    Verma, Vivek Kumar
    Sharma, Akhilesh Kumar
    Saini, Bhavna
    Purohit, Nishant
    Mahdin, Hairulnizam
    Ahmad, Masitah
    Darman, Rozanawati
    Haw, Su-Cheng
    Shaharudin, Shazlyn Milleana
    Arshad, Mohammad Syafwan
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (05) : 32 - 41
  • [4] Evaluating sentiment analysis for Arabic Tweets using machine learning and deep learning
    Alshutayri, Areej
    Alamoudi, Huda
    Alshehri, Boushra
    Aldhahri, Eman
    Alsaleh, Iqbal
    Aljojo, Nahla
    Alghoson, Abdullah
    [J]. ROMANIAN JOURNAL OF INFORMATION TECHNOLOGY AND AUTOMATIC CONTROL-REVISTA ROMANA DE INFORMATICA SI AUTOMATICA, 2022, 32 (04): : 7 - 18
  • [5] Multi-Class Sentiment Analysis of COVID-19 Tweets by Machine Learning and Deep Learning Approaches
    Moustafa, Maaskri
    Mokhtar-Mostefaoui, Sid Ahmed
    Hadj-Meghazi, Madani
    Goismi, Mohamed
    [J]. COMPUTACION Y SISTEMAS, 2024, 28 (02): : 507 - 516
  • [6] NLP and Machine Learning for Sentiment Analysis in COVID-19 Tweets: A Comparative Study
    Shaik, Shahedhadeennisa
    Chaitra, S.P.
    [J]. EAI Endorsed Transactions on Pervasive Health and Technology, 2024, 10
  • [7] A performance comparison of supervised machine learning models for Covid-19 tweets sentiment analysis
    Rustam, Furqan
    Khalid, Madiha
    Aslam, Waqar
    Rupapara, Vaibhav
    Mehmood, Arif
    Choi, Gyu Sang
    [J]. PLOS ONE, 2021, 16 (02):
  • [8] Multiclass sentiment analysis on COVID-19-related tweets using deep learning models
    Sotiria Vernikou
    Athanasios Lyras
    Andreas Kanavos
    [J]. Neural Computing and Applications, 2022, 34 : 19615 - 19627
  • [9] Multiclass sentiment analysis on COVID-19-related tweets using deep learning models
    Vernikou, Sotiria
    Lyras, Athanasios
    Kanavos, Andreas
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (22): : 19615 - 19627
  • [10] Sentiment Analysis of Tweets Using Deep Learning
    Ranganathan, Jaishree
    Tsahai, Tsega
    [J]. ADVANCED DATA MINING AND APPLICATIONS (ADMA 2022), PT I, 2022, 13725 : 106 - 117