A performance comparison of supervised machine learning models for Covid-19 tweets sentiment analysis

Cited by: 134
Authors
Rustam, Furqan [1]
Khalid, Madiha [1]
Aslam, Waqar [2]
Rupapara, Vaibhav [3]
Mehmood, Arif [2]
Choi, Gyu Sang [4]
Affiliations
[1] Khwaja Fareed Univ Engn & Informat Technol, Dept Comp Sci, Rahim Yar Khan, Pakistan
[2] Islamia Univ Bahawalpur, Dept Comp Sci & Informat Technol, Bahawalpur, Punjab, Pakistan
[3] Florida Int Univ, Sch Comp & Informat Sci, Miami, FL 33199 USA
[4] Yeungnam Univ, Dept Informat & Commun Engn, Gyongsan, Gyeongbuk, South Korea
Source
PLOS ONE | 2021 / Volume 16 / Issue 02
Funding
National Research Foundation, Singapore;
Keywords
CLASSIFICATION;
DOI
10.1371/journal.pone.0245909
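Chinese Library Classification
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Subject classification codes
07; 0710; 09;
Abstract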
The spread of Covid-19 has raised health concerns worldwide, and social media is increasingly used to share news and opinions about it. A realistic assessment of the situation is necessary to use resources optimally and appropriately. In this research, we perform sentiment analysis of Covid-19 tweets using a supervised machine learning approach. Identifying Covid-19 sentiments from tweets would allow informed decisions for better handling of the current pandemic. The dataset is extracted from Twitter using IDs provided by IEEE DataPort. Tweets are extracted by an in-house crawler built on the Tweepy library. The dataset is cleaned using preprocessing techniques, and sentiments are extracted using the TextBlob library. The contribution of this work is the performance evaluation of various machine learning classifiers using our proposed feature set, which is formed by concatenating bag-of-words and term frequency-inverse document frequency (TF-IDF) features. Tweets are classified as positive, neutral, or negative. Classifier performance is evaluated in terms of accuracy, precision, recall, and F1 score. For completeness, the dataset is further investigated using the Long Short-Term Memory (LSTM) deep learning architecture. The results show that the Extra Trees Classifier outperforms all other models, achieving an accuracy of 0.93 with our proposed concatenated feature set. The LSTM achieves lower accuracy than the machine learning classifiers. To demonstrate the effectiveness of our proposed feature set, the results are compared with the VADER sentiment analysis technique based on the GloVe feature extraction approach.
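The concatenated feature set described in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the whitespace tokenizer, the smoothed IDF formula, the absence of vector normalization, and the zero-threshold mapping from a polarity score to a label are all assumptions made for illustration, and every function name below is hypothetical. The polarity score is assumed to be a number in [-1, 1], such as the one a sentiment library like TextBlob returns.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a stand-in for real preprocessing)."""
    return text.lower().split()


def build_vocabulary(docs):
    """Map each distinct token to a column index, in sorted order."""
    tokens = sorted({tok for doc in docs for tok in tokenize(doc)})
    return {tok: i for i, tok in enumerate(tokens)}


def bow_vector(doc, vocab):
    """Bag-of-words: raw term counts for one document."""
    counts = Counter(tokenize(doc))
    return [float(counts.get(tok, 0)) for tok in vocab]


def idf_weights(docs, vocab):
    """Smoothed IDF, ln((1 + n) / (1 + df)) + 1 (one common variant, assumed here)."""
    n = len(docs)
    df = Counter(tok for doc in docs for tok in set(tokenize(doc)))
    return [math.log((1 + n) / (1 + df[tok])) + 1.0 for tok in vocab]


def concatenated_features(docs):
    """Return (vocab, rows) where each row is the BoW vector followed by the TF-IDF vector."""
    vocab = build_vocabulary(docs)
    idf = idf_weights(docs, vocab)
    rows = []
    for doc in docs:
        bow = bow_vector(doc, vocab)
        tfidf = [count * weight for count, weight in zip(bow, idf)]
        rows.append(bow + tfidf)  # list concatenation: 2 * len(vocab) columns
    return vocab, rows


def label_from_polarity(polarity):
    """Assumed mapping from a polarity score to the three classes."""
    if polarity > 0:
        return "positive"
    if polarity < 0:
        return "negative"
    return "neutral"


docs = ["stay safe stay home", "covid cases rising"]
vocab, rows = concatenated_features(docs)
print(len(vocab), len(rows[0]))
```

Each row has twice as many columns as the vocabulary has terms, so a classifier trained on these rows sees both the raw counts and their IDF-weighted counterparts. In practice a library vectorizer and a tree-ensemble classifier would replace these hand-written helpers.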
Pages: 23