COVIDSenti: A Large-Scale Benchmark Twitter Data Set for COVID-19 Sentiment Analysis

被引:177
|
作者
Naseem, Usman [1 ]
Razzak, Imran [2 ]
Khushi, Matloob [1 ]
Eklund, Peter W. [2 ]
Kim, Jinman [1 ]
机构
[1] Univ Sydney, Sch Comp Sci, Ultimo, NSW 2006, Australia
[2] Deakin Univ, Sch Informat Technol, Geelong, Vic 3217, Australia
来源
关键词
Social networking (online); COVID-19; Blogs; Pandemics; Sentiment analysis; Statistics; Sociology; epidemic; misinformation; opinion mining; pandemic; sentiment analysis; text mining; Twitter; EVENT DETECTION; ANALYTICS; FRAMEWORK; EMOTIONS; WORLD;
D O I
10.1109/TCSS.2021.3051189
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Social media (and the world at large) have been awash with news of the COVID-19 pandemic. With the passage of time, news and awareness about COVID-19 spread like the pandemic itself, with an explosion of messages, updates, videos, and posts. Mass hysteria manifest as another concern in addition to the health risk that COVID-19 presented. Predictably, public panic soon followed, mostly due to misconceptions, a lack of information, or sometimes outright misinformation about COVID-19 and its impacts. It is thus timely and important to conduct an ex post facto assessment of the early information flows during the pandemic on social media, as well as a case study of evolving public opinion on social media which is of general interest. This study aims to inform policy that can be applied to social media platforms; for example, determining what degree of moderation is necessary to curtail misinformation on social media. This study also analyzes views concerning COVID-19 by focusing on people who interact and share social media on Twitter. As a platform for our experiments, we present a new large-scale sentiment data set COVIDSENTI, which consists of 90 000 COVID-19-related tweets collected in the early stages of the pandemic, from February to March 2020. The tweets have been labeled into positive, negative, and neutral sentiment classes. We analyzed the collected tweets for sentiment classification using different sets of features and classifiers. Negative opinion played an important role in conditioning public sentiment, for instance, we observed that people favored lockdown earlier in the pandemic; however, as expected, sentiment shifted by mid-March. Our study supports the view that there is a need to develop a proactive and agile public health presence to combat the spread of negative sentiment on social media following a pandemic.
引用
收藏
页码:1003 / 1015
页数:13
相关论文
共 50 条
  • [21] Analysis of Public Sentiment on COVID-19 Vaccination Using Twitter
    Jayasurya, Gutti Gowri
    Kumar, Sanjay
    Singh, Binod Kumar
    Kumar, Vinay
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2022, 9 (04) : 1101 - 1111
  • [22] Surveilling COVID-19 Emotional Contagion on Twitter by Sentiment Analysis
    Crocamo, Cristina
    Viviani, Marco
    Famiglini, Lorenzo
    Bartoli, Francesco
    Pasi, Gabriella
    Carra, Giuseppe
    EUROPEAN PSYCHIATRY, 2021, 64 (01)
  • [23] Deep Learning Model for COVID-19 Sentiment Analysis on Twitter
    Contreras Hernandez, Salvador
    Tzili Cruz, Maria Patricia
    Espinola Sanchez, Jose Martin
    Perez Tzili, Angelica
    NEW GENERATION COMPUTING, 2023, 41 (02) : 189 - 212
  • [24] Deep Learning Model for COVID-19 Sentiment Analysis on Twitter
    Salvador Contreras Hernández
    María Patricia Tzili Cruz
    José Martín Espínola Sánchez
    Angélica Pérez Tzili
    New Generation Computing, 2023, 41 : 189 - 212
  • [25] Sentiment Analysis on Twitter About COVID-19 Vaccination in Mexico
    Bernal, Claudia
    Bernal, Miguel
    Noguera, Andrei
    Ponce, Hiram
    Avalos-Gauna, Edgar
    ADVANCES IN SOFT COMPUTING (MICAI 2021), PT II, 2021, 13068 : 96 - 107
  • [26] COVID-19 Vaccine Brand Sentiment on Twitter
    Campan, AlMa
    Truta, Traian Marius
    Huesman, Shawn
    Meda, Vamsi
    Anderson, Jake
    PROCEEDINGS OF THE 2022 WORKSHOP ON OPEN CHALLENGES IN ONLINE SOCIAL NETWORKS, OASIS 2022/33RD ACM CONFERENCE ON HYPERTEXT AND SOCIAL MEDIA, HT 2022, 2022, : 39 - 49
  • [27] Automatically Identifying Self-Reports of COVID-19 Diagnosis on Twitter: An Annotated Data Set, Deep Neural Network Classifiers, and a Large-Scale Cohort
    Klein, Ari Z.
    Kunatharaju, Shriya
    O'Connor, Karen
    Gonzalez-Hernandez, Graciela
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2023, 25
  • [28] Design and analysis of a large-scale COVID-19 tweets dataset
    Rabindra Lamsal
    Applied Intelligence, 2021, 51 : 2790 - 2804
  • [29] Design and analysis of a large-scale COVID-19 tweets dataset
    Lamsal, Rabindra
    APPLIED INTELLIGENCE, 2021, 51 (05) : 2790 - 2804
  • [30] A large-scale analysis of COVID-19 tweets in the Arab region
    Aya Mourad
    Shady Elbassuoni
    Social Network Analysis and Mining, 2022, 12