COVIDSenti: A Large-Scale Benchmark Twitter Data Set for COVID-19 Sentiment Analysis

被引:177
|
作者
Naseem, Usman [1 ]
Razzak, Imran [2 ]
Khushi, Matloob [1 ]
Eklund, Peter W. [2 ]
Kim, Jinman [1 ]
机构
[1] Univ Sydney, Sch Comp Sci, Ultimo, NSW 2006, Australia
[2] Deakin Univ, Sch Informat Technol, Geelong, Vic 3217, Australia
来源
关键词
Social networking (online); COVID-19; Blogs; Pandemics; Sentiment analysis; Statistics; Sociology; epidemic; misinformation; opinion mining; pandemic; sentiment analysis; text mining; Twitter; EVENT DETECTION; ANALYTICS; FRAMEWORK; EMOTIONS; WORLD;
D O I
10.1109/TCSS.2021.3051189
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Social media (and the world at large) have been awash with news of the COVID-19 pandemic. With the passage of time, news and awareness about COVID-19 spread like the pandemic itself, with an explosion of messages, updates, videos, and posts. Mass hysteria manifest as another concern in addition to the health risk that COVID-19 presented. Predictably, public panic soon followed, mostly due to misconceptions, a lack of information, or sometimes outright misinformation about COVID-19 and its impacts. It is thus timely and important to conduct an ex post facto assessment of the early information flows during the pandemic on social media, as well as a case study of evolving public opinion on social media which is of general interest. This study aims to inform policy that can be applied to social media platforms; for example, determining what degree of moderation is necessary to curtail misinformation on social media. This study also analyzes views concerning COVID-19 by focusing on people who interact and share social media on Twitter. As a platform for our experiments, we present a new large-scale sentiment data set COVIDSENTI, which consists of 90 000 COVID-19-related tweets collected in the early stages of the pandemic, from February to March 2020. The tweets have been labeled into positive, negative, and neutral sentiment classes. We analyzed the collected tweets for sentiment classification using different sets of features and classifiers. Negative opinion played an important role in conditioning public sentiment, for instance, we observed that people favored lockdown earlier in the pandemic; however, as expected, sentiment shifted by mid-March. Our study supports the view that there is a need to develop a proactive and agile public health presence to combat the spread of negative sentiment on social media following a pandemic.
引用
收藏
页码:1003 / 1015
页数:13
相关论文
共 50 条
  • [1] Sentiment Analysis on COVID-19 Twitter Data
    Vijay, Tanmay
    Chawla, Ayan
    Dhanka, Balan
    Karmakar, Purnendu
    2020 5TH IEEE INTERNATIONAL CONFERENCE ON RECENT ADVANCES AND INNOVATIONS IN ENGINEERING (IEEE - ICRAIE-2020), 2020,
  • [2] COVID-19 and Misinformation: A Large-Scale Lexical Analysis on Twitter
    Antypas, Dimosthenis
    Rogers, David
    Preece, Alun
    Camacho-Collados, Jose
    ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2021, : 119 - 126
  • [3] Sentiment Analysis on COVID-19 Twitter Data: A Sentiment Timeline
    Karagkiozidou, Makrina
    Koukaras, Paraskevas
    Tjortjis, Christos
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2022, PART II, 2022, 647 : 350 - 359
  • [4] Twitter Sentiment Analysis for Large-Scale Data: An Unsupervised Approach
    Rafeeque Pandarachalil
    Selvaraju Sendhilkumar
    G. S. Mahalakshmi
    Cognitive Computation, 2015, 7 : 254 - 262
  • [5] Twitter Sentiment Analysis for Large-Scale Data: An Unsupervised Approach
    Pandarachalil, Rafeeque
    Sendhilkumar, Selvaraju
    Mahalakshmi, G. S.
    COGNITIVE COMPUTATION, 2015, 7 (02) : 254 - 262
  • [6] Evolution of Public Opinion on COVID-19 Vaccination in Japan: Large-Scale Twitter Data Analysis
    Kobayashi, Ryota
    Takedomi, Yuka
    Nakayama, Yuri
    Suda, Towa
    Uno, Takeaki
    Hashimoto, Takako
    Toyoda, Masashi
    Yoshinaga, Naoki
    Kitsuregawa, Masaru
    Rocha, Luis E. C.
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2022, 24 (12)
  • [7] COVID-19 pandemic and the economy: sentiment analysis on Twitter data
    Fano, Shira
    Toschi, Gianluca
    INTERNATIONAL JOURNAL OF COMPUTATIONAL ECONOMICS AND ECONOMETRICS, 2022, 12 (04) : 429 - 444
  • [8] COVID-19 Vaccine Sensing: Sentiment Analysis from Twitter Data
    Xu, Han
    Liu, Ruixin
    Luo, Ziling
    Xu, Minghua
    Wang, Bang
    2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 3200 - 3205
  • [9] Sentiment analysis of COVID-19 cases in Greece using Twitter data
    Samaras, Loukas
    Garcia-Barriocanal, Elena
    Sicilia, Miguel-Angel
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 230
  • [10] Public Sentiment Analysis on Twitter Data during COVID-19 Outbreak
    Abu Kausar, Mohammad
    Soosaimanickam, Arockiasamy
    Nasar, Mohammad
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (02) : 415 - 422