Design and analysis of a large-scale COVID-19 tweets dataset

被引:102
|
作者
Lamsal, Rabindra [1 ]
机构
[1] Jawaharlal Nehru Univ, Sch Comp & Syst Sci, New Delhi 110067, India
关键词
Social computing; Crisis computing; Sentiment analysis; Network analysis; Twitter data; TWITTER; SENTIMENT; TIME;
D O I
10.1007/s10489-020-02029-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As of July 17, 2020, more than thirteen million people have been diagnosed with the Novel Coronavirus (COVID-19), and half a million people have already lost their lives due to this infectious disease. The World Health Organization declared the COVID-19 outbreak as a pandemic on March 11, 2020. Since then, social media platforms have experienced an exponential rise in the content related to the pandemic. In the past, Twitter data have been observed to be indispensable in the extraction of situational awareness information relating to any crisis. This paper presents COV19Tweets Dataset (Lamsal 2020a), a large-scale Twitter dataset with more than 310 million COVID-19 specific English language tweets and their sentiment scores. The dataset's geo version, the GeoCOV19Tweets Dataset (Lamsal 2020b), is also presented. The paper discusses the datasets' design in detail, and the tweets in both the datasets are analyzed. The datasets are released publicly, anticipating that they would contribute to a better understanding of spatial and temporal dimensions of the public discourse related to the ongoing pandemic. As per the stats, the datasets (Lamsal 2020a, 2020b) have been accessed over 74.5k times, collectively.
引用
收藏
页码:2790 / 2804
页数:15
相关论文
共 50 条
  • [41] Evolution of Public Opinion on COVID-19 Vaccination in Japan: Large-Scale Twitter Data Analysis
    Kobayashi, Ryota
    Takedomi, Yuka
    Nakayama, Yuri
    Suda, Towa
    Uno, Takeaki
    Hashimoto, Takako
    Toyoda, Masashi
    Yoshinaga, Naoki
    Kitsuregawa, Masaru
    Rocha, Luis E. C.
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2022, 24 (12)
  • [42] Sentimental and spatial analysis of COVID-19 vaccines tweets
    Umair, Areeba
    Masciari, Elio
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2023, 60 (01) : 1 - 21
  • [43] Analysis of COVID-19 Tweets During Lockdown Phases
    Tyagi, Prince
    Goyal, Naman
    Gupta, Trasha
    2021 9TH INTERNATIONAL CONFERENCE ON INFORMATION AND EDUCATION TECHNOLOGY (ICIET 2021), 2021, : 471 - 475
  • [44] Topic based Sentiment Analysis for COVID-19 Tweets
    Abdulaziz, Manal
    Alsolamy, Mashail
    Alotaibi, Alanoud
    Alabbas, Abeer
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (01) : 626 - 636
  • [45] Sentiment Analysis of Pandemic Tweets with COVID-19 as a Prototype
    Almutiri, Mashail
    Alghamdi, Mona
    Elazhary, Hanan
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (04) : 510 - 518
  • [46] Building and Testing Fine-Grained Dataset of COVID-19 Tweets for Worry Prediction
    Alharbi, Tahani Soud
    Fkih, Fethi
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (08) : 645 - 652
  • [47] Nostalgia on Twitter: Detection and Analysis of a Large-Scale Dataset
    Stanley Jothiraj, Fiona Victoria
    Hong, Lingzi
    Mashhadi, Afra
    Proceedings of the Association for Information Science and Technology, 2024, 61 (01) : 349 - 360
  • [48] The effect of large-scale anti-contagion policies on the COVID-19 pandemic
    Hsiang, Solomon
    Allen, Daniel
    Annan-Phan, Sebastien
    Bell, Kendon
    Bolliger, Ian
    Chong, Trinetta
    Druckenmiller, Hannah
    Huang, Luna Yue
    Hultgren, Andrew
    Krasovich, Emma
    Lau, Peiley
    Lee, Jaecheol
    Rolf, Esther
    Tseng, Jeanette
    Wu, Tiffany
    NATURE, 2020, 584 (7820) : 262 - +
  • [49] Schools, Universities and Large-Scale Assessment Responses to COVID-19: The Swedish Example
    Wiberg, Marie
    Lyren, Per-Erik
    Pantzare, Anna Lind
    EDUCATION SCIENCES, 2021, 11 (04):
  • [50] Large-Scale and Green? The German Covid-19 Stimulus Package in International Comparison
    Hoerisch, Felix
    Wurster, Stefan
    Siewert, Markus B.
    GERMAN POLITICS, 2023, 32 (04) : 710 - 737