Machine Learning Approach for COVID-19 Detection on Twitter

被引:19
|
作者
Amin, Samina [1 ]
Uddin, M. Irfan [1 ]
Al-Baity, Heyam H. [2 ]
Zeb, M. Ali [1 ]
Khan, M. Abrar [1 ]
机构
[1] Kohat Univ Sci & Technol, Inst Comp, Kohat 26000, Pakistan
[2] King Saud Univ, Dept Informat Technol, Coll Comp & Informat Sci, Riyadh 11543, Saudi Arabia
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2021年 / 68卷 / 02期
关键词
Artificial intelligence; coronavirus; COVID-19; pandemic; social network; Twitter; machine learning; natural language processing; RANDOM FOREST; DENGUE; TWEETS;
D O I
10.32604/cmc.2021.016896
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Social networking services (SNSs) provide massive data that can be a very influential source of information during pandemic outbreaks. This study shows that social media analysis can be used as a crisis detector (e.g., understanding the sentiment of social media users regarding various pandemic outbreaks). The novel Coronavirus Disease-19 (COVID-19), commonly known as coronavirus, has affected everyone worldwide in 2020. Streaming Twitter data have revealed the status of the COVID-19 outbreak in the most affected regions. This study focuses on identifying COVID-19 patients using tweets without requiring medical records to find the COVID-19 pandemic in Twitter messages (tweets). For this purpose, we propose herein an intelligent model using traditional machine learning-based approaches, such as support vector machine (SVM), logistic regression (LR), na?ve Bayes (NB), random forest (RF), and decision tree (DT) with the help of the term frequency inverse document frequency (TF-IDF) to detect the COVID-19 pandemic in Twitter messages. The proposed intelligent traditional machine learning-based model classifies Twitter messages into four categories, namely, confirmed deaths, recovered, and suspected. For the experimental analysis, the tweet data on the COVID-19 pandemic are analyzed to evaluate the results of traditional machine learning approaches. A benchmark dataset for COVID-19 on Twitter messages is developed and can be used for future research studies. The experiments show that the results of the proposed approach are promising in detecting the COVID-19 pandemic in Twitter messages with overall accuracy, precision, recall, and F1 score between 70% and 80% and the confusion matrix for machine learning approaches (i.e., SVM, NB, LR, RF, and DT) with the TF-IDF feature extraction technique.
引用
收藏
页码:2231 / 2247
页数:17
相关论文
共 50 条
  • [41] COVID-19 Concerns in US: Topic Detection in Twitter
    Comito, Carmela
    [J]. IDEAS 2021: 25TH INTERNATIONAL DATABASE ENGINEERING & APPLICATIONS SYMPOSIUM, 2021, : 103 - 110
  • [42] COVID-19 and the Futures of Machine Learning
    Arga, Kazim Yalcin
    [J]. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY, 2020, 24 (09) : 512 - 514
  • [43] COVID-19 Public Opinion: A Twitter Healthcare Data Processing Using Machine Learning Methodologies
    Agrawal, Shweta
    Jain, Sanjiv Kumar
    Sharma, Shruti
    Khatri, Ajay
    [J]. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2023, 20 (01)
  • [44] Practical Machine Learning Techniques for COVID-19 Detection Using Chest
    Mangalmurti, Yurananatul
    Wattanapongsakorn, Naruemon
    [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 34 (02): : 733 - 752
  • [45] Automatic COVID-19 detection using machine learning and voice recording
    Benmalek E.
    Elmhamdi J.
    Jilbab A.
    Jbari A.
    [J]. Research on Biomedical Engineering, 2023, 39 (03) : 597 - 612
  • [46] Usefulness of machine learning in COVID-19 for the detection and prognosis of cardiovascular complications
    Zimmerman, Allison
    Kalra, Dinesh
    [J]. REVIEWS IN CARDIOVASCULAR MEDICINE, 2020, 21 (03) : 345 - 352
  • [47] Machine Learning and Image Processing Techniques for Covid-19 Detection: A Review
    Appari, Neeraj Venkatasai L.
    Kanojia, Mahendra G.
    Bangera, Kritik B.
    [J]. PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR 2021), 2022, 417 : 441 - 450
  • [48] Detection of COVID-19 by Machine Learning Using Routine Laboratory Tests
    Cubukcu, Hikmet Can
    Topcu, Deniz Ilhan
    Bayraktar, Nilufer
    Gulsen, Murat
    Sari, Nuran
    Arslan, Ayse Hande
    [J]. AMERICAN JOURNAL OF CLINICAL PATHOLOGY, 2022, 157 (05) : 758 - 766
  • [49] Clinical and Laboratory Approach to Diagnose COVID-19 Using Machine Learning
    Krishnaraj Chadaga
    Chinmay Chakraborty
    Srikanth Prabhu
    Shashikiran Umakanth
    Vivekananda Bhat
    Niranjana Sampathila
    [J]. Interdisciplinary Sciences: Computational Life Sciences, 2022, 14 : 452 - 470
  • [50] Machine learning approach for COVID-19 crisis using the clinical data
    Kumar, N. R. P.
    Shetty, N. S.
    [J]. INDIAN JOURNAL OF BIOCHEMISTRY & BIOPHYSICS, 2020, 57 (05): : 602 - 605