Machine Learning Approach for COVID-19 Detection on Twitter

被引:19
|
作者
Amin, Samina [1 ]
Uddin, M. Irfan [1 ]
Al-Baity, Heyam H. [2 ]
Zeb, M. Ali [1 ]
Khan, M. Abrar [1 ]
机构
[1] Kohat Univ Sci & Technol, Inst Comp, Kohat 26000, Pakistan
[2] King Saud Univ, Dept Informat Technol, Coll Comp & Informat Sci, Riyadh 11543, Saudi Arabia
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2021年 / 68卷 / 02期
关键词
Artificial intelligence; coronavirus; COVID-19; pandemic; social network; Twitter; machine learning; natural language processing; RANDOM FOREST; DENGUE; TWEETS;
D O I
10.32604/cmc.2021.016896
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Social networking services (SNSs) provide massive data that can be a very influential source of information during pandemic outbreaks. This study shows that social media analysis can be used as a crisis detector (e.g., understanding the sentiment of social media users regarding various pandemic outbreaks). The novel Coronavirus Disease-19 (COVID-19), commonly known as coronavirus, has affected everyone worldwide in 2020. Streaming Twitter data have revealed the status of the COVID-19 outbreak in the most affected regions. This study focuses on identifying COVID-19 patients using tweets without requiring medical records to find the COVID-19 pandemic in Twitter messages (tweets). For this purpose, we propose herein an intelligent model using traditional machine learning-based approaches, such as support vector machine (SVM), logistic regression (LR), na?ve Bayes (NB), random forest (RF), and decision tree (DT) with the help of the term frequency inverse document frequency (TF-IDF) to detect the COVID-19 pandemic in Twitter messages. The proposed intelligent traditional machine learning-based model classifies Twitter messages into four categories, namely, confirmed deaths, recovered, and suspected. For the experimental analysis, the tweet data on the COVID-19 pandemic are analyzed to evaluate the results of traditional machine learning approaches. A benchmark dataset for COVID-19 on Twitter messages is developed and can be used for future research studies. The experiments show that the results of the proposed approach are promising in detecting the COVID-19 pandemic in Twitter messages with overall accuracy, precision, recall, and F1 score between 70% and 80% and the confusion matrix for machine learning approaches (i.e., SVM, NB, LR, RF, and DT) with the TF-IDF feature extraction technique.
引用
收藏
页码:2231 / 2247
页数:17
相关论文
共 50 条
  • [31] A Machine Learning Approach to Precision Medicine for COVID-19 Therapeutics
    Siefkas, A.
    Lam, C.
    Zelin, N.
    Barnes, G.
    Hoffman, J.
    Calvert, J.
    Mao, Q.
    Das, R.
    [J]. AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2021, 203 (09)
  • [32] Automatic Detection in Twitter of Non-Traumatic Grief Due to Deaths by COVID-19: A Deep Learning Approach
    Mata-Vazquez, Jacinto
    Pachon-Alvarez, Victoria
    Gualda, Estrella
    Araujo-Hernandez, Miriam
    Garcia-Navarro, E. Begona
    [J]. IEEE ACCESS, 2023, 11 : 143402 - 143416
  • [33] Evaluation of Hybrid Unsupervised and Supervised Machine Learning Approach to Detect Self-Reporting of COVID-19 Symptoms on Twitter
    Cai, Mingxiang
    Li, Jiawei
    Nali, Matthew
    Mackey, Tim K.
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2021,
  • [34] Are Twitter sentiments during COVID-19 pandemic a critical determinant to predict stock market movements? A machine learning approach
    Jena, Pradyot Ranjan
    Majhi, Ritanjali
    [J]. SCIENTIFIC AFRICAN, 2023, 19
  • [35] Optimised genetic algorithm-extreme learning machine approach for automatic COVID-19 detection
    Albadr, Musatafa Abbas Abbood
    Tiun, Sabrina
    Ayob, Masri
    AL-Dhief, Fahad Taha
    Omar, Khairuddin
    Hamzah, Faizal Amri
    [J]. PLOS ONE, 2020, 15 (12):
  • [36] Detection of COVID-19 Using Protein Sequence Data via Machine Learning Classification Approach
    Aminah, Siti
    Ardaneswari, Gianinna
    Husnah, Mufarrido
    Deori, Ghani
    Prasetyo, Handi Bagus
    [J]. JOURNAL OF APPLIED MATHEMATICS, 2023, 2023
  • [37] A Three-Fold Machine Learning Approach for Detection of COVID-19 from Audio Data
    Kumar, Nikhil
    Mittal, Vishal
    Sharma, Yashvardhan
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2021, PT III, 2021, 12951 : 497 - 511
  • [38] A machine learning approach for socialbot targets detection on Twitter
    Abulaish, Muhammad
    Fazil, Mohd
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (03) : 4115 - 4133
  • [39] A comprehensive review of COVID-19 detection with machine learning and deep learning techniques
    Das, Sreeparna
    Ayus, Ishan
    Gupta, Deepak
    [J]. HEALTH AND TECHNOLOGY, 2023, 13 (04) : 679 - 692
  • [40] A comprehensive review of COVID-19 detection with machine learning and deep learning techniques
    Sreeparna Das
    Ishan Ayus
    Deepak Gupta
    [J]. Health and Technology, 2023, 13 : 679 - 692