Machine Learning Approach for COVID-19 Detection on Twitter

被引:19
|
作者
Amin, Samina [1 ]
Uddin, M. Irfan [1 ]
Al-Baity, Heyam H. [2 ]
Zeb, M. Ali [1 ]
Khan, M. Abrar [1 ]
机构
[1] Kohat Univ Sci & Technol, Inst Comp, Kohat 26000, Pakistan
[2] King Saud Univ, Dept Informat Technol, Coll Comp & Informat Sci, Riyadh 11543, Saudi Arabia
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2021年 / 68卷 / 02期
关键词
Artificial intelligence; coronavirus; COVID-19; pandemic; social network; Twitter; machine learning; natural language processing; RANDOM FOREST; DENGUE; TWEETS;
D O I
10.32604/cmc.2021.016896
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Social networking services (SNSs) provide massive data that can be a very influential source of information during pandemic outbreaks. This study shows that social media analysis can be used as a crisis detector (e.g., understanding the sentiment of social media users regarding various pandemic outbreaks). The novel Coronavirus Disease-19 (COVID-19), commonly known as coronavirus, has affected everyone worldwide in 2020. Streaming Twitter data have revealed the status of the COVID-19 outbreak in the most affected regions. This study focuses on identifying COVID-19 patients using tweets without requiring medical records to find the COVID-19 pandemic in Twitter messages (tweets). For this purpose, we propose herein an intelligent model using traditional machine learning-based approaches, such as support vector machine (SVM), logistic regression (LR), na?ve Bayes (NB), random forest (RF), and decision tree (DT) with the help of the term frequency inverse document frequency (TF-IDF) to detect the COVID-19 pandemic in Twitter messages. The proposed intelligent traditional machine learning-based model classifies Twitter messages into four categories, namely, confirmed deaths, recovered, and suspected. For the experimental analysis, the tweet data on the COVID-19 pandemic are analyzed to evaluate the results of traditional machine learning approaches. A benchmark dataset for COVID-19 on Twitter messages is developed and can be used for future research studies. The experiments show that the results of the proposed approach are promising in detecting the COVID-19 pandemic in Twitter messages with overall accuracy, precision, recall, and F1 score between 70% and 80% and the confusion matrix for machine learning approaches (i.e., SVM, NB, LR, RF, and DT) with the TF-IDF feature extraction technique.
引用
收藏
页码:2231 / 2247
页数:17
相关论文
共 50 条
  • [1] Twitter Discussions and Emotions About the COVID-19 Pandemic: Machine Learning Approach
    Xue, Jia
    Chen, Junxiang
    Hu, Ran
    Chen, Chen
    Zheng, Chengda
    Su, Yue
    Zhu, Tingshao
    [J]. JOURNAL OF MEDICAL INTERNET RESEARCH, 2020, 22 (11)
  • [2] Machine Learning in Detecting COVID-19 Misinformation on Twitter
    Alenezi, Mohammed N.
    Alqenaei, Zainab M.
    [J]. FUTURE INTERNET, 2021, 13 (10)
  • [3] A Machine Learning Approach as an Aid for Early COVID-19 Detection
    Martinez-Velazquez, Roberto
    Tobon, Diana P., V
    Sanchez, Alejandro
    El Saddik, Abdulmotaleb
    Petriu, Emil
    [J]. SENSORS, 2021, 21 (12)
  • [4] Machine Learning Approach for Autonomous Detection and Classification of COVID-19 Virus
    Shahin, Osama R.
    Alshammari, Hamoud H.
    Taloba, Ahmed I.
    Abd El-Aziz, Rasha M.
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2022, 101
  • [5] Robust and efficient COVID-19 detection techniques: A machine learning approach
    Hasan, Md Mahadi
    Murtaz, Saba Binte
    Islam, Muhammad Usama
    Sadeq, Muhammad Jafar
    Uddin, Jasim
    [J]. PLOS ONE, 2022, 17 (09):
  • [6] Twitter Opinion Mining on COVID-19 Vaccinations by Machine Learning Presence
    Islam, Md Babul
    Hasibunnahar, Swarna
    Shukla, Piyush Kumar
    Shukla, Prashant Kumar
    Rawat, Paresh
    Dange, Jyoti
    [J]. PROCEEDINGS OF THIRD DOCTORAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE, DOSCI 2022, 2023, 479 : 37 - 55
  • [7] COVID-19 Detection Using Contemporary Biosensors and Machine Learning Approach: A Review
    Agarwal, Sajal
    Srivastava, Rupam
    Kumar, Santosh
    Prajapati, Yogendra Kumar
    [J]. IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2024, 23 (02) : 291 - 299
  • [8] A Traditional Machine Learning Approach for COVID-19 Detection from CT Images
    Kabir, Sultanul
    Mohammed, Emad A.
    Zaamout, Khobaib
    Ikki, Salama
    [J]. 2021 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS DASC/PICOM/CBDCOM/CYBERSCITECH 2021, 2021, : 256 - 263
  • [9] COVID-19 detection using federated machine learning
    Salam, Mustafa Abdul
    Taha, Sanaa
    Ramadan, Mohamed
    [J]. PLOS ONE, 2021, 16 (06):
  • [10] Machine Learning Facemask Detection Models for COVID-19
    Puzi, Asmarani Ahmad
    Zainuddin, Ahmad Anwar
    Sahak, Rohilah
    Yunos, Muhammad Farhan Affendi Mohamad
    Rahman, Siti Husna Abdul
    Ramly, Munirah Mohd
    Maaz, Muhammad
    Kaitane, Wonderful Shammah
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON SEMICONDUCTOR ELECTRONICS (ICSE 2022), 2022, : 148 - 151