Machine Learning-Based Screening Solution for COVID-19 Cases Investigation: Socio-Demographic and Behavioral Factors Analysis and COVID-19 Detection

被引:1
|
作者
K. M. Aslam Uddin
Farida Siddiqi Prity
Maisha Tasnim
Sumiya Nur Jannat
Mohammad Omar Faruk
Jahirul Islam
Saydul Akbar Murad
Apurba Adhikary
Anupam Kumar Bairagi
机构
[1] Noakhali Science and Technology University,Department of Information and Communication Engineering
[2] Noakhali Science and Technology University,Department of Statistics
[3] New Mexico Institute of Mining and Technology,Department of Computer Science and Engineering
[4] University of Southern Mississippi,School of Computing Sciences and Engineering
[5] Khulna University,Computing Sciences and Engineering Discipline
来源
Human-Centric Intelligent Systems | 2023年 / 3卷 / 4期
关键词
COVID-19; Socio-demographic; Behavior; Pearson Chi-square; Machine Learning;
D O I
10.1007/s44230-023-00049-9
中图分类号
学科分类号
摘要
The COVID-19 pandemic has unleashed an unprecedented global crisis, releasing a wave of illness, mortality, and economic disarray of unparalleled proportions. Numerous societal and behavioral aspects have conspired to fuel the rampant spread of COVID-19 across the globe. These factors encompass densely populated areas, adherence to mask-wearing protocols, inadequate awareness levels, and various behavioral and social practices. Despite the extensive research surrounding COVID-19 detection, an unfortunate dearth of studies has emerged to meticulously evaluate the intricate interplay between socio-demographic and behavioral factors and the likelihood of COVID-19 infection. Thus, a comprehensive online-based cross-sectional survey was methodically orchestrated, amassing data from a substantial sample size of 500 respondents. The precisely designed survey questionnaire encompassed various variables encompassing socio-demographics, behaviors, and social factors. The Bivariate Pearson’s Chi-square association test was deftly employed to unravel the complex associations between the explanatory variables and COVID-19 infection. The feature importance approach was also introduced to discern the utmost critical features underpinning this infectious predicament. Four distinct Machine Learning (ML) algorithms, specifically Decision Tree, Random Forest, CatBoost, and XGBoost, were employed to accurately predict COVID-19 infection based on a comprehensive analysis of socio-demographic and behavioral factors. The performance of these models was rigorously assessed using a range of evaluation metrics, including accuracy, recall, precision, ROC-AUC score, and F1 score. Pearson’s Chi-square test revealed a statistically significant association between vaccination status and COVID-19 infection. The use of sanitizer and masks, the timing of infection, and the interval between the first and second vaccine doses were significantly correlated with the likelihood of contracting the COVID-19 virus. Among the ML models tested, the XGBoost classifier demonstrated the highest classification accuracy, achieving an impressive 97.6%. These findings provide valuable insights for individuals, communities, and policymakers to implement targeted strategies aimed at mitigating the impact of the COVID-19 pandemic.
引用
收藏
页码:441 / 460
页数:19
相关论文
共 50 条
  • [41] The socio-demographic profile associated with perinatal depression during the COVID-19 era
    Katina Kovacheva
    María F. Rodríguez-Muñoz
    Diego Gómez-Baya
    Sara Domínguez-Salas
    Emma Motrico
    BMC Public Health, 23
  • [42] Machine Learning-based Voice Assessment for the Detection of Positive and Recovered COVID-19 Patients
    Robotti, Carlo
    Costantini, Giovanni
    Saggio, Giovanni
    Cesarini, Valerio
    Calastri, Anna
    Maiorano, Eugenia
    Piloni, Davide
    Perrone, Tiziano
    Sabatini, Umberto
    Ferretti, Virginia Valeria
    Cassaniti, Irene
    Baldanti, Fausto
    Gravina, Andrea
    Sakib, Ahmed
    Alessi, Elena
    Pietrantonio, Filomena
    Pascucci, Matteo
    Casali, Daniele
    Zarezadeh, Zakarya
    Del Zoppo, Vincenzo
    Pisani, Antonio
    Benazzo, Marco
    JOURNAL OF VOICE, 2024, 38 (03) : 796e1 - 796e13
  • [43] Machine Learning and Deep Learning-Based Detection and Analysis of COVID-19 in Chest X-Ray Images
    Kumar, Kunal
    Shokeen, Harsh
    Gambhir, Shalini
    Kumar, Ashwani
    Saraswat, Amar
    INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, ICICC 2022, VOL 3, 2023, 492 : 151 - 160
  • [44] A Machine Learning-Based Web Tool for the Severity Prediction of COVID-19
    Christodoulou, Avgi
    Katsarou, Martha-Spyridoula
    Emmanouil, Christina
    Gavrielatos, Marios
    Georgiou, Dimitrios
    Tsolakou, Annia
    Papasavva, Maria
    Economou, Vasiliki
    Nanou, Vasiliki
    Nikolopoulos, Ioannis
    Daganou, Maria
    Argyraki, Aikaterini
    Stefanidis, Evaggelos
    Metaxas, Gerasimos
    Panagiotou, Emmanouil
    Michalopoulos, Ioannis
    Drakoulis, Nikolaos
    BIOTECH, 2024, 13 (03):
  • [45] Analysis and Prediction of COVID-19 in Xinjiang based on Machine Learning
    Liu, Yunxiang
    Xiao, Yan
    2020 5TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE, COMPUTER TECHNOLOGY AND TRANSPORTATION (ISCTT 2020), 2020, : 382 - 385
  • [46] Machine learning-based Analysis of COVID-19 Pandemic Impact on US Research Networks
    Kiran, Mariam
    Campbell, Scott
    Wala, Fatema Bannat
    Buraglio, Nick
    Monga, Inder
    ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2021, 51 (04) : 23 - 23
  • [47] Road networks and socio-demographic factors to explore COVID-19 infection during its different waves
    Shahadat Uddin
    Arif Khan
    Haohui Lu
    Fangyu Zhou
    Shakir Karim
    Farshid Hajati
    Mohammad Ali Moni
    Scientific Reports, 14
  • [48] Road networks and socio-demographic factors to explore COVID-19 infection during its different waves
    Uddin, Shahadat
    Khan, Arif
    Lu, Haohui
    Zhou, Fangyu
    Karim, Shakir
    Hajati, Farshid
    Moni, Mohammad Ali
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [49] Socio-demographic factors influencing physical health among youth during the COVID-19 pandemic in Bangladesh
    Ahammad Hossain
    Al Muktadir Munam
    Rejvi Ahmed Bhuiya
    Md. Ruhul Amin
    Mohammad Zulficar Ali
    SN Social Sciences, 3 (7):
  • [50] Machine learning-based analysis of COVID-19 pandemic impact on US research networks
    Kiran M.
    Campbell S.
    Wala F.B.
    Buraglio N.
    Monga I.
    Computer Communication Review, 2021, 51 (04): : 23 - 34