A Comprehensive Overview of the COVID-19 Literature: Machine Learning-Based Bibliometric Analysis

被引:32
|
作者
Abd-Alrazaq, Alaa [1 ]
Schneider, Jens [1 ]
Mifsud, Borbala [2 ]
Alam, Tanvir [1 ]
Househ, Mowafa [1 ]
Hamdi, Mounir [1 ]
Shah, Zubair [1 ]
机构
[1] Hamad Bin Khalifa Univ, Qatar Fdn, Div Informat & Comp Technol, Coll Sci & Engn, POB 5825,Doha Al Luqta St, Doha 00000, Qatar
[2] Hamad Bin Khalifa Univ, Qatar Fdn, Coll Hlth & Life Sci, Doha, Qatar
关键词
novel coronavirus disease; COVID-19; SARS-CoV-2; 2019-nCoV; bibliometric analysis; literature; machine learning; research; review; CORONAVIRUS DISEASE COVID-19; IMPACT;
D O I
10.2196/23703
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Shortly after the emergence of COVID-19, researchers rapidly mobilized to study numerous aspects of the disease such as its evolution, clinical manifestations, effects, treatments, and vaccinations. This led to a rapid increase in the number of COVID-19-related publications. Identifying trends and areas of interest using traditional review methods (eg, scoping and systematic reviews) for such a large domain area is challenging. Objective: We aimed to conduct an extensive bibliometric analysis to provide a comprehensive overview of the COVID-19 literature. Methods: We used the COVID-19 Open Research Dataset (CORD-19) that consists of a large number of research articles related to all coronaviruses. We used a machine learning-based method to analyze the most relevant COVID-19-related articles and extracted the most prominent topics. Specifically, we used a clustering algorithm to group published articles based on the similarity of their abstracts to identify research hotspots and current research directions. We have made our software accessible to the community via GitHub. Results: Of the 196,630 publications retrieved from the database, we included 28,904 in our analysis. The mean number of weekly publications was 990 (SD 789.3). The country that published the highest number of COVID-19-related articles was China (2950/17,270, 17.08%). The highest number of articles were published in bioRxiv. Lei Liu affiliated with the Southern University of Science and Technology in China published the highest number of articles (n=46). Based on titles and abstracts alone, we were able to identify 1515 surveys, 733 systematic reviews, 512 cohort studies, 480 meta-analyses, and 362 randomized control trials. We identified 19 different topics covered among the publications reviewed. The most dominant topic was public health response, followed by clinical care practices during the COVID-19 pandemic, clinical characteristics and risk factors, and epidemic models for its spread. Conclusions: We provide an overview of the COVID-19 literature and have identified current hotspots and research directions. Our findings can be useful for the research community to help prioritize research needs and recognize leading COVID-19 researchers, institutes, countries, and publishers. Our study shows that an AI-based bibliometric analysis has the potential to rapidly explore a large corpus of academic publications during a public health crisis. We believe that this work can be used to analyze other eHealth-related literature to help clinicians, administrators, and policy makers to obtain a holistic view of the literature and be able to categorize different topics of the existing research for further analyses. It can be further scaled (for instance, in time) to clinical summary documentation. Publishers should avoid noise in the data by developing a way to trace the evolution of individual publications and unique authors.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] A comprehensive overview of cellular senescence from 1990 to 2021: A machine learning-based bibliometric analysis
    Li, Chan
    Liu, Zhaoya
    Shi, Ruizheng
    [J]. FRONTIERS IN MEDICINE, 2023, 10
  • [2] A comprehensive overview of psoriatic research over the past 20 years: machine learning-based bibliometric analysis
    Yu, Chenyang
    Huang, Yingzhao
    Yan, Wei
    Jiang, Xian
    [J]. FRONTIERS IN IMMUNOLOGY, 2023, 14
  • [3] A Comprehensive Overview of the Parathyroid Tumor From the Past Two Decades: Machine Learning-Based Bibliometric Analysis
    Zhang, Zeyu
    Xia, Fada
    Li, Xinying
    [J]. FRONTIERS IN ENDOCRINOLOGY, 2022, 12
  • [4] Coronavirus Disease (COVID-19): A Machine Learning Bibliometric Analysis
    De Felice, Francesca
    Polimeni, Antonella
    [J]. IN VIVO, 2020, 34 : 1613 - 1617
  • [5] Supervised Machine Learning-Based Prediction of COVID-19
    Atta-ur-Rahman
    Sultan, Kiran
    Naseer, Iftikhar
    Majeed, Rizwan
    Musleh, Dhiaa
    Gollapalli, Mohammed Abdul Salam
    Chabani, Sghaier
    Ibrahim, Nehad
    Siddiqui, Shahan Yamin
    Khan, Muhammad Adnan
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 69 (01): : 21 - 34
  • [6] A survey of machine learning-based methods for COVID-19 medical image analysis
    Kashfia Sailunaz
    Tansel Özyer
    Jon Rokne
    Reda Alhajj
    [J]. Medical & Biological Engineering & Computing, 2023, 61 : 1257 - 1297
  • [7] A survey of machine learning-based methods for COVID-19 medical image analysis
    Sailunaz, Kashfia
    Ozyer, Tansel
    Rokne, Jon
    Alhajj, Reda
    [J]. MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2023, 61 (06) : 1257 - 1297
  • [8] Machine learning techniques during the COVID-19 Pandemic: A Bibliometric Analysis
    Alavi, Meysam
    Valiollahi, Arefeh
    Kargari, Mehrdad
    [J]. 2023 6TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND IMAGE ANALYSIS, IPRIA, 2023,
  • [9] Machine Learning Applications in Prediction Models for COVID-19: A Bibliometric Analysis
    Lv, Hai
    Liu, Yangyang
    Yin, Huimin
    Xi, Jingzhi
    Wei, Pingmin
    [J]. INFORMATION, 2024, 15 (09)
  • [10] Machine learning-based IoT system for COVID-19 epidemics
    Arowolo, Micheal Olaolu
    Ogundokun, Roseline Oluwaseun
    Misra, Sanjay
    Agboola, Blessing Dorothy
    Gupta, Brij
    [J]. COMPUTING, 2023, 105 (04) : 831 - 847