Using Unsupervised Machine Learning to Identify Age- and Sex-Independent Severity Subgroups Among Patients with COVID-19: Observational Longitudinal Study

被引:14
|
作者
Benito-Leon, Julian [1 ]
Dolores del Castillo, Ma [2 ]
Estirado, Alberto [3 ]
Ghosh, Ritwik [4 ]
Dubey, Souvik [5 ]
Serrano, J. Ignacio [2 ]
机构
[1] Univ Hosp 12 Octubre, Dept Neurol, Ave Cordoba S-N, Madrid 28041, Spain
[2] CSIC UPM, Neural & Cognit Engn Grp, Ctr Automat & Robot, Arganda Del Rey, Spain
[3] HM Hosp, Madrid, Spain
[4] Burdwan Med Coll & Hosp, Dept Gen Med, Burdwan, W Bengal, India
[5] Bangur Inst Neurosci, Dept Neuromed, Kolkata, India
关键词
COVID-19; machine learning; outcome; severity; subgroup; emergency; detection; intervention; testing; data set; characterization; LACTATE-DEHYDROGENASE; MORTALITY; DIAGNOSIS;
D O I
10.2196/25988
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Early detection and intervention are the key factors for improving outcomes in patients with COVID-19. Objective: The objective of this observational longitudinal study was to identify nonoverlapping severity subgroups (ie, clusters) among patients with COVID-19, based exclusively on clinical data and standard laboratory tests obtained during patient assessment in the emergency department. Methods: We applied unsupervised machine learning to a data set of 853 patients with COVID-19 from the HM group of hospitals (HM Hospitales) in Madrid, Spain. Age and sex were not considered while building the clusters, as these variables could introduce biases in machine learning algorithms and raise ethical implications or enable discrimination in triage protocols. Results: From 850 clinical and laboratory variables, four tests-the serum levels of aspartate transaminase (AST), lactate dehydrogenase (LDH), C-reactive protein (CRP), and the number of neutrophils-were enough to segregate the entire patient pool into three separate clusters. Further, the percentage of monocytes and lymphocytes and the levels of alanine transaminase (ALT) distinguished cluster 3 patients from the other two clusters. The highest proportion of deceased patients; the highest levels of AST, ALT, LDH, and CRP; the highest number of neutrophils; and the lowest percentages of monocytes and lymphocytes characterized cluster 1. Cluster 2 included a lower proportion of deceased patients and intermediate levels of the previous laboratory tests. The lowest proportion of deceased patients; the lowest levels of AST, ALT, LDH, and CRP; the lowest number of neutrophils; and the highest percentages of monocytes and lymphocytes characterized cluster 3. Conclusions: A few standard laboratory tests, deemed available in all emergency departments, have shown good discriminative power for the characterization of severity subgroups among patients with COVID-19.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Feature Identification Using Interpretability Machine Learning Predicting Risk Factors for Disease Severity of In-Patients with COVID-19 in South Florida
    Datta, Debarshi
    Ray, Subhosit
    Martinez, Laurie
    Newman, David
    Dalmida, Safiya George
    Hashemi, Javad
    Sareli, Candice
    Eckardt, Paula
    DIAGNOSTICS, 2024, 14 (17)
  • [42] Clinical features and haematological parameters associated with COVID-19 severity among hospitalized patients: A retrospective observational study from Tribal Central India
    Kabirpanthi, Vikrant
    Gupta, Vikas
    Singh, Ajit
    JOURNAL OF FAMILY MEDICINE AND PRIMARY CARE, 2022, 11 (10) : 6042 - 6048
  • [43] Lethality risk markers by sex and age-group for COVID-19 in Mexico: a cross-sectional study based on machine learning approach
    Mariano Rojas-García
    Blanca Vázquez
    Kirvis Torres-Poveda
    Vicente Madrid-Marina
    BMC Infectious Diseases, 23
  • [44] Lethality risk markers by sex and age-group for COVID-19 in Mexico: a cross-sectional study based on machine learning approach
    Rojas-Garcia, Mariano
    Vazquez, Blanca
    Torres-Poveda, Kirvis
    Madrid-Marina, Vicente
    BMC INFECTIOUS DISEASES, 2023, 23 (01)
  • [45] Impact of COVID-19 research: a study on predicting influential scholarly documents using machine learning and a domain-independent knowledge graph
    Rabby, Gollam
    D'Souza, Jennifer
    Oelen, Allard
    Dvorackova, Lucie
    Svatek, Vojtech
    Auer, Soeren
    JOURNAL OF BIOMEDICAL SEMANTICS, 2023, 14 (01)
  • [46] Impact of COVID-19 research: a study on predicting influential scholarly documents using machine learning and a domain-independent knowledge graph
    Gollam Rabby
    Jennifer D’Souza
    Allard Oelen
    Lucie Dvorackova
    Vojtěch Svátek
    Sören Auer
    Journal of Biomedical Semantics, 14
  • [47] Correlation between Chest X-Ray Severity in COVID-19 and Age in Mexican-Mestizo Patients: An Observational Cross-Sectional Study
    Albrandt-Salmeron, Arturo
    Espejo-Fonseca, Ruby
    Roldan-Valadez, Ernesto
    BIOMED RESEARCH INTERNATIONAL, 2021, 2021
  • [48] Early Stage Identification of COVID-19 Patients in Mexico Using Machine Learning: A Case Study for the Tijuana General Hospital
    Castillo-Olea, Cristian
    Conte-Galvan, Roberto
    Zuniga, Clemente
    Siono, Alexandra
    Huerta, Angelica
    Bardhi, Ornela
    Ortiz, Eric
    INFORMATION, 2021, 12 (12)
  • [49] Using D-dimer as a Biomarker to Predict COVID-19 Disease Severity from Clinical Data of Hospitalized Patients: A Machine Learning Approach
    Wu, Yuqi
    Ren, Yang
    Wu, Dezhi
    Xirasagar, Sudha
    Johnson, Joseph
    2022 IEEE 10TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2022), 2022, : 664 - 668
  • [50] Identification of Age-Related Characteristic Genes Involved in Severe COVID-19 Infection Among Elderly Patients Using Machine Learning and Immune Cell Infiltration Analysis
    Li, Huan
    Zhao, Jin
    Xing, Yan
    Chen, Jia
    Wen, Ziying
    Ma, Rui
    Han, Fengxia
    Huang, Boyong
    Wang, Hao
    Li, Cui
    Chen, Yang
    Ning, Xiaoxuan
    BIOCHEMICAL GENETICS, 2024,