Automatic Classification Between COVID-19 and Non-COVID-19 Pneumonia Using Symptoms, Comorbidities, and Laboratory Findings: The Khorshid COVID Cohort Study

被引：5

作者：

Marateb, Hamid Reza ^{[1
]}

Ziaie Nezhad, Farzad ^{[1
]}

Mohebian, Mohammad Reza ^{[2
]}

Sami, Ramin ^{[3
]}

Haghjooy Javanmard, Shaghayegh ^{[4
]}

Dehghan Niri, Fatemeh ^{[5
]}

Akafzadeh-Savari, Mahsa ^{[6
]}

Mansourian, Marjan ^{[7
,8
]}

Mananas, Miquel Angel ^{[7
,9
]}

Wolkewitz, Martin ^{[10
,11
]}

Binder, Harald ^{[10
,11
]}

机构：

[1] Univ Isfahan, Engn Fac, Biomed Engn Dept, Esfahan, Iran

[2] Univ Saskatchewan, Dept Elect & Comp Engn, Saskatoon, SK, Canada

[3] Isfahan Univ Med Sci, Sch Med, Dept Internal Med, Esfahan, Iran

[4] Isfahan Univ Med Sci, Cardiovasc Res Inst, Appl Physiol Res Ctr, Dept Physiol,Sch Med, Esfahan, Iran

[5] Isfahan Univ Med Sci, Sch Med, Esfahan, Iran

[6] Isfahan Univ Med Sci, Isfahan Clin Toxicol Res Ctr, Esfahan, Iran

[7] Univ Politecn Catalunya Barcelona Tech UPC, Automat Control Dept ESAII, Biomed Engn Res Ctr CREB, Barcelona, Spain

[8] Isfahan Univ Med Sci, Sch Hlth, Dept Epidemiol & Biostat, Esfahan, Iran

[9] Biomed Res Networking Ctr Bioengn Biomat & Nanome, Madrid, Spain

[10] Univ Freiburg, Inst Med Biometry & Stat, Fac Med, Freiburg, Germany

[11] Univ Freiburg, Inst Med Biometry & Stat, Med Ctr, Freiburg, Germany

来源：

FRONTIERS IN MEDICINE | 2021年 / 8卷

基金：

欧盟地平线“2020”;

关键词：

COVID-19; computer-aided diagnosis; screening; validation studies; machine learning; CORONAVIRUS DISEASE 2019; CLINICAL CHARACTERISTICS; LOGISTIC-REGRESSION; PREDICTION; DIAGNOSIS; SYSTEM; HEALTH; PROGNOSIS; SELECTION; FAMILY;

D O I：

10.3389/fmed.2021.768467

中图分类号：

R5 [内科学];

学科分类号：

1002 ; 100201 ;

摘要：

Coronavirus disease-2019, also known as severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), was a disaster in 2020. Accurate and early diagnosis of coronavirus disease-2019 (COVID-19) is still essential for health policymaking. Reverse transcriptase-polymerase chain reaction (RT-PCR) has been performed as the operational gold standard for COVID-19 diagnosis. We aimed to design and implement a reliable COVID-19 diagnosis method to provide the risk of infection using demographics, symptoms and signs, blood markers, and family history of diseases to have excellent agreement with the results obtained by the RT-PCR and CT-scan. Our study primarily used sample data from a 1-year hospital-based prospective COVID-19 open-cohort, the Khorshid COVID Cohort (KCC) study. A sample of 634 patients with COVID-19 and 118 patients with pneumonia with similar characteristics whose RT-PCR and chest CT scan were negative (as the control group) (dataset 1) was used to design the system and for internal validation. Two other online datasets, namely, some symptoms (dataset 2) and blood tests (dataset 3), were also analyzed. A combination of one-hot encoding, stability feature selection, over-sampling, and an ensemble classifier was used. Ten-fold stratified cross-validation was performed. In addition to gender and symptom duration, signs and symptoms, blood biomarkers, and comorbidities were selected. Performance indices of the cross-validated confusion matrix for dataset 1 were as follows: sensitivity of 96% [confidence interval, CI, 95%: 94-98], specificity of 95% [90-99], positive predictive value (PPV) of 99% [98-100], negative predictive value (NPV) of 82% [76-89], diagnostic odds ratio (DOR) of 496 [198-1,245], area under the ROC (AUC) of 0.96 [0.94-0.97], Matthews Correlation Coefficient (MCC) of 0.87 [0.85-0.88], accuracy of 96% [94-98], and Cohen's Kappa of 0.86 [0.81-0.91]. The proposed algorithm showed excellent diagnosis accuracy and class-labeling agreement, and fair discriminant power. The AUC on the datasets 2 and 3 was 0.97 [0.96-0.98] and 0.92 [0.91-0.94], respectively. The most important feature was white blood cell count, shortness of breath, and C-reactive protein for datasets 1, 2, and 3, respectively. The proposed algorithm is, thus, a promising COVID-19 diagnosis method, which could be an amendment to simple blood tests and screening of symptoms. However, the RT-PCR and chest CT-scan, performed as the gold standard, are not 100% accurate.

引用

页数：14

共 50 条

[1] Cardiovascular outcomes between COVID-19 and non-COVID-19 pneumonia: a nationwide cohort study
Min-Taek Lee
Moon Seong Baek
Tae Wan Kim
Sun-Young Jung
Won-Young Kim
BMC Medicine, 21
[2] Cardiovascular outcomes between COVID-19 and non-COVID-19 pneumonia: a nationwide cohort study
Lee, Min-Taek
Baek, Moon Seong
Kim, Tae Wan
Jung, Sun-Young
Kim, Won-Young
BMC MEDICINE, 2023, 21 (01)
[3] COVID-19 and non-COVID-19 pneumonia: a comparison
Di Mitri, Chiara
Arcoleo, Giuseppe
Mazzuca, Emilia
Camarda, Gaetana
Farinella, Enzo Massimo
Soresi, Maurizio
Carroccio, Antonio
ANNALS OF MEDICINE, 2021, 53 (01) : 2321 - 2331
[4] Comparison and prognostic impact of electrocardiographic findings in COVID-19 pneumonia and non-COVID-19 pneumonia
Kaufmann, C. C.
Ahmed, A.
Huber, K.
WIENER KLINISCHE WOCHENSCHRIFT, 2020, 132 : S180 - S182
[5] Plasma Metabolome Alterations Discriminate between COVID-19 and Non-COVID-19 Pneumonia
More, Tushar H.
Mozafari, Bahareh
Maertens, Andre
Herr, Christian
Lepper, Philipp M.
Danziger, Guy
Volk, Thomas
Hoersch, Sabrina
Krawczyk, Marcin
Guenther, Katharina
Hiller, Karsten
Bals, Robert
METABOLITES, 2022, 12 (11)
[6] Deep learning model for the automatic classification of COVID-19 pneumonia, non-COVID-19 pneumonia, and the healthy: a multi-center retrospective study
Nishio, Mizuho
Kobayashi, Daigo
Nishioka, Eiko
Matsuo, Hidetoshi
Urase, Yasuyo
Onoue, Koji
Ishikura, Reiichi
Kitamura, Yuri
Sakai, Eiro
Tomita, Masaru
Hamanaka, Akihiro
Murakami, Takamichi
SCIENTIFIC REPORTS, 2022, 12 (01)
[7] Deep learning model for the automatic classification of COVID-19 pneumonia, non-COVID-19 pneumonia, and the healthy: a multi-center retrospective study
Mizuho Nishio
Daigo Kobayashi
Eiko Nishioka
Hidetoshi Matsuo
Yasuyo Urase
Koji Onoue
Reiichi Ishikura
Yuri Kitamura
Eiro Sakai
Masaru Tomita
Akihiro Hamanaka
Takamichi Murakami
Scientific Reports, 12
[8] The COVID-19 Pandemic and the Incidence of the Non-COVID-19 Pneumonia in Adults
Huang, Chienhsiu
FRONTIERS IN MEDICINE, 2021, 8
[9] Thrombosis in Covid-19 and non-Covid-19 pneumonia: role of platelets
Violi, Francesco
Cammisotto, Vittoria
Pignatelli, Pasquale
PLATELETS, 2021, 32 (08) : 1009 - 1017
[10] Comparison of COVID-19 and Non-COVID-19 Pneumonia in Down Syndrome
Real de Asua, Diego
Mayer, Miguel A.
del Carmen Ortega, Maria
Borrel, Jose M.
de Jesus Bermejo, Teresa
Gonzalez-Lamuno, Domingo
Manso, Coral
Moldenhauer, Fernando
Carmona-Iragui, Maria
Huls, Anke
Sherman, Stephanie L.
Strydom, Andre
de la Torre, Rafael
Dierssen, Mara
JOURNAL OF CLINICAL MEDICINE, 2021, 10 (16)

← 1 2 3 4 5 →