Correcting data imbalance for semi-supervised COVID-19 detection using X-ray chest images

被引:20
|
作者
Calderon-Ramirez, Saul [1 ,2 ]
Yang, Shengxiang [1 ]
Moemeni, Armaghan [3 ]
Elizondo, David [1 ]
Colreavy-Donnelly, Simon [1 ]
Chavarria-Estrada, Luis Fernando [4 ]
Molina-Cabello, Miguel A. [5 ,6 ]
机构
[1] De Montfort Univ, Ctr Computat Intelligence CCI, Leicester, Leics, England
[2] Inst Tecnol Costa Rica, Cartago, Costa Rica
[3] Univ Nottingham, Sch Comp Sci, Nottingham, England
[4] Imagenes Med Dr Chavarria Estrada, San Jose, Costa Rica
[5] Univ Malaga, Dept Comp Languages & Comp Sci, Malaga, Spain
[6] Inst Invest Biomed Malaga IBIMA, Malaga, Spain
关键词
Coronavirus; COVID-19; Computer aided diagnosis; Data imbalance; Semi-supervised learning; DEEP; RADIOLOGY; FEATURES;
D O I
10.1016/j.asoc.2021.107692
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A key factor in the fight against viral diseases such as the coronavirus (COVID-19) is the identification of virus carriers as early and quickly as possible, in a cheap and efficient manner. The application of deep learning for image classification of chest X-ray images of COVID-19 patients could become a useful pre-diagnostic detection methodology. However, deep learning architectures require large labelled datasets. This is often a limitation when the subject of research is relatively new as in the case of the virus outbreak, where dealing with small labelled datasets is a challenge. Moreover, in such context, the datasets are also highly imbalanced, with few observations from positive cases of the new disease. In this work we evaluate the performance of the semi-supervised deep learning architecture known as MixMatch with a very limited number of labelled observations and highly imbalanced labelled datasets. We demonstrate the critical impact of data imbalance to the model's accuracy. Therefore, we propose a simple approach for correcting data imbalance, by re-weighting each observation in the loss function, giving a higher weight to the observations corresponding to the under-represented class. For unlabelled observations, we use the pseudo and augmented labels calculated by MixMatch to choose the appropriate weight. The proposed method improved classification accuracy by up to 18%, with respect to the non balanced MixMatch algorithm. We tested our proposed approach with several available datasets using 10, 15 and 20 labelled observations, for binary classification (COVID-19 positive and normal cases). For multi-class classification (COVID-19 positive, pneumonia and normal cases), we tested 30, 50, 70 and 90 labelled observations. Additionally, a new dataset is included among the tested datasets, composed of chest X-ray images of Costa Rican adult patients. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Chest x-ray images: transfer learning model in COVID-19 detection
    Mao, Siqi
    Kulbayeva, Saltanat
    Osadchuk, Mikhail
    JOURNAL OF EVALUATION IN CLINICAL PRACTICE, 2025, 31 (01)
  • [32] Variational Autoencoder Based Imbalanced COVID-19 Detection Using Chest X-Ray Images
    Chatterjee, Sankhadeep
    Maity, Soumyajit
    Bhattacharjee, Mayukh
    Banerjee, Soumen
    Das, Asit Kumar
    Ding, Weiping
    NEW GENERATION COMPUTING, 2023, 41 (01) : 25 - 60
  • [33] Optimal Ensemble learning model for COVID-19 detection using chest X-ray images
    Balasubramaniam, S.
    Kumar, K. Satheesh
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 81
  • [34] Variational Autoencoder Based Imbalanced COVID-19 Detection Using Chest X-Ray Images
    Sankhadeep Chatterjee
    Soumyajit Maity
    Mayukh Bhattacharjee
    Soumen Banerjee
    Asit Kumar Das
    Weiping Ding
    New Generation Computing, 2023, 41 : 25 - 60
  • [35] FocusCovid: automated COVID-19 detection using deep learning with chest X-ray images
    Agrawal, Tarun
    Choudhary, Prakash
    EVOLVING SYSTEMS, 2022, 13 (04) : 519 - 533
  • [36] Handling class imbalance in COVID-19 chest X-ray images classification: Using SMOTE and weighted loss
    Chamseddine, Ekram
    Mansouri, Nesrine
    Soui, Makram
    Abed, Mourad
    APPLIED SOFT COMPUTING, 2022, 129
  • [37] Covid-19 Detection Based on Chest X-Ray Images Using DCT Compression and NN
    Taher, Fatma
    Haweel, Reem T.
    Al Bastaki, Usama Mohammad Hassan
    Abdelwahed, Eman
    Rehman, Tariq
    Haweel, Tarek I.
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGING SYSTEMS AND TECHNIQUES (IST 2022), 2022,
  • [38] Detection of COVID-19 from Chest X-Ray Images Using Convolutional Neural Networks
    Sekeroglu, Boran
    Ozsahin, Ilker
    SLAS TECHNOLOGY, 2020, 25 (06): : 553 - 565
  • [39] OSEGNET: OPERATIONAL SEGMENTATION NETWORK FOR COVID-19 DETECTION USING CHEST X-RAY IMAGES
    Degerli, Aysen
    Kiranyaz, Serkan
    Chowdhury, Muhammad E. H.
    Gabbouj, Moncef
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2306 - 2310
  • [40] FocusCovid: automated COVID-19 detection using deep learning with chest X-ray images
    Tarun Agrawal
    Prakash Choudhary
    Evolving Systems, 2022, 13 : 519 - 533