Correcting data imbalance for semi-supervised COVID-19 detection using X-ray chest images

被引:20
|
作者
Calderon-Ramirez, Saul [1 ,2 ]
Yang, Shengxiang [1 ]
Moemeni, Armaghan [3 ]
Elizondo, David [1 ]
Colreavy-Donnelly, Simon [1 ]
Chavarria-Estrada, Luis Fernando [4 ]
Molina-Cabello, Miguel A. [5 ,6 ]
机构
[1] De Montfort Univ, Ctr Computat Intelligence CCI, Leicester, Leics, England
[2] Inst Tecnol Costa Rica, Cartago, Costa Rica
[3] Univ Nottingham, Sch Comp Sci, Nottingham, England
[4] Imagenes Med Dr Chavarria Estrada, San Jose, Costa Rica
[5] Univ Malaga, Dept Comp Languages & Comp Sci, Malaga, Spain
[6] Inst Invest Biomed Malaga IBIMA, Malaga, Spain
关键词
Coronavirus; COVID-19; Computer aided diagnosis; Data imbalance; Semi-supervised learning; DEEP; RADIOLOGY; FEATURES;
D O I
10.1016/j.asoc.2021.107692
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A key factor in the fight against viral diseases such as the coronavirus (COVID-19) is the identification of virus carriers as early and quickly as possible, in a cheap and efficient manner. The application of deep learning for image classification of chest X-ray images of COVID-19 patients could become a useful pre-diagnostic detection methodology. However, deep learning architectures require large labelled datasets. This is often a limitation when the subject of research is relatively new as in the case of the virus outbreak, where dealing with small labelled datasets is a challenge. Moreover, in such context, the datasets are also highly imbalanced, with few observations from positive cases of the new disease. In this work we evaluate the performance of the semi-supervised deep learning architecture known as MixMatch with a very limited number of labelled observations and highly imbalanced labelled datasets. We demonstrate the critical impact of data imbalance to the model's accuracy. Therefore, we propose a simple approach for correcting data imbalance, by re-weighting each observation in the loss function, giving a higher weight to the observations corresponding to the under-represented class. For unlabelled observations, we use the pseudo and augmented labels calculated by MixMatch to choose the appropriate weight. The proposed method improved classification accuracy by up to 18%, with respect to the non balanced MixMatch algorithm. We tested our proposed approach with several available datasets using 10, 15 and 20 labelled observations, for binary classification (COVID-19 positive and normal cases). For multi-class classification (COVID-19 positive, pneumonia and normal cases), we tested 30, 50, 70 and 90 labelled observations. Additionally, a new dataset is included among the tested datasets, composed of chest X-ray images of Costa Rican adult patients. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] COVID-19 Diagnosis Using Chest X-ray Images via Classification and Object Detection
    Yoshitsugu, Kenji
    Nakamoto, Yukikazu
    AICCC 2021: 2021 4TH ARTIFICIAL INTELLIGENCE AND CLOUD COMPUTING CONFERENCE, 2021, : 62 - 67
  • [42] COVID-19 Detection in Chest X-ray Images Using a New Channel Boosted CNN
    Khan, Saddam Hussain
    Sohail, Anabia
    Khan, Asifullah
    Lee, Yeon-Soo
    DIAGNOSTICS, 2022, 12 (02)
  • [43] COVID-19 detection in chest X-ray images using deep boosted hybrid learning
    Khan, Saddam Hussain
    Sohail, Anabia
    Khan, Asifullah
    Hassan, Mehdi
    Lee, Yeon Soo
    Alam, Jamshed
    Basit, Abdul
    Zubair, Saima
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 137
  • [44] Fast COVID-19 Detection from Chest X-Ray Images Using DCT Compression
    Taher, Fatma
    Haweel, Reem T.
    Al Bastaki, Usama M. H.
    Abdelwahed, Eman
    Rehman, Tariq
    Haweel, Tarek I.
    APPLIED COMPUTATIONAL INTELLIGENCE AND SOFT COMPUTING, 2022, 2022
  • [45] Detection of COVID-19 from Chest X-Ray Images using CNN and ANN Approach
    Arowolo, Micheal Olaolu
    Adebiyi, Marion Olubunmi
    Michael, Eniola Precious
    Aigbogun, Happiness Eric
    Abdulsalam, Sulaiman Olaniyi
    Adebiyi, Ayodele Ariyo
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (06) : 754 - 759
  • [46] Covid19 Detection Using Chest X-ray Images Along with Corresponding Metadata of the Chest X-ray
    Paul, Sourav
    Das, Ranjita
    Khanal, Bipal
    WIRELESS PERSONAL COMMUNICATIONS, 2024, 135 (04) : 2379 - 2399
  • [47] Detection of COVID-19 in Chest X-ray Images: A Big Data Enabled Deep Learning Approach
    Awan, Mazhar Javed
    Bilal, Muhammad Haseeb
    Yasin, Awais
    Nobanee, Haitham
    Khan, Nabeel Sabir
    Zain, Azlan Mohd
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (19)
  • [48] Identification of COVID-19 with Chest X-ray Images using Deep Learning
    Khandar, Punam
    Thaokar, Chetana
    INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2021, 12 (05): : 694 - 700
  • [49] Deep Learn in for Screening COVID-19 using Chest X-Ray Images
    Basu, Sanhita
    Mitra, Sushmita
    Saha, Nilanjan
    2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 2521 - 2527
  • [50] Advance Warning Methodologies for COVID-19 Using Chest X-Ray Images
    Ahishali, Mete
    Degerli, Aysen
    Yamac, Mehmet
    Kiranyaz, Serkan
    Chowdhury, Muhammad E. H.
    Hameed, Khalid
    Hamid, Tahir
    Mazhar, Rashid
    Gabbouj, Moncef
    IEEE ACCESS, 2021, 9 : 41052 - 41065