A medical image classification method based on self-regularized adversarial learning

被引:0
|
作者
Fan, Zong [1 ]
Zhang, Xiaohui [1 ]
Ruan, Su [2 ]
Thorstad, Wade [3 ]
Gay, Hiram [3 ]
Song, Pengfei [4 ]
Wang, Xiaowei [5 ]
Li, Hua [1 ,3 ,6 ]
机构
[1] Univ Illinois, Dept Bioengn, Champaign, IL USA
[2] Univ Rouen, EA 4108, Lab LITIS, Equipe Quantif, Rouen, France
[3] Washington Univ St Louis, Dept Radiat Oncol, St Louis, MO 63130 USA
[4] Univ Illinois, Dept Elect & Comp Engn, Champaign, IL USA
[5] Univ Illinois, Dept Pharmacol & Bioengn, Chicago, IL USA
[6] Canc Ctr Illinois, Urbana, IL USA
关键词
adversarial learning; deep learning; medical image classification;
D O I
10.1002/mp.17320
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
BackgroundDeep learning (DL) techniques have been extensively applied in medical image classification. The unique characteristics of medical imaging data present challenges, including small labeled datasets, severely imbalanced class distribution, and significant variations in imaging quality. Recently, generative adversarial network (GAN)-based classification methods have gained attention for their ability to enhance classification accuracy by incorporating realistic GAN-generated images as data augmentation. However, the performance of these GAN-based methods often relies on high-quality generated images, while large amounts of training data are required to train GAN models to achieve optimal performance.PurposeIn this study, we propose an adversarial learning-based classification framework to achieve better classification performance. Innovatively, GAN models are employed as supplementary regularization terms to support classification, aiming to address the challenges described above.MethodsThe proposed classification framework, GAN-DL, consists of a feature extraction network (F-Net), a classifier, and two adversarial networks, specifically a reconstruction network (R-Net) and a discriminator network (D-Net). The F-Net extracts features from input images, and the classifier uses these features for classification tasks. R-Net and D-Net have been designed following the GAN architecture. R-Net employs the extracted feature to reconstruct the original images, while D-Net is tasked with the discrimination between the reconstructed image and the original images. An iterative adversarial learning strategy is designed to guide model training by incorporating multiple network-specific loss functions. These loss functions, serving as supplementary regularization, are automatically derived during the reconstruction process and require no additional data annotation.ResultsTo verify the model's effectiveness, we performed experiments on two datasets, including a COVID-19 dataset with 13 958 chest x-ray images and an oropharyngeal squamous cell carcinoma (OPSCC) dataset with 3255 positron emission tomography images. Thirteen classic DL-based classification methods were implemented on the same datasets for comparison. Performance metrics included precision, sensitivity, specificity, and F1$F_1$-score. In addition, we conducted ablation studies to assess the effects of various factors on model performance, including the network depth of F-Net, training image size, training dataset size, and loss function design. Our method achieved superior performance than all comparative methods. On the COVID-19 dataset, our method achieved 95.4%+/- 0.6%$95.4\%\pm 0.6\%$, 95.3%+/- 0.9%$95.3\%\pm 0.9\%$, 97.7%+/- 0.4%$97.7\%\pm 0.4\%$, and 95.3%+/- 0.9%$95.3\%\pm 0.9\%$ in terms of precision, sensitivity, specificity, and F1$F_1$-score, respectively. It achieved 96.2%+/- 0.7%$96.2\%\pm 0.7\%$ across all these metrics on the OPSCC dataset. The study to investigate the effects of two adversarial networks highlights the crucial role of D-Net in improving model performance. Ablation studies further provide an in-depth understanding of our methodology.ConclusionOur adversarial-based classification framework leverages GAN-based adversarial networks and an iterative adversarial learning strategy to harness supplementary regularization during training. This design significantly enhances classification accuracy and mitigates overfitting issues in medical image datasets. Moreover, its modular design not only demonstrates flexibility but also indicates its potential applicability to various clinical contexts and medical imaging applications.
引用
收藏
页码:8232 / 8246
页数:15
相关论文
共 50 条
  • [31] Regularized discriminative broad learning system for image classification
    Jin, Junwei
    Qin, Zhenhao
    Yu, Dengxiu
    Li, Yanting
    Liang, Jing
    Chen, C. L. Philip
    KNOWLEDGE-BASED SYSTEMS, 2022, 251
  • [32] Elastic net regularized dictionary learning for image classification
    Bin Shen
    Bao-Di Liu
    Qifan Wang
    Multimedia Tools and Applications, 2016, 75 : 8861 - 8874
  • [33] A Lie group kernel learning method for medical image classification
    Liu, Li
    Sun, Haocheng
    Li, Fanzhang
    PATTERN RECOGNITION, 2023, 142
  • [34] Generative adversarial network based regularized image reconstruction for PET
    Xie, Zhaoheng
    Baikejiang, Reheman
    Li, Tiantian
    Zhang, Xuezhu
    Gong, Kuang
    Zhang, Mengxi
    Qi, Wenyuan
    Asma, Evren
    Qi, Jinyi
    PHYSICS IN MEDICINE AND BIOLOGY, 2020, 65 (12):
  • [35] Generative adversarial networks based regularized image reconstruction for PET
    Xie, Zhaoheng
    Baikejiang, Reheman
    Gong, Kuang
    Zhang, Xuezhu
    Qi, Jinyi
    15TH INTERNATIONAL MEETING ON FULLY THREE-DIMENSIONAL IMAGE RECONSTRUCTION IN RADIOLOGY AND NUCLEAR MEDICINE, 2019, 11072
  • [36] Medical Image Classification Using Self-Supervised Learning-Based Masked Autoencoder
    Fan, Zong
    Wang, Zhimin
    Gong, Ping
    Lee, Christine U.
    Tang, Shanshan
    Zhang, Xiaohui
    Hao, Yao
    Zhang, Zhongwei
    Song, Pengfei
    Chen, Shigao
    Li, Hua
    MEDICAL IMAGING 2024: IMAGE PROCESSING, 2024, 12926
  • [37] Self-supervised learning based on StyleGAN for medical image classification on small labeled dataset
    Fan, Zong
    Wang, Zhimin
    Zhang, Chaojie
    Ozbey, Muzaffer
    Villa, Umberto
    Hao, Yao
    Zhang, Zhongwei
    Wang, Xiaowei
    Lia, Hua
    MEDICAL IMAGING 2024: IMAGE PROCESSING, 2024, 12926
  • [38] A Crowdsourcing-based Medical Image Classification Method
    He, Shuning
    Pan, Haiwei
    Zhao, Shengnan
    Chen, Chunling
    Bian, Xiaofei
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 1492 - 1499
  • [39] Adversarial image reconstruction learning framework for medical image retrieval
    Pinapatruni, Rohini
    Bindu, Chigarapalle Shoba
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (05) : 1197 - 1204
  • [40] Adversarial image reconstruction learning framework for medical image retrieval
    Rohini Pinapatruni
    Shoba Bindu Chigarapalle
    Signal, Image and Video Processing, 2022, 16 : 1197 - 1204