Cancer diagnosis using generative adversarial networks based on deep learning from imbalanced data

Cited by: 41
Authors
Xiao, Yawen [1]
Wu, Jun [2]
Lin, Zongli [3]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China
[2] East China Normal Univ, Ctr Bioinformat & Computat Biol, Shanghai 200241, Peoples R China
[3] Univ Virginia, Dept Elect & Comp Engn, Charlottesville, VA 22904 USA
Funding
National Natural Science Foundation of China;
Keywords
Cancer diagnosis; Deep learning; Gene expression data; Imbalanced data; Wasserstein generative adversarial networks; CLASSIFICATION; PREDICTION; BREAST;
DOI
10.1016/j.compbiomed.2021.104540
Chinese Library Classification
Q [Biological Sciences];
Subject classification codes
07 ; 0710 ; 09 ;
Abstract
Background and objective: Cancer is a serious global disease due to its high mortality, and the key to effective treatment is accurate diagnosis. However, limited by sampling difficulty and actual sample size in clinical practice, data imbalance is a common problem in cancer diagnosis, while most conventional classification methods assume balanced data distribution. Therefore, addressing the imbalanced learning problem to improve the predictive performance of cancer diagnosis is significant.
Methods: In the study, we dissect the data imbalance prevalent in cancer gene expression data and present an improved deep learning based Wasserstein generative adversarial network (WGAN) model, which provides a reliable training progress indicator and deeply explores the characteristics of data. The WGAN generates new samples from the minority class and solves the imbalance problem at the data level.
Results: We analyze three publicly available data sets on RNA-seq of three kinds of cancer using the proposed WGAN and compare the results with those from two commonly adopted sampling methods. According to the results, through addressing the data imbalance problem, the balanced data distribution and the expanding sample size increase the prediction accuracy in all three data sets.
Conclusions: Therefore, the proposed WGAN method is superior in solving the imbalanced learning problem of gene expression data, providing significantly better prediction performance in cancer diagnosis.
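The two ideas in the Methods paragraph, a critic-based training progress indicator and balancing at the data level with generated minority samples, can be illustrated with a minimal sketch. This is not the authors' model. The function names, the toy "normal"/"tumor" data, and the noise-jitter generator are all hypothetical, and the jitter generator is only a placeholder for where a trained WGAN generator would plug in. The first function follows the standard WGAN formulation, in which the gap between the critic's mean score on real and on generated samples approximates the Wasserstein distance; whether the paper's improved variant tracks exactly this quantity is an assumption.

```python
import random
from collections import Counter


def wasserstein_estimate(critic, real_batch, fake_batch):
    """Mean critic score on real samples minus mean score on generated ones.

    In a standard WGAN this gap is what the critic maximizes, and it is
    commonly watched as a training progress indicator.
    """
    real_score = sum(critic(x) for x in real_batch) / len(real_batch)
    fake_score = sum(critic(x) for x in fake_batch) / len(fake_batch)
    return real_score - fake_score


def samples_needed(labels):
    """Per class, how many extra samples would match the largest class."""
    counts = Counter(labels)
    target = max(counts.values())
    return {cls: target - n for cls, n in counts.items()}


def balance(features, labels, generate):
    """Append generate(cls, n) samples until every class has equal size."""
    out_x, out_y = list(features), list(labels)
    for cls, n in samples_needed(labels).items():
        if n > 0:
            out_x.extend(generate(cls, n))
            out_y.extend([cls] * n)
    return out_x, out_y


def make_jitter_generator(features, labels, scale=0.05, seed=0):
    """Hypothetical placeholder, NOT a WGAN: resample real rows of the
    requested class and add Gaussian noise to each feature."""
    rng = random.Random(seed)
    by_class = {}
    for row, cls in zip(features, labels):
        by_class.setdefault(cls, []).append(row)

    def generate(cls, n):
        return [[v + rng.gauss(0.0, scale) for v in rng.choice(by_class[cls])]
                for _ in range(n)]

    return generate


# Toy stand-in for an expression matrix: 6 "normal" and 2 "tumor" samples.
X = [[0.1, 0.2, 0.3]] * 6 + [[0.9, 0.8, 0.7], [1.0, 0.9, 0.6]]
y = ["normal"] * 6 + ["tumor"] * 2

X_bal, y_bal = balance(X, y, make_jitter_generator(X, y))
balanced_counts = Counter(y_bal)
```

Passing the generator in as a callable keeps the balancing step independent of how the synthetic samples are produced, so a trained generative model could replace the placeholder without changing `balance`.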
Pages: 10
Related papers
50 in total
  • [41] Lung cancer diagnosis using Hessian adaptive learning optimization in generative adversarial networks
    E. Thirumagal
    K. Saruladha
    Soft Computing, 2023, 27 : 6223 - 6239
  • [42] Data Augmentation for Imbalanced HRRP Recognition Using Deep Convolutional Generative Adversarial Network
    Song, Yiheng
    Li, Yang
    Wang, Yanhua
    Hu, Cheng
    IEEE ACCESS, 2020, 8 : 201686 - 201695
  • [43] Framework for imbalanced fault diagnosis of rolling bearing using autoencoding generative adversarial learning
    Rathore, Maan Singh
    Harsha, S. P.
    JOURNAL OF THE BRAZILIAN SOCIETY OF MECHANICAL SCIENCES AND ENGINEERING, 2023, 45 (01)
  • [44] Framework for imbalanced fault diagnosis of rolling bearing using autoencoding generative adversarial learning
    Maan Singh Rathore
    S. P. Harsha
    Journal of the Brazilian Society of Mechanical Sciences and Engineering, 2023, 45
  • [45] Efficient Classification of Imbalanced Natural Disasters Data Using Generative Adversarial Networks for Data Augmentation
    Eltehewy, Rokaya
    Abouelfarag, Ahmed
    Saleh, Sherine Nagy
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2023, 12 (06)
  • [46] Imbalanced spectral data analysis using data augmentation based on the generative adversarial network
    Chung, Jihoon
    Zhang, Junru
    Saimon, Amirul Islam
    Liu, Yang
    Johnson, Blake N.
    Kong, Zhenyu
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [47] Improved generative adversarial network for vibration-based fault diagnosis with imbalanced data
    Zhao, Bingxi
    Yuan, Qi
    MEASUREMENT, 2021, 169 (169)
  • [48] Imbalanced data fault diagnosis of hydrogen sensors using deep convolutional generative adversarial network with convolutional neural network
    Sun, Yongyi
    Zhao, Tingting
    Zou, Zhihui
    Chen, Yinsheng
    Zhang, Hongquan
    REVIEW OF SCIENTIFIC INSTRUMENTS, 2021, 92 (09)
  • [49] Research on Imbalanced Data Classification Based on Classroom-Like Generative Adversarial Networks
    Lv, Yancheng
    Lin, Lin
    Liu, Jie
    Guo, Hao
    Tong, Changsheng
    NEURAL COMPUTATION, 2022, 34 (04) : 1045 - 1073
  • [50] A clustering and generative adversarial networks-based hybrid approach for imbalanced data classification
    Ding H.
    Cui X.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (06) : 8003 - 8018