Cancer diagnosis using generative adversarial networks based on deep learning from imbalanced data

被引:41
|
作者
Xiao, Yawen [1 ]
Wu, Jun [2 ]
Lin, Zongli [3 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China
[2] East China Normal Univ, Ctr Bioinformat & Computat Biol, Shanghai 200241, Peoples R China
[3] Univ Virginia, Dept Elect & Comp Engn, Charlottesville, VA 22904 USA
基金
中国国家自然科学基金;
关键词
Cancer diagnosis; Deep learning; Gene expression data; Imbalanced data; Wasserstein generative adversarial networks; CLASSIFICATION; PREDICTION; BREAST;
D O I
10.1016/j.compbiomed.2021.104540
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background and objective: Cancer is a serious global disease due to its high mortality, and the key to effective treatment is accurate diagnosis. However, limited by sampling difficulty and actual sample size in clinical practice, data imbalance is a common problem in cancer diagnosis, while most conventional classification methods assume balanced data distribution. Therefore, addressing the imbalanced learning problem to improve the predictive performance of cancer diagnosis is significant. Methods: In the study, we dissect the data imbalance prevalent in cancer gene expression data and present an improved deep learning based Wasserstein generative adversarial network (WGAN) model, which provides a reliable training progress indicator and deeply explores the characteristics of data. The WGAN generates new samples from the minority class and solves the imbalance problem at the data level. Results: We analyze three publicly available data sets on RNA-seq of three kinds of cancer using the proposed WGAN and compare the results with those from two commonly adopted sampling methods. According to the results, through addressing the data imbalance problem, the balanced data distribution and the expanding sample size increase the prediction accuracy in all three data sets. Conclusions: Therefore, the proposed WGAN method is superior in solving the imbalanced learning problem of gene expression data, providing significantly better prediction performance in cancer diagnosis.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Machinery fault diagnosis with imbalanced data using deep generative adversarial networks
    Zhang, Wei
    Li, Xiang
    Jia, Xiao-Dong
    Ma, Hui
    Luo, Zhong
    Li, Xu
    MEASUREMENT, 2020, 152
  • [2] Data Augment in Imbalanced Learning Based on Generative Adversarial Networks
    Zhou, Zhuocheng
    Zhang, Bofeng
    Lv, Ying
    Shi, Tian
    Chang, Furong
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT IV, 2019, 1142 : 21 - 30
  • [3] Effective data generation for imbalanced learning using conditional generative adversarial networks
    Douzas, Georgios
    Bacao, Fernando
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 91 : 464 - 471
  • [4] Data synthesis using deep feature enhanced generative adversarial networks for rolling bearing imbalanced fault diagnosis
    Liu, Shaowei
    Jiang, Hongkai
    Wu, Zhenghong
    Li, Xingqiu
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2022, 163
  • [5] Imbalanced Learning for Fault Diagnosis Problem of Rotating Machinery Based on Generative Adversarial Networks
    Xie, Yuan
    Zhang, Tao
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 6017 - 6022
  • [6] Wind Turbine Fault Diagnosis with Imbalanced SCADA Data Using Generative Adversarial Networks
    Wang, Hong
    Li, Taikun
    Xie, Mingyang
    Tian, Wenfang
    Han, Wei
    ENERGIES, 2025, 18 (05)
  • [7] Research on imbalanced learning based on conditional generative adversarial networks
    Zhao H.-X.
    Shi H.-B.
    Wu J.
    Chen X.
    Kongzhi yu Juece/Control and Decision, 2021, 36 (03): : 619 - 628
  • [8] Learning from class-imbalanced data using misclassification-focusing generative adversarial networks
    Yun, Jaesub
    Lee, Jong-Seok
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 240
  • [9] An imbalanced data learning method for tool breakage detection based on generative adversarial networks
    Shixu Sun
    Xiaofeng Hu
    Yingchao Liu
    Journal of Intelligent Manufacturing, 2022, 33 : 2441 - 2455
  • [10] An imbalanced data learning method for tool breakage detection based on generative adversarial networks
    Sun, Shixu
    Hu, Xiaofeng
    Liu, Yingchao
    JOURNAL OF INTELLIGENT MANUFACTURING, 2022, 33 (08) : 2441 - 2455