Data augmentation using MG-GAN for improved cancer classification on gene expression data

被引:0
|
作者
Poonam Chaudhari
Himanshu Agrawal
Ketan Kotecha
机构
[1] Gokhale Education Society’s R. H. Sapat College of Engineering,
[2] Management Studies and Research,undefined
[3] Symbiosis Institute of Technology,undefined
来源
Soft Computing | 2020年 / 24卷
关键词
Data augmentation; Generative adversarial network; Gene expression dataset; Cancer detection; Modified generator GAN; Multivariate noise; Gaussian distribution; Latent space; Saddle point;
D O I
暂无
中图分类号
学科分类号
摘要
Molecular biology studies on cancer, using gene expression datasets, have revealed that the datasets have a very small number of samples. Obtaining medical data is difficult and expensive due to privacy constraints. Accuracy of classifiers depends greatly on the quality and quantity of input data. The problem of small sample size or small data size has been addressed by augmentation. Owing to the sensitivity of synthetic data samples for the cancer data classification for gene expression data, this paper is motivated to investigate data augmentation using GAN. GAN is based on the principle of two blocks (generator and discriminator) working in a collaborative yet adversarial way. This paper proposes modified generator GAN (MG-GAN) where the generator is fed with original data and multivariate noise to generate data with Gaussian distribution. As the generated data lie within latent space, we reach saddle point faster. GAN has been widely used in data augmentation for image datasets. As per our understanding, this is the first attempt of using GAN for augmentation on gene expression dataset. The performance merit of proposed MG-GAN was compared with KNN and Basic GAN. As compared to KNN and GAN, MG-GAN improves classification accuracy by 18.8% and 11.9%, respectively. The loss value of the error function for MG-GAN is drastically reduced, from 0.6978 to 0.0082, ensuring sensitivity of the generated data. Improved classification accuracy and reduction in the loss value make our improved MG-GAN method better suited for critical applications with sensitive data.
引用
收藏
页码:11381 / 11391
页数:10
相关论文
共 50 条
  • [22] Improved Cancer Classification Using Patient-Specific Biological Pathway Information Via Gene Expression Data
    Young, M.
    Craft, D.
    MEDICAL PHYSICS, 2016, 43 (06) : 3704 - 3705
  • [23] An improved FMM neural network for classification of gene expression data
    Juan, Liu
    Fei, Luo
    Yongqiong, Zhu
    FUZZY INFORMATION AND ENGINEERING, PROCEEDINGS, 2007, 40 : 65 - +
  • [24] Classification using functional data analysis for temporal gene expression data
    Leng, XY
    Müller, HG
    BIOINFORMATICS, 2006, 22 (01) : 68 - 76
  • [25] Classification of Microarray Gene Expression Data using Associative Classification
    Alagukumar, S.
    Lawrance, R.
    2016 INTERNATIONAL CONFERENCE ON COMPUTING TECHNOLOGIES AND INTELLIGENT DATA ENGINEERING (ICCTIDE'16), 2016,
  • [26] A genetic filter for cancer classification on gene expression data
    Kim, Yong-Hyuk
    Yoon, Yourim
    BIO-MEDICAL MATERIALS AND ENGINEERING, 2015, 26 : S1993 - S2002
  • [27] Transfer-GAN: data augmentation using a fine-tuned GAN for sperm morphology classification
    Abbasi, Amir
    Bahrami, Sepideh
    Hemmati, Tahere
    Mirroshandel, Seyed Abolghasem
    COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2023, 11 (06): : 2440 - 2456
  • [28] Feature Selection and Classification in gene expression cancer data
    Pavithra, D.
    Lakshmanan, B.
    2017 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN DATA SCIENCE (ICCIDS), 2017,
  • [29] Classification of Cancer Types based on Gene Expression Data
    He, Yinchao
    Bockmon, Ryan
    Modey, Miracle
    Roscoe, Sarah
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 2175 - 2182
  • [30] Marker identification and classification of cancer types using gene expression data and SIMCA
    Bicciato, S
    Luchini, A
    Di Bello, C
    METHODS OF INFORMATION IN MEDICINE, 2004, 43 (01) : 4 - 8