Data augmentation using MG-GAN for improved cancer classification on gene expression data

Cited by: 0
Authors
Poonam Chaudhari
Himanshu Agrawal
Ketan Kotecha
Affiliations
[1] Gokhale Education Society’s R. H. Sapat College of Engineering,
[2] Management Studies and Research
[3] Symbiosis Institute of Technology
Source
Soft Computing | 2020 / Vol. 24
Keywords
Data augmentation; Generative adversarial network; Gene expression dataset; Cancer detection; Modified generator GAN; Multivariate noise; Gaussian distribution; Latent space; Saddle point;
DOI
Not available
Abstract
Molecular biology studies on cancer rely on gene expression datasets that typically contain very few samples. Obtaining medical data is difficult and expensive because of privacy constraints, yet classifier accuracy depends heavily on the quality and quantity of the input data. The small-sample-size problem has commonly been addressed by augmentation. Owing to the sensitivity of synthetic data samples in cancer classification on gene expression data, this paper investigates data augmentation using a generative adversarial network (GAN). A GAN consists of two blocks, a generator and a discriminator, that work in a collaborative yet adversarial way. This paper proposes the modified generator GAN (MG-GAN), in which the generator is fed with original data and multivariate noise to generate data with a Gaussian distribution. As the generated data lie within the latent space, the saddle point is reached faster. GANs have been widely used for data augmentation on image datasets; to the authors' knowledge, this is the first attempt to use a GAN for augmentation on a gene expression dataset. The performance of the proposed MG-GAN was compared with KNN and a basic GAN. Compared with KNN and GAN, MG-GAN improves classification accuracy by 18.8% and 11.9%, respectively. The loss value of the error function for MG-GAN is drastically reduced, from 0.6978 to 0.0082, ensuring sensitivity of the generated data. The improved classification accuracy and the reduced loss value make MG-GAN better suited for critical applications with sensitive data.
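For orientation, the adversarial setup and the saddle point mentioned in the abstract can be related to the standard GAN minimax objective. The formula below is the widely used textbook formulation, not necessarily the exact loss used in the paper; the symbols G (generator), D (discriminator), p_data (distribution of the original data) and p_z (noise distribution) are notation assumed here for illustration.

```latex
% Standard GAN value function (assumed notation; not taken from the paper)
\min_{G} \max_{D} V(D, G)
  = \mathbb{E}_{x \sim p_{\mathrm{data}}}\bigl[\log D(x)\bigr]
  + \mathbb{E}_{z \sim p_{z}}\bigl[\log\bigl(1 - D(G(z))\bigr)\bigr]

% At the saddle point the generated distribution p_g matches p_data and
% the discriminator can no longer tell the two apart:
D^{*}(x) = \tfrac{1}{2}, \qquad V(D^{*}, G^{*}) = -\log 4
```

According to the abstract, MG-GAN changes what the generator receives (original data together with multivariate noise) rather than this adversarial principle; how the two inputs are combined is not specified in this record.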
Pages: 11381 - 11391
Number of pages: 10
Related papers
50 results in total
  • [1] Data augmentation using MG-GAN for improved cancer classification on gene expression data
    Chaudhari, Poonam
    Agrawal, Himanshu
    Kotecha, Ketan
    SOFT COMPUTING, 2020, 24 (15) : 11381 - 11391
  • [2] GAN-Based Data Augmentation for Prediction Improvement Using Gene Expression Data in Cancer
    Moreno-Barea, Francisco J.
    Jerez, Jose M.
    Franco, Leonardo
    COMPUTATIONAL SCIENCE - ICCS 2022, PT III, 2022, 13352 : 28 - 42
  • [3] SYNTHETIC DATA AUGMENTATION USING GAN FOR IMPROVED LIVER LESION CLASSIFICATION
    Frid-Adar, Maayan
    Klang, Eyal
    Amitai, Michal
    Goldberger, Jacob
    Greenspan, Hayit
    2018 IEEE 15TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2018), 2018, : 289 - 293
  • [4] Improved Endoscopic Polyp Classification using GAN Generated Synthetic Data Augmentation
    Sasmal, Pradipta
    Bhuyan, M. K.
    Sonowal, Sourav
    Iwahori, Yuji
    Kasugai, Kunio
    PROCEEDINGS OF 2020 IEEE APPLIED SIGNAL PROCESSING CONFERENCE (ASPCON 2020), 2020, : 247 - 251
  • [5] Cancer classification using gene expression data
    Lu, Y
    Han, JW
    INFORMATION SYSTEMS, 2003, 28 (04) : 243 - 268
  • [6] Cancer Classification Using Gene Expression Data
    Sonsare, Pravinkumar
    Mujumdar, Aarya
    Joshi, Pranjali
    Morayya, Nipun
    Hablani, Sachal
    Khergade, Vedant
    SMART TRENDS IN COMPUTING AND COMMUNICATIONS, VOL 1, SMARTCOM 2024, 2024, 945 : 1 - 11
  • [7] Skin Lesion Classification Using GAN based Data Augmentation
    Rashid, Haroon
    Tanveer, M. Asjid
    Khan, Hassan Aqeel
    2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, : 916 - 919
  • [8] AEGAN-Pathifier: a data augmentation method to improve cancer classification for imbalanced gene expression data
    Zhang, Qiaosheng
    Wei, Yalong
    Hou, Jie
    Li, Hongpeng
    Zhong, Zhaoman
    BMC Bioinformatics, 2024, 25 (01)
  • [9] GAN Data Augmentation Methods in Rock Classification
    Zhao, Gaochang
    Cai, Zhao
    Wang, Xin
    Dang, Xiaohu
    APPLIED SCIENCES-BASEL, 2023, 13 (09):
  • [10] Synthetic Data Augmentation Using GAN For Improved Automated Visual Inspection
    Rozanec, Joze M.
    Zajec, Patrik
    Theodoropoulos, Spyros
    Koehorst, Erik
    Fortuna, Blaz
    Mladenic, Dunja
    IFAC PAPERSONLINE, 2023, 56 (02) : 11094 - 11099