Efficient Data Augmentation Techniques for Improved Classification in Limited Data Set of Oral Squamous Cell Carcinoma

被引:2
|
作者
Alosaimi, Wael [1 ]
Uddin, M. Irfan [2 ]
机构
[1] Taif Univ, Coll Comp & Informat Technol, Dept Informat Technol, At Taif 21944, Saudi Arabia
[2] Kohat Univ Sci & Technol, Inst Comp, Kohat 26000, Pakistan
来源
关键词
Data science; deep learning; data augmentation; classification; data manipulation; GENERATIVE ADVERSARIAL NETWORKS; SEGMENTATION;
D O I
10.32604/cmes.2022.018433
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Deep Learning (DL) techniques as a subfield of data science are getting overwhelming attention mainly because of their ability to understand the underlying pattern of data in making classifications. These techniques require a considerable amount of data to efficiently train the DL models. Generally, when the data size is larger, the DL models perform better. However, it is not possible to have a considerable amount of data in different domains such as healthcare. In healthcare, it is impossible to have a substantial amount of data to solve medical problems using Artificial Intelligence, mainly due to ethical issues and the privacy of patients. To solve this problem of small dataset, different techniques of data augmentation are used that can increase the size of the training set. However, these techniques only change the shape of the image and hence the classification model does not increase accuracy. Generative Adversarial Networks (GANs) are very powerful techniques to augment training data as new samples are created. This technique helps the classification models to increase their accuracy. In this paper, we have investigated augmentation techniques in healthcare image classification. The objective of this research paper is to develop a novel augmentation technique that can increase the size of the training set, to enable deep learning techniques to achieve higher accuracy. We have compared the performance of the image classifiers using the standard augmentation technique and GANs. Our results demonstrate that GANs increase the training data, and eventually, the classifier achieves an accuracy of 90% compared to standard data augmentation techniques, which achieve an accuracy of up to 70%. Other advanced CNN models are also tested and have demonstrated that more deep architectures can achieve more than 98% accuracy for making classification on Oral Squamous Cell Carcinoma.
引用
收藏
页码:1387 / 1401
页数:15
相关论文
共 50 条
  • [31] Data augmentation using MG-GAN for improved cancer classification on gene expression data
    Poonam Chaudhari
    Himanshu Agrawal
    Ketan Kotecha
    Soft Computing, 2020, 24 : 11381 - 11391
  • [32] An efficient hyperspectral image classification method for limited training data
    Ren, Yitao
    Jin, Peiyang
    Li, Yiyang
    Mao, Keming
    IET IMAGE PROCESSING, 2023, 17 (06) : 1709 - 1717
  • [33] Improved Neural Network Arrhythmia Classification Through Integrated Data Augmentation
    Cayce, Garrett, I
    Depoian, Arthur C., II
    Bailey, Colleen P.
    Guturu, Parthasarathy
    2022 IEEE METROCON, 2022, : 10 - 12
  • [34] SYNTHETIC DATA AUGMENTATION USING GAN FOR IMPROVED LIVER LESION CLASSIFICATION
    Frid-Adar, Maayan
    Klang, Eyal
    Amitai, Michal
    Goldberger, Jacob
    Greenspan, Hayit
    2018 IEEE 15TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2018), 2018, : 289 - 293
  • [35] Data augmentation for cancer classification in oncogenomics: an improved KNN based approach
    Poonam Chaudhari
    Himanshu Agarwal
    Vikrant Bhateja
    Evolutionary Intelligence, 2021, 14 : 489 - 498
  • [36] Data augmentation for cancer classification in oncogenomics: an improved KNN based approach
    Chaudhari, Poonam
    Agarwal, Himanshu
    Bhateja, Vikrant
    EVOLUTIONARY INTELLIGENCE, 2021, 14 (02) : 489 - 498
  • [37] Efficient Classification of Imbalanced Natural Disasters Data Using Generative Adversarial Networks for Data Augmentation
    Eltehewy, Rokaya
    Abouelfarag, Ahmed
    Saleh, Sherine Nagy
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2023, 12 (06)
  • [38] Efficient Data Augmentation Techniques for Some Classes of State Space Models
    Tan, Linda S. L.
    STATISTICAL SCIENCE, 2023, 38 (02) : 240 - 261
  • [39] Data Augmentation-Based Enhancement for Efficient Network Traffic Classification
    Shin, Chang-Yui
    Choi, Yang-Seo
    Kim, Myung-Sup
    IEEE ACCESS, 2025, 13 : 6006 - 6028
  • [40] Analysis of Correlation Structure of Data Set for Efficient Pattern Classification
    Goswami, Saptarsi
    Chakrabarti, Amlan
    Chakraborty, Basabi
    2015 IEEE 2ND INTERNATIONAL CONFERENCE ON CYBERNETICS (CYBCONF), 2015, : 24 - 29