Efficient Data Augmentation Techniques for Improved Classification in Limited Data Set of Oral Squamous Cell Carcinoma

Cited by: 2
Authors
Alosaimi, Wael [1 ]
Uddin, M. Irfan [2 ]
Affiliations
[1] Taif Univ, Coll Comp & Informat Technol, Dept Informat Technol, At Taif 21944, Saudi Arabia
[2] Kohat Univ Sci & Technol, Inst Comp, Kohat 26000, Pakistan
Source
Keywords
Data science; deep learning; data augmentation; classification; data manipulation; generative adversarial networks; segmentation
DOI
10.32604/cmes.2022.018433
Chinese Library Classification number
T [Industrial Technology];
Discipline classification code
08;
Abstract
Deep Learning (DL) techniques, a subfield of data science, are receiving considerable attention, mainly because of their ability to learn the underlying patterns in data when making classifications. These techniques require a considerable amount of data to train DL models efficiently. Generally, the larger the dataset, the better the DL models perform. However, a considerable amount of data is not available in some domains, such as healthcare. In healthcare, it is often impossible to collect a substantial amount of data for solving medical problems with Artificial Intelligence, mainly because of ethical issues and patient privacy. To address the problem of small datasets, different data augmentation techniques are used to increase the size of the training set. However, these techniques only change the shape of the image, so the accuracy of the classification model does not increase. Generative Adversarial Networks (GANs) are a very powerful technique for augmenting training data because they create new samples, which helps classification models increase their accuracy. In this paper, we investigate augmentation techniques in healthcare image classification. The objective of this research is to develop a novel augmentation technique that increases the size of the training set and thereby enables deep learning techniques to achieve higher accuracy. We compare the performance of image classifiers trained with the standard augmentation technique and with GANs. Our results demonstrate that GANs increase the training data and that the classifier eventually achieves an accuracy of 90%, compared with up to 70% for standard data augmentation techniques. Other advanced CNN models are also tested and demonstrate that deeper architectures can achieve more than 98% accuracy in classifying Oral Squamous Cell Carcinoma.
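The abstract's remark that standard augmentation "only changes the shape of the image" can be illustrated with a minimal sketch. This is not the authors' code: the function names, the choice of flips and a 90-degree rotation, and the nested-list image representation are all assumptions made for illustration. Each transformed copy contains exactly the same pixel values as the original, only rearranged, which is the limitation that motivates generating genuinely new samples with a GAN.

```python
from typing import List

# Hypothetical representation: a grayscale image as rows of pixel intensities.
Image = List[List[int]]


def flip_horizontal(img: Image) -> Image:
    """Mirror the image left to right."""
    return [row[::-1] for row in img]


def flip_vertical(img: Image) -> Image:
    """Mirror the image top to bottom."""
    return [row[:] for row in img[::-1]]


def rotate_90_clockwise(img: Image) -> Image:
    """Rotate the image by 90 degrees clockwise."""
    return [list(col) for col in zip(*img[::-1])]


def augment(img: Image) -> List[Image]:
    """Return the original image plus three geometric variants."""
    return [
        img,
        flip_horizontal(img),
        flip_vertical(img),
        rotate_90_clockwise(img),
    ]


def pixel_multiset(img: Image) -> List[int]:
    """Sorted list of all pixel values, used to compare image content."""
    return sorted(value for row in img for value in row)
```

Calling `augment` on one image quadruples the number of training samples, yet `pixel_multiset` is identical for every variant, so no new visual content has been introduced.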
Pages: 1387-1401
Number of pages: 15