Efficient Data Augmentation Techniques for Improved Classification in Limited Data Set of Oral Squamous Cell Carcinoma

被引:2
|
作者
Alosaimi, Wael [1 ]
Uddin, M. Irfan [2 ]
机构
[1] Taif Univ, Coll Comp & Informat Technol, Dept Informat Technol, At Taif 21944, Saudi Arabia
[2] Kohat Univ Sci & Technol, Inst Comp, Kohat 26000, Pakistan
来源
关键词
Data science; deep learning; data augmentation; classification; data manipulation; GENERATIVE ADVERSARIAL NETWORKS; SEGMENTATION;
D O I
10.32604/cmes.2022.018433
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Deep Learning (DL) techniques as a subfield of data science are getting overwhelming attention mainly because of their ability to understand the underlying pattern of data in making classifications. These techniques require a considerable amount of data to efficiently train the DL models. Generally, when the data size is larger, the DL models perform better. However, it is not possible to have a considerable amount of data in different domains such as healthcare. In healthcare, it is impossible to have a substantial amount of data to solve medical problems using Artificial Intelligence, mainly due to ethical issues and the privacy of patients. To solve this problem of small dataset, different techniques of data augmentation are used that can increase the size of the training set. However, these techniques only change the shape of the image and hence the classification model does not increase accuracy. Generative Adversarial Networks (GANs) are very powerful techniques to augment training data as new samples are created. This technique helps the classification models to increase their accuracy. In this paper, we have investigated augmentation techniques in healthcare image classification. The objective of this research paper is to develop a novel augmentation technique that can increase the size of the training set, to enable deep learning techniques to achieve higher accuracy. We have compared the performance of the image classifiers using the standard augmentation technique and GANs. Our results demonstrate that GANs increase the training data, and eventually, the classifier achieves an accuracy of 90% compared to standard data augmentation techniques, which achieve an accuracy of up to 70%. Other advanced CNN models are also tested and have demonstrated that more deep architectures can achieve more than 98% accuracy for making classification on Oral Squamous Cell Carcinoma.
引用
收藏
页码:1387 / 1401
页数:15
相关论文
共 50 条
  • [41] Techniques for early diagnosis of oral squamous cell carcinoma: Systematic review
    Carreras-Torras, Claudia
    Gay-Escoda, Cosme
    MEDICINA ORAL PATOLOGIA ORAL Y CIRUGIA BUCAL, 2015, 20 (03): : E305 - E315
  • [42] A Gaussian Data Augmentation Technique on Highly Dimensional, Limited Labeled Data for Multiclass Classification Using Deep Learning
    Rochac, Juan F. Ramirez
    Liang, Lily
    Zhang, Nian
    Oladunni, Timothy
    2019 TENTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2019, : 145 - 151
  • [43] Molecular subtype classification and corresponding markers of oral squamous cell carcinoma
    Yu, X. F.
    Li, Z. S.
    Zhang, K. M.
    Chen, X. B.
    JOURNAL OF BIOLOGICAL REGULATORS AND HOMEOSTATIC AGENTS, 2021, 35 (05): : 1611 - 1624
  • [44] Racial disparities in squamous cell carcinoma of the oral tongue among women: A SEER data analysis
    Joseph, Lindsay J.
    Goodman, Michael
    Higgins, Kristin
    Pilai, Rathi
    Ramalingam, Suresh S.
    Magliocca, Kelly
    Patel, Mihir R.
    El-Deiry, Mark
    Wadsworth, J. Trad
    Owonikoko, Taofeek K.
    Beitler, Jonathan J.
    Khuri, Fadlo R.
    Shin, Dong M.
    Saba, Nabil F.
    ORAL ONCOLOGY, 2015, 51 (06) : 586 - 592
  • [45] The Classification of Oral Squamous Cell Carcinoma (OSCC) by Means of Transfer Learning
    Rauf, Ahmad Ridhauddin Abdul
    Isa, Wan Hasbullah Mohd
    Khairuddin, Ismail Mohd
    Razman, Mohd Azraai Mohd
    Arzmi, Mohd Hafiz
    Majeed, Anwar P. P. Abdul
    ROBOT INTELLIGENCE TECHNOLOGY AND APPLICATIONS 6, 2022, 429 : 386 - 391
  • [46] Spatial subsetting enables integrative modeling of oral squamous cell carcinoma multiplex imaging data
    Einhaus, Jakob
    Gaudilliere, Dyani K.
    Hedou, Julien
    Feyaerts, Dorien
    Ozawa, Michael G.
    Sato, Masaki
    Ganio, Edward A.
    Tsai, Amy S.
    Stelzer, Ina A.
    Bruckman, Karl C.
    Amar, Jonas N.
    Sabayev, Maximilian
    Bonham, Thomas A.
    Gillard, Joshua
    Diop, Maigane
    Cambriel, Amelie
    Mihalic, Zala N.
    Valdez, Tulio
    Liu, Stanley Y.
    Feirrera, Leticia
    Lam, David K.
    Sunwoo, John B.
    Schuerch, Christian M.
    Gaudilliere, Brice
    Han, Xiaoyuan
    ISCIENCE, 2023, 26 (12)
  • [47] Finding the combination of multiple biomarkers to diagnose oral squamous cell carcinoma - A data mining approach
    Barbosa, Rommel
    da Costa, Nattane Luiza
    Alves, Mariana de Sa
    Rodrigues, Nayara de Sa
    Bandeira, Celso Muller
    Alves, Monica Ghislaine Oliveira
    Mendes, Maria Anita
    Alves, Levy Anderson Cesar
    Almeida, Janete Dias
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 143
  • [48] Screening pathogenic genes in oral squamous cell carcinoma based on the mRNA expression microarray data
    Ding, Yang
    Liu, Pengfei
    Zhang, Shengsheng
    Tao, Lin
    Han, Jianmin
    INTERNATIONAL JOURNAL OF MOLECULAR MEDICINE, 2018, 41 (06) : 3597 - 3603
  • [49] Grade as a Prognostic Factor in Oral Squamous Cell Carcinoma: A Population-Based Analysis of the Data
    Thomas, Brian
    Stedman, Margaret
    Davies, Louise
    LARYNGOSCOPE, 2014, 124 (03): : 688 - 694
  • [50] Emerging data on nivolumab for esophageal squamous cell carcinoma
    Hirose, Toshiharu
    Yamamoto, Shun
    Kato, Ken
    EXPERT REVIEW OF GASTROENTEROLOGY & HEPATOLOGY, 2021, 15 (08) : 845 - 854