A Deep Learning-Based Approach for Cervical Cancer Classification Using 3D CNN and Vision Transformer

被引:1
|
作者
Abinaya, K. [1 ]
Sivakumar, B. [1 ]
机构
[1] SRM Inst Sci & Technol, Dept Comp Technol, Chennai, Tamil Nadu, India
来源
关键词
Cervical cancer; Vision Transformer; 3D convolution block; 3D feature pyramid network; Kernel extreme learning machine;
D O I
10.1007/s10278-023-00911-z
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Cervical cancer is a significant health problem worldwide, and early detection and treatment are critical to improving patient outcomes. To address this challenge, a deep learning (DL)-based cervical classification system is proposed using 3D convolutional neural network and Vision Transformer (ViT) module. The proposed model leverages the capability of 3D CNN to extract spatiotemporal features from cervical images and employs the ViT model to capture and learn complex feature representations. The model consists of an input layer that receives cervical images, followed by a 3D convolution block, which extracts features from the images. The feature maps generated are down-sampled using max-pooling block to eliminate redundant information and preserve important features. Four Vision Transformer models are employed to extract efficient feature maps of different levels of abstraction. The output of each Vision Transformer model is an efficient set of feature maps that captures spatiotemporal information at a specific level of abstraction. The feature maps generated by the Vision Transformer models are then supplied into the 3D feature pyramid network (FPN) module for feature concatenation. The 3D squeeze-and-excitation (SE) block is employed to obtain efficient feature maps that recalibrate the feature responses of the network based on the interdependencies between different feature maps, thereby improving the discriminative power of the model. At last, dimension minimization of feature maps is executed using 3D average pooling layer. Its output is then fed into a kernel extreme learning machine (KELM) for classification into one of the five classes. The KELM uses radial basis kernel function (RBF) for mapping features in high-dimensional feature space and classifying the input samples. The superiority of the proposed model is known using simulation results, achieving an accuracy of 98.6%, demonstrating its potential as an effective tool for cervical cancer classification. Also, it can be used as a diagnostic supportive tool to assist medical experts in accurately identifying cervical cancer in patients.
引用
收藏
页码:280 / 296
页数:17
相关论文
共 50 条
  • [31] A survey of deep learning-based 3D shape generation
    Qun-Ce Xu
    Tai-Jiang Mu
    Yong-Liang Yang
    [J]. Computational Visual Media, 2023, 9 : 407 - 442
  • [32] Deep Learning-Based 3D Printer Fault Detection
    Verana, Mark
    Nwakanma, Cosmas Ifeanyi
    Lee, Jae Min
    Kim, Dong Seong
    [J]. 12TH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN 2021), 2021, : 99 - 102
  • [33] Multiplex Detection of Foodborne Pathogens using 3D Nanostructure Swab and Deep Learning-Based Classification of Raman Spectra
    Kang, Hyunju
    Lee, Junhyeong
    Moon, Jeong
    Lee, Taegu
    Kim, Jueun
    Jeong, Yeonwoo
    Lim, Eun-Kyung
    Jung, Juyeon
    Jung, Yongwon
    Lee, Seok Jae
    Lee, Kyoung G.
    Ryu, Seunghwa
    Kang, Taejoon
    [J]. SMALL, 2024, 20 (35)
  • [34] An efficient deep learning approach for arrhythmia classification using 3D temporal SVCG
    Simone, Lorenzo
    Camporeale, Mauro Giuseppe
    Lomonte, Nunzia
    Dimauro, Giovanni
    Gervasi, Vincenzo
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON DIGITAL HEALTH, ICDH, 2023, : 234 - 239
  • [35] Deep Learning of Volumetric 3D CNN for fMRI in Alzheimer's Disease Classification
    Parmar, Harshit S.
    Nutter, Brian
    Long, Rodney
    Antani, Sameer
    Mitra, Sunanda
    [J]. MEDICAL IMAGING 2020: BIOMEDICAL APPLICATIONS IN MOLECULAR, STRUCTURAL, AND FUNCTIONAL IMAGING, 2021, 11317
  • [36] A deep learning-based transformer model for photovoltaic fault forecasting and classification
    Khalil, Ihsan Ullah
    Ul Haq, Azhar
    ul Islam, Naeem
    [J]. ELECTRIC POWER SYSTEMS RESEARCH, 2024, 228
  • [37] Deep Learning-Based Action Recognition Using 3D Skeleton Joints Information
    Tasnim, Nusrat
    Islam, Md. Mahbubul
    Baek, Joong-Hwan
    [J]. INVENTIONS, 2020, 5 (03) : 1 - 15
  • [38] Deep Learning-Based Pathological Diagnosis of Cervical Cancer
    Wang, Shuhao
    Liu, Aijun
    [J]. LABORATORY INVESTIGATION, 2023, 103 (03) : S1326 - S1326
  • [39] Deep learning-based microarray cancer classification and ensemble gene selection approach
    Rezaee, Khosro
    Jeon, Gwanggil
    Khosravi, Mohammad R.
    Attar, Hani H.
    Sabzevari, Alireza
    [J]. IET SYSTEMS BIOLOGY, 2022, 16 (3-4) : 120 - 131
  • [40] A Deep Learning based CNN framework approach for Plankton Classification
    Rawat, Sarthak Singh
    Bisht, Abhishek
    Nijhawan, Rahul
    [J]. 2019 FIFTH INTERNATIONAL CONFERENCE ON IMAGE INFORMATION PROCESSING (ICIIP 2019), 2019, : 268 - 273