A Deep Learning-Based Approach for Cervical Cancer Classification Using 3D CNN and Vision Transformer

被引:1
|
作者
Abinaya, K. [1 ]
Sivakumar, B. [1 ]
机构
[1] SRM Inst Sci & Technol, Dept Comp Technol, Chennai, Tamil Nadu, India
来源
关键词
Cervical cancer; Vision Transformer; 3D convolution block; 3D feature pyramid network; Kernel extreme learning machine;
D O I
10.1007/s10278-023-00911-z
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Cervical cancer is a significant health problem worldwide, and early detection and treatment are critical to improving patient outcomes. To address this challenge, a deep learning (DL)-based cervical classification system is proposed using 3D convolutional neural network and Vision Transformer (ViT) module. The proposed model leverages the capability of 3D CNN to extract spatiotemporal features from cervical images and employs the ViT model to capture and learn complex feature representations. The model consists of an input layer that receives cervical images, followed by a 3D convolution block, which extracts features from the images. The feature maps generated are down-sampled using max-pooling block to eliminate redundant information and preserve important features. Four Vision Transformer models are employed to extract efficient feature maps of different levels of abstraction. The output of each Vision Transformer model is an efficient set of feature maps that captures spatiotemporal information at a specific level of abstraction. The feature maps generated by the Vision Transformer models are then supplied into the 3D feature pyramid network (FPN) module for feature concatenation. The 3D squeeze-and-excitation (SE) block is employed to obtain efficient feature maps that recalibrate the feature responses of the network based on the interdependencies between different feature maps, thereby improving the discriminative power of the model. At last, dimension minimization of feature maps is executed using 3D average pooling layer. Its output is then fed into a kernel extreme learning machine (KELM) for classification into one of the five classes. The KELM uses radial basis kernel function (RBF) for mapping features in high-dimensional feature space and classifying the input samples. The superiority of the proposed model is known using simulation results, achieving an accuracy of 98.6%, demonstrating its potential as an effective tool for cervical cancer classification. Also, it can be used as a diagnostic supportive tool to assist medical experts in accurately identifying cervical cancer in patients.
引用
收藏
页码:280 / 296
页数:17
相关论文
共 50 条
  • [1] Deep learning-based Cervical Cancer Classification
    Khoulqi, Ichrak
    Idrissi, Najlae
    [J]. 2022 INTERNATIONAL CONFERENCE ON TECHNOLOGY INNOVATIONS FOR HEALTHCARE, ICTIH, 2022, : 30 - 33
  • [2] Satellite Images Analysis and Classification using Deep Learning-based Vision Transformer Model
    Adegun, Adekanmi Adeyinka
    Viriri, Serestina
    [J]. 2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023, 2023, : 1275 - 1279
  • [3] Dose prediction in HDR brachytherapy for cervical cancer using 3D transformer-based deep learning
    Jian, W.
    Zhu, L.
    Zhang, Y.
    Zhang, B.
    Wang, X.
    [J]. RADIOTHERAPY AND ONCOLOGY, 2023, 182 : S408 - S409
  • [4] A deep learning based approach for automated plant disease classification using vision transformer
    Borhani, Yasamin
    Khoramdel, Javad
    Najafi, Esmaeil
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)
  • [5] A deep learning based approach for automated plant disease classification using vision transformer
    Yasamin Borhani
    Javad Khoramdel
    Esmaeil Najafi
    [J]. Scientific Reports, 12
  • [6] Deep learning-based approaches for robust classification of cervical cancer
    Pacal, Ishak
    Kilicarslan, Serhat
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (25): : 18813 - 18828
  • [7] Deep learning-based approaches for robust classification of cervical cancer
    Ishak Pacal
    Serhat Kılıcarslan
    [J]. Neural Computing and Applications, 2023, 35 : 18813 - 18828
  • [8] DeepDoseNet: A Deep Learning-Based Approach for 3D Dose Prediction
    Soomro, M. H.
    Alves, V. Leandro
    Nourzadeh, H.
    Siebers, J.
    [J]. MEDICAL PHYSICS, 2021, 48 (06)
  • [9] Deep learning-based 3D reconstruction: a survey
    Taha Samavati
    Mohsen Soryani
    [J]. Artificial Intelligence Review, 2023, 56 : 9175 - 9219
  • [10] Deep learning-based 3D reconstruction: a survey
    Samavati, Taha
    Soryani, Mohsen
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (09) : 9175 - 9219