A Deep Learning-Based Approach for Cervical Cancer Classification Using 3D CNN and Vision Transformer

被引:1
|
作者
Abinaya, K. [1 ]
Sivakumar, B. [1 ]
机构
[1] SRM Inst Sci & Technol, Dept Comp Technol, Chennai, Tamil Nadu, India
来源
关键词
Cervical cancer; Vision Transformer; 3D convolution block; 3D feature pyramid network; Kernel extreme learning machine;
D O I
10.1007/s10278-023-00911-z
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Cervical cancer is a significant health problem worldwide, and early detection and treatment are critical to improving patient outcomes. To address this challenge, a deep learning (DL)-based cervical classification system is proposed using 3D convolutional neural network and Vision Transformer (ViT) module. The proposed model leverages the capability of 3D CNN to extract spatiotemporal features from cervical images and employs the ViT model to capture and learn complex feature representations. The model consists of an input layer that receives cervical images, followed by a 3D convolution block, which extracts features from the images. The feature maps generated are down-sampled using max-pooling block to eliminate redundant information and preserve important features. Four Vision Transformer models are employed to extract efficient feature maps of different levels of abstraction. The output of each Vision Transformer model is an efficient set of feature maps that captures spatiotemporal information at a specific level of abstraction. The feature maps generated by the Vision Transformer models are then supplied into the 3D feature pyramid network (FPN) module for feature concatenation. The 3D squeeze-and-excitation (SE) block is employed to obtain efficient feature maps that recalibrate the feature responses of the network based on the interdependencies between different feature maps, thereby improving the discriminative power of the model. At last, dimension minimization of feature maps is executed using 3D average pooling layer. Its output is then fed into a kernel extreme learning machine (KELM) for classification into one of the five classes. The KELM uses radial basis kernel function (RBF) for mapping features in high-dimensional feature space and classifying the input samples. The superiority of the proposed model is known using simulation results, achieving an accuracy of 98.6%, demonstrating its potential as an effective tool for cervical cancer classification. Also, it can be used as a diagnostic supportive tool to assist medical experts in accurately identifying cervical cancer in patients.
引用
收藏
页码:280 / 296
页数:17
相关论文
共 50 条
  • [21] A Multitask Learning-Based Vision Transformer for Plant Disease Localization and Classification
    Hemalatha, S.
    Jayachandran, Jai Jaganath Babu
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2024, 17 (01)
  • [22] A Machine Learning Approach to Diagnosing Lung and Colon Cancer Using a Deep Learning-Based Classification Framework
    Masud, Mehedi
    Sikder, Niloy
    Nahid, Abdullah-Al
    Bairagi, Anupam Kumar
    AlZain, Mohammed A.
    [J]. SENSORS, 2021, 21 (03) : 1 - 21
  • [23] A deep learning-based approach for lesion classification in 3D 18F-DCFPyL PSMA PET images of patients with prostate cancer
    Leung, Kevin
    Sadaghiani, Mohammad Salehi
    Dalaie, Pejman
    Tulbah, Rima
    Yin, Yafu
    VandenBerg, Ryan
    Leal, Jeffrey
    Ashrafinia, Saeed
    Gorin, Michael
    Du, Yong
    Rowe, Steven
    Pomper, Martin
    [J]. JOURNAL OF NUCLEAR MEDICINE, 2020, 61
  • [24] A Transfer Learning-Based Deep CNN Approach for Classification and Diagnosis of Acute Lymphocytic Leukemia Cells
    Magpantay, Leo Dominick C.
    Alon, Helcy D.
    Austria, Yolanda D.
    Melegrito, Mark P.
    Fernando, Glenn John O.
    [J]. 2022 INTERNATIONAL CONFERENCE ON DECISION AID SCIENCES AND APPLICATIONS (DASA), 2022, : 280 - 284
  • [25] A deep learning approach to the classification of 3D CAD models
    Fei-wei QIN
    Lu-ye LI
    Shu-ming GAO
    Xiao-ling YANG
    Xiang CHEN
    [J]. Frontiers of Information Technology & Electronic Engineering, 2014, (02) : 91 - 106
  • [26] A deep learning approach to the classification of 3D CAD models
    Qin, Fei-wei
    Li, Lu-ye
    Gao, Shu-ming
    Yang, Xiao-ling
    Chen, Xiang
    [J]. JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE C-COMPUTERS & ELECTRONICS, 2014, 15 (02): : 91 - 106
  • [27] A deep learning approach to the classification of 3D CAD models
    Fei-wei Qin
    Lu-ye Li
    Shu-ming Gao
    Xiao-ling Yang
    Xiang Chen
    [J]. Journal of Zhejiang University SCIENCE C, 2014, 15 : 91 - 106
  • [28] A novel 3D GPR image arrangement for deep learning-based underground object classification
    Kim, Namgyu
    Kim, Sehoon
    An, Yun-Kyu
    Lee, Jong-Jae
    [J]. INTERNATIONAL JOURNAL OF PAVEMENT ENGINEERING, 2021, 22 (06) : 740 - 751
  • [29] A Fusion Deep Learning Model of ResNet and Vision Transformer for 3D CT Images
    Liu, Chiyu
    Sun, Cunjie
    [J]. IEEE ACCESS, 2024, 12 : 93389 - 93397
  • [30] A survey of deep learning-based 3D shape generation
    Xu, Qun-Ce
    Mu, Tai-Jiang
    Yang, Yong-Liang
    [J]. COMPUTATIONAL VISUAL MEDIA, 2023, 9 (03) : 407 - 442