Multi-Class Skin Cancer Classification Using Vision Transformer Networks and Convolutional Neural Network-Based Pre-Trained Models

被引:13
|
作者
Arshed, Muhammad Asad [1 ,2 ]
Mumtaz, Shahzad [3 ]
Ibrahim, Muhammad [2 ]
Ahmed, Saeed [1 ]
Tahir, Muhammad [4 ,5 ]
Shafi, Muhammad [6 ]
机构
[1] Univ Management & Technol, Sch Syst & Technol, Lahore 54770, Pakistan
[2] Islamia Univ Bahawalpur, Dept Comp Sci, Bahawalpur 63100, Pakistan
[3] Islamia Univ Bahawalpur, Dept Data Sci, Bahawalpur 63100, Pakistan
[4] Univ Manitoba, Dept Elect & Comp Engn, Winnipeg, MB R3T 5V6, Canada
[5] Abdul Wali Khan Univ, Dept Comp Sci, Mardan 23200, Pakistan
[6] Sohar Univ, Fac Comp & Informat Technol, Sohar 311, Oman
关键词
skin cancer diagnosis; multi-class; vision transformer; pretrained models; fine tuning; transfer learning; data augmentation;
D O I
10.3390/info14070415
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Skin cancer, particularly melanoma, has been recognized as one of the most lethal forms of cancer. Detecting and diagnosing skin lesions accurately can be challenging due to the striking similarities between the various types of skin lesions, such as melanoma and nevi, especially when examining the color images of the skin. However, early diagnosis plays a crucial role in saving lives and reducing the burden on medical resources. Consequently, the development of a robust autonomous system for skin cancer classification becomes imperative. Convolutional neural networks (CNNs) have been widely employed over the past decade to automate cancer diagnosis. Nonetheless, the emergence of the Vision Transformer (ViT) has recently gained a considerable level of popularity in the field and has emerged as a competitive alternative to CNNs. In light of this, the present study proposed an alternative method based on the off-the-shelf ViT for identifying various skin cancer diseases. To evaluate its performance, the proposed method was compared with 11 CNN-based transfer learning methods that have been known to outperform other deep learning techniques that are currently in use. Furthermore, this study addresses the issue of class imbalance within the dataset, a common challenge in skin cancer classification. In addressing this concern, the proposed study leverages the vision transformer and the CNN-based transfer learning models to classify seven distinct types of skin cancers. Through our investigation, we have found that the employment of pre-trained vision transformers achieved an impressive accuracy of 92.14%, surpassing CNN-based transfer learning models across several evaluation metrics for skin cancer diagnosis.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Classification of Freshwater Zooplankton by Pre-trained Convolutional Neural Network in Underwater Microscopy
    Hong, Song
    Mehdi, Syed Raza
    Huang, Hui
    Shahani, Kamran
    Zhang, Yangfang
    Junaidullah
    Raza, Kazim
    Khan, Mushtaq Ali
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (07) : 252 - 258
  • [32] Efficient pollen grain classification using pre-trained Convolutional Neural Networks: a comprehensive study
    Masoud A. Rostami
    Behnaz Balmaki
    Lee A. Dyer
    Julie M. Allen
    Mohamed F. Sallam
    Fabrizio Frontalini
    [J]. Journal of Big Data, 10
  • [33] Classification of Pistachio Varieties Using Pre-trained Architectures and a Proposed Convolutional Neural Network Model
    Idress, Khaled Adil Dawood
    Oztekin, Yesim Benal
    Gadalla, Omsalma Alsadig Adam
    Baitu, Geofrey Prudence
    [J]. 15TH INTERNATIONAL CONGRESS ON AGRICULTURAL MECHANIZATION AND ENERGY IN AGRICULTURE, ANKAGENG 2023, 2024, 458 : 148 - 163
  • [34] Concatenation-based pre-trained convolutional neural networks using attention mechanism for environmental sound classification
    Ashurov, Asadulla
    Yi, Zhou
    Liu, Hongqing
    Yu, Zhao
    Li, Manhai
    [J]. APPLIED ACOUSTICS, 2024, 216
  • [35] Automated Classification of Urinary Cells: Using Convolutional Neural Network Pre-trained on Lung Cells
    Teramoto, Atsushi
    Michiba, Ayano
    Kiriyama, Yuka
    Sakurai, Eiko
    Shiroki, Ryoichi
    Tsukamoto, Tetsuya
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (03):
  • [36] Advancing maxillofacial prosthodontics by using pre-trained convolutional neural networks: Image-based classification of the maxilla
    Ali, Islam E.
    Sumita, Yuka
    Wakabayashi, Noriyuki
    [J]. JOURNAL OF PROSTHODONTICS-IMPLANT ESTHETIC AND RECONSTRUCTIVE DENTISTRY, 2024, 33 (07): : 645 - 654
  • [37] ConvTimeNet: A Pre-trained Deep Convolutional Neural Network for Time Series Classification
    Kashiparekh, Kathan
    Narwariya, Jyoti
    Malhotra, Pankaj
    Vig, Lovekesh
    Shroff, Gautam
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [38] Layered Convolutional Neural Networks for Multi-Class Image Classification
    Kasinets, Dzmitry
    Saeed, Amir K.
    Johnson, Benjamin A.
    Rodriguez, Benjamin M.
    [J]. REAL-TIME IMAGE PROCESSING AND DEEP LEARNING 2024, 2024, 13034
  • [39] Age Classification Using Convolutional Neural Networks with the Multi-class Focal Loss
    Liu, Wei
    Chen, Lin
    Chen, Yajun
    [J]. 3RD INTERNATIONAL CONFERENCE ON AUTOMATION, CONTROL AND ROBOTICS ENGINEERING (CACRE 2018), 2018, 428
  • [40] Hippocampus segmentation and classification for dementia analysis using pre-trained neural network models
    Priyanka, Ahana
    Ganesan, Kavitha
    [J]. BIOMEDICAL ENGINEERING-BIOMEDIZINISCHE TECHNIK, 2021, 66 (06): : 581 - 592